CM2026:P000073

Beyond Graph: Risk-Aware Learning Method for Elimination Orderings in Sparse Triangular Decomposition

*张济名 (Sino-French Carbon Neutrality Research Center, Ecole Centrale de Pekin/School of General Engineering, Beihang University)
黄博 (LMIB-School of Mathematical Sciences, Beihang University)
牛薇 (Sino-French Carbon Neutrality Research Center, Ecole Centrale de Pekin/School of General Engineering, Beihang University)

Elimination ordering is often the key determinant of runtime and decomposability in sparse triangular decomposition for steady-state polynomial systems, whereas existing graph-guided methods rely primarily on graph legality and therefore fail to capture prefix-dependent algebraic risk. We formulate elimination ordering as a risk-aware sequential decision problem, construct state-wise action supervision by evaluating complete candidate orderings under a unified terminal Maple backend, and train a GRPO-style pairwise ranking policy on Proxy States combining prefix, graph, and algebraic summaries. On a repaired and audited benchmark containing 141 systems in total, with 13 systems reserved exclusively for training-data construction, the held-out comparison shows that the learned policy completes more systems than the baseline method, yields fewer timeouts, and is faster on more systems completed by both methods. These results show that a risk-aware approach provides a stronger overall treatment of elimination ordering and can produce higher-quality complete orderings under a shared algebraic backend.

Elimination ordering is often the key determinant of runtime and decomposability in sparse triangular decomposition for steady-state polynomial systems, whereas existing graph-guided methods rely primarily on graph legality and therefore fail to capture prefix-dependent algebraic risk. We formulate elimination ordering as a risk-aware sequential decision problem, construct state-wise action supervision by evaluating complete candidate orderings under a unified terminal Maple backend, and train a GRPO-style pairwise ranking policy on Proxy States combining prefix, graph, and algebraic summaries. On a repaired and audited benchmark containing 141 systems in total, with 13 systems reserved exclusively for training-data construction, the held-out comparison shows that the learned policy completes more systems than the baseline method, yields fewer timeouts, and is faster on more systems completed by both methods. These results show that a risk-aware approach provides a stronger overall treatment of elimination ordering and can produce higher-quality complete orderings under a shared algebraic backend.

第十六届中国数学会计算机数学大会

Beyond Graph: Risk-Aware Learning Method for Elimination Orderings in Sparse Triangular Decomposition