
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs

Abstract

The conjugate gradient (CG) solver is a prevalent method for solving symmetric positive definite linear systems Ax=b, where effective preconditioners are crucial for fast convergence. Traditional preconditioners rely on prescribed algorithms that offer rigorous theoretical guarantees but limit the ability to exploit optimization from data. Existing learning-based methods often utilize Graph Neural Networks (GNNs) to improve preconditioner quality and speed up construction. However, their reliance on incomplete factorization leads to significant challenges: the associated triangular solve hinders GPU parallelization in practice and introduces long-range dependencies that are difficult for GNNs to model. To address these issues, we propose a learning-based method that generates GPU-friendly preconditioners, specifically using GNNs to construct Sparse Approximate Inverse (SPAI) preconditioners, which avoid triangular solves and require only two matrix-vector products at each CG step. The locality of matrix-vector products is compatible with the local propagation mechanism of GNNs, and the flexibility of GNNs allows our approach to be applied in a wide range of scenarios. Furthermore, we introduce a statistics-based scale-invariant loss function. Its design matches the property of CG that the convergence rate depends on the condition number of A rather than its absolute scale, leading to improved performance of the learned preconditioner. Evaluations on three PDE-derived datasets and one synthetic dataset demonstrate that our method outperforms standard preconditioners (Diagonal, IC, and traditional SPAI) as well as previous learning-based preconditioners on GPUs. We reduce solution time on GPUs by 40%-53% (68%-113% faster), while also achieving better condition numbers and superior generalization. Source code is available at https://github.com/Adversarr/LearningSparsePreconditioner4GPU
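
To make the claimed per-iteration cost concrete, below is a minimal sketch of preconditioned CG in which the preconditioner is applied purely through matrix-vector products. The factored form M = GᵀG (so that one preconditioner application costs exactly the two sparse matvecs the abstract mentions) is an assumption for illustration; the actual parameterization of the learned factor is defined in the paper and the linked repository, and the function name `pcg_with_spai` is hypothetical.

```python
import numpy as np
import scipy.sparse as sp

def pcg_with_spai(A, b, G, tol=1e-8, max_iter=1000):
    """Preconditioned CG for SPD A, with M = G.T @ G approximating A^{-1}.

    Applying M needs only two sparse matvecs and no triangular solve,
    which is what makes SPAI-style preconditioners GPU-friendly.
    """
    x = np.zeros_like(b)
    r = b - A @ x
    z = G.T @ (G @ r)          # two matvecs: the whole preconditioner apply
    p = z.copy()
    rz = r @ z
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rz / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) < tol * np.linalg.norm(b):
            break
        z = G.T @ (G @ r)      # again: just matvecs, fully parallelizable
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x

# Toy usage: 1-D Laplacian with a trivial (identity-like) SPAI factor.
n = 100
A = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(n, n), format="csr")
G = sp.identity(n, format="csr")  # placeholder for a learned factor
x = pcg_with_spai(A, np.ones(n), G)
```

In a GPU implementation the two `@` products map to standard sparse matrix-vector kernels, so the preconditioner apply parallelizes as well as the rest of the CG iteration.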
Authors (6)
Zherui Yang
Zhehao Li
Kangbo Lyu
Yixuan Li
Tao Du
Ligang Liu
Submitted
October 31, 2025
arXiv Category
cs.LG
arXiv PDF

Key Contributions

This paper proposes a novel learning-based method that uses GNNs to construct GPU-friendly Sparse Approximate Inverse (SPAI) preconditioners for conjugate gradient solvers. The approach avoids the computationally expensive triangular solves inherent in incomplete-factorization preconditioners and relies only on matrix-vector products, enabling better GPU parallelization and faster convergence; the sketch below contrasts the two application patterns.
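
A hedged sketch of that contrast, using CPU SciPy routines as stand-ins for the corresponding GPU kernels. The factor names `L` (an incomplete-Cholesky-style factor, here built from a dense Cholesky purely for illustration) and `G` (a SPAI-style factor) are placeholders, not the paper's API:

```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

n = 1000
A = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(n, n), format="csc")
r = np.random.default_rng(0).standard_normal(n)

# IC-style apply: z = (L L^T)^{-1} r needs two triangular solves, whose
# row-by-row dependencies serialize poorly on GPUs.
L = sp.csr_matrix(np.linalg.cholesky(A.toarray()))  # dense stand-in for an IC factor
y = spla.spsolve_triangular(L, r, lower=True)
z_ic = spla.spsolve_triangular(L.T.tocsr(), y, lower=False)

# SPAI-style apply: z = G^T (G r) is two sparse matvecs, each highly parallel.
G = sp.identity(n, format="csr")  # illustrative placeholder for a learned factor
z_spai = G.T @ (G @ r)
```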

Business Value

Significantly speeds up simulations in scientific and engineering domains by accelerating the solution of large linear systems, leading to faster design cycles and reduced computational costs.