Factors Impacting Performance of Multithreaded Sparse Triangular Solve

Wolf, Michael M.; Heroux, Michael A.; Boman, Erik G.

doi:10.1007/978-3-642-19328-6_6

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6449))

Included in the following conference series:

International Conference on High Performance Computing for Computational Science

1578 Accesses

Abstract

As computational science applications grow more parallel with multi-core supercomputers having hundreds of thousands of computational cores, it will become increasingly difficult for solvers to scale. Our approach is to use hybrid MPI/threaded numerical algorithms to solve these systems in order to reduce the number of MPI tasks and increase the parallel efficiency of the algorithm. However, we need efficient threaded numerical kernels to run on the multi-core nodes in order to achieve good parallel efficiency. In this paper, we focus on improving the performance of a multithreaded triangular solver, an important kernel for preconditioning. We analyze three factors that affect the parallel performance of this threaded kernel and obtain good scalability on the multi-core nodes for a range of matrix sizes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

€32.70 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: EUR 29.95; Price includes VAT (France)

eBook: EUR 42.79; Price includes VAT (France)

Softcover Book: EUR 52.74; Price includes VAT (France)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multithreaded Multifrontal Sparse Cholesky Factorization Using Threading Building Blocks

A Trade-off Analysis of the Parallel Hybrid SPIKE Preconditioner in a Unique Multi-core Computer

swParaFEM: a highly efficient parallel finite element solver on Sunway many-core architecture

Article 28 February 2023

References

Lin, P., Shadid, J., Sala, M., Tuminaro, R., Hennigan, G., Hoekstra, R.: Performance of a parallel algebraic multilevel preconditioner for stabilized finite element semiconductor device modeling. Journal of Computational Physics 228(17), 6250–6267 (2009)
Article MathSciNet MATH Google Scholar
Hennigan, G., Hoekstra, R., Castro, J., Fixel, D., Shadid, J.: Simulation of neutron radiation damage in silicon semiconductor devices. Technical Report SAND2007-7157, Sandia National Laboratories (2007)
Google Scholar
Lin, P.T., Shadid, J.N.: Performance of an MPI-only semiconductor device simulator on a quad socket/quad core InfiniBand platform. Technical Report SAND2009-0179, Sandia National Laboratories (2009)
Google Scholar
Li, X.S., Shao, M., Yamazaki, I., Ng, E.G.: Factorization-based sparse solvers and preconditioners. Journal of Physics: Conference Series 180(1), 012015 (2009)
Google Scholar
Saltz, J.H.: Aggregation methods for solving sparse triangular systems on multiprocessors. SIAM Journal on Scientific and Statistical Computing 11(1), 123–144 (1990)
Article MathSciNet MATH Google Scholar
Rothberg, E., Gupta, A.: Parallel iccg on a hierarchical memory multiprocessor – addressing the triangular solve bottleneck. Parallel Computing 18(7), 719–741 (1992)
Article MATH Google Scholar
Mayer, J.: Parallel algorithms for solving linear systems with sparse triangular matrices. Computing 86(4), 291–312 (2009)
Article MathSciNet MATH Google Scholar
Davis, T.A.: The University of Florida Sparse Matrix Collection (1994), Matrices found at http://www.cise.ufl.edu/research/sparse/matrices/

Download references

Author information

Authors and Affiliations

Scalable Algorithms Dept., Sandia National Laboratories, Albuquerque, NM, USA
Michael M. Wolf, Michael A. Heroux & Erik G. Boman

Authors

Michael M. Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Michael A. Heroux
View author publications
You can also search for this author in PubMed Google Scholar
Erik G. Boman
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculdade de Engenharia da, Universidade do Porto, Rua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal
José M. Laginha M. Palma
INP (ENSEEIHT) IRIT, University of Toulouse, rue Charles-Camichel, CEDEX 7, 31071, Toulouse, France
Michel Daydé
Lawrence Berkeley National Laboratory, Berkeley, USA
Osni Marques
Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, s/n, 4200-465, Porto, Portugal
João Correia Lopes

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wolf, M.M., Heroux, M.A., Boman, E.G. (2011). Factors Impacting Performance of Multithreaded Sparse Triangular Solve. In: Palma, J.M.L.M., Daydé, M., Marques, O., Lopes, J.C. (eds) High Performance Computing for Computational Science – VECPAR 2010. VECPAR 2010. Lecture Notes in Computer Science, vol 6449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19328-6_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-19328-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19327-9
Online ISBN: 978-3-642-19328-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Factors Impacting Performance of Multithreaded Sparse Triangular Solve

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Multithreaded Multifrontal Sparse Cholesky Factorization Using Threading Building Blocks

A Trade-off Analysis of the Parallel Hybrid SPIKE Preconditioner in a Unique Multi-core Computer

swParaFEM: a highly efficient parallel finite element solver on Sunway many-core architecture

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Factors Impacting Performance of Multithreaded Sparse Triangular Solve

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Multithreaded Multifrontal Sparse Cholesky Factorization Using Threading Building Blocks

A Trade-off Analysis of the Parallel Hybrid SPIKE Preconditioner in a Unique Multi-core Computer

swParaFEM: a highly efficient parallel finite element solver on Sunway many-core architecture

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation