Bibliometrics
Skip Table Of Content Section
research-article
Non-overlapping High-accuracy Parallel Closure for Compact Schemes: Application in Multiphysics and Complex Geometry
Article No.: 1, pp 1–28https://doi.org/10.1145/3580005

Compact schemes are often preferred in performing scientific computing for their superior spectral resolution. Error-free parallelization of a compact scheme is a challenging task due to the requirement of additional closures at the inter-processor ...

research-article
Performance Analysis and Optimal Node-aware Communication for Enlarged Conjugate Gradient Methods
Article No.: 2, pp 1–25https://doi.org/10.1145/3580003

Krylov methods are a key way of solving large sparse linear systems of equations but suffer from poor strong scalability on distributed memory machines. This is due to high synchronization costs from large numbers of collective communication calls ...

research-article
Efficient Distributed Matrix-free Multigrid Methods on Locally Refined Meshes for FEM Computations
Article No.: 3, pp 1–38https://doi.org/10.1145/3580314

This work studies three multigrid variants for matrix-free finite-element computations on locally refined meshes: geometric local smoothing, geometric global coarsening (both h-multigrid), and polynomial global coarsening (a variant of p-multigrid). We ...

research-article
Open Access
Tridigpu: A GPU Library for Block Tridiagonal and Banded Linear Equation Systems
Article No.: 4, pp 1–33https://doi.org/10.1145/3580373

In this article, we present a CUDA library with a C API for solving block cyclic tridiagonal and banded systems on one GPU. The library can process block tridiagonal systems with block sizes from 1 × 1 (scalar) to 4 × 4 and banded systems with up to four ...

Subjects

Comments

About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!