NIRD is enhanced with adaptivity in preprocessing, wider partitioning functions, and modified error measurement; an updated heuristic convergence proof and new performance model are given, backed by tests showing fast convergence on many elliptic PDEs.
Node Aware Sparse Matrix-Vector Multiplication
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
The sparse matrix-vector multiply (SpMV) operation is a key computational kernel in many simulations and linear solvers. The large communication requirements associated with a reference implementation of a parallel SpMV result in poor parallel scalability. The cost of communication depends on the physical locations of the send and receive processes: messages injected into the network are more costly than messages sent between processes on the same node. In this paper, a node aware parallel SpMV (NAPSpMV) is introduced to exploit knowledge of the system topology, specifically the node-processor layout, to reduce costs associated with communication. The values of the input vector are redistributed to minimize both the number and the size of messages that are injected into the network during a SpMV, leading to a reduction in communication costs. A variety of computational experiments that highlight the efficiency of this approach are presented.
fields
math.NA 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Advances in Implementation, Theoretical Motivation, and Numerical Results for the Nested Iteration with Range Decomposition Algorithm
NIRD is enhanced with adaptivity in preprocessing, wider partitioning functions, and modified error measurement; an updated heuristic convergence proof and new performance model are given, backed by tests showing fast convergence on many elliptic PDEs.