pd-gem5: Simulation Infrastructure for Parallel/Distributed Computer Systems
2 Pith papers cite this work. Polarity classification is still indexing.
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Citing papers
- Mass Matrix Assembly on Tensor Cores for Implicit Particle-In-Cell Methods
  Mass matrix assembly for implicit PIC methods can be exactly reformulated cell-by-cell as tensor-core matrix products, delivering up to 3x kernel speedup and 15% end-to-end runtime reduction in ECSIM simulations.
- CXL-ClusterSim: Modeling CXL-based Disaggregated Memory Cluster for Pooling and Sharing using gem5 and SST
  CXL-ClusterSim is a full-system simulation framework combining gem5 and SST to model CXL disaggregated memory for pooling and sharing.