pith. the verified trust layer for science. sign in

arxiv: 1510.08879 · v1 · pith:6GFEILOGnew · submitted 2015-10-29 · ✦ hep-lat

Accelerating Twisted Mass LQCD with QPhiX

classification ✦ hep-lat
keywords xeonhaswellcodecpusdslashimplementationintellibrary
0
0 comments X p. Extension
Add this Pith Number to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{6GFEILOG}

Prints a linked pith:6GFEILOG badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We present the implementation of twisted mass fermion operators for the QPhiX library. We analyze the performance on the Intel Xeon Phi (Knights Corner) coprocessor as well as on Intel Xeon Haswell CPUs. In particular, we demonstrate that on the Xeon Phi 7120P the Dslash kernel is able to reach 80\% of the theoretical peak bandwidth, while on a Xeon Haswell E5-2630 CPU our generated code for the Dslash operator with AVX2 instructions outperforms the corresponding implementation in the tmLQCD library by a factor of $\sim 5\times$ in single precision. We strong scale the code up to 6.8 (14.1) Tflops in single (half) precision on 64 Xeon Haswell CPUs.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Charmonium radiative transitions to dileptons from lattice QCD: The case of $h_c \to \eta_c \ell^+\ell^-$ and $\chi_{c1} \to J/\psi\,\ell^+\ell^-$

    hep-lat 2026-04 unverdicted novelty 8.0

    First fully dynamical lattice QCD yields Γ(h_c → η_c e⁺e⁻) = 5.45(19) keV (3σ above BESIII) and Γ(χ_c1 → J/ψ e⁺e⁻) = 2.869(90) keV, with continuum-extrapolated results and q² distributions.