This is a poster that was recently presented at the NVIDIA GPU Technology Conference (GTC).
Abstract
———
Using the CUDA platform we have implemented a mixed precision Krylov solver for the Wilson-Dirac matrix for lattice QCD. The matrix-vector product which accounts for the vast majority of the operations runs in excess of 130 Gflops in single precision on [click on link for more...]


