A quick comparison of Trotter-Suzuki-MPI, GPELab, and GPUE for simulating the evolution of Bose-Einstein Condensates
Posts tagged: GPU
Benchmarking the speed of random number generation in C++11 with GCC and with VSL and ICC.
Getting around Fortran-style array indexing in CuBlas from C code without transponation. Bonus Thrust vector casting added.
Thrust-based summing of the elements of a submatrix at a given offset according to a stencil.
A detailed description of how to use Thrust reduce by key to calculate the argmins of the rows of a matrix