Benchmarking the speed of random number generation in C++11 with GCC and with VSL and ICC.
Posts tagged: C++
Random number generation with C++11 and VSL
Using Cython with Pypy and Numpy
Load Cython modules from Pypy while also using Numpy.
SDPA with different compilers and linear algebra libraries
GCC, ICC, PGI compilers with BLAS/LAPACK, MKL, and ACML are compared in solving an SDP with SDPA.
Training emergent self-organizing maps on sparse data with Somoclu
Self-organizing maps are computationally expensive to train -- emergent maps are even more so. This post looks at the constraints with sparse data.
CuBlas matrix multiplication with C-style arrays
Getting around Fortran-style array indexing in CuBlas from C code without transponation. Bonus Thrust vector casting added.
Summing the entries of a matrix using a stencil with Thrust
Thrust-based summing of the elements of a submatrix at a given offset according to a stencil.
Argmin on the rows of a matrix with Thrust
A detailed description of how to use Thrust reduce by key to calculate the argmins of the rows of a matrix