GPULib 1.6 has greatly enhanced linear algebra capabilities: GPU accelerated LAPACK routines provided by MAGMA. See details on the GPULib blog. We provide a low-level interface to over 100 LAPACK routines.
MAGMA is a hybrid code that uses the CPU to do parts of the calculations that are best suited to it. We use Intel MKL to provide the CPU LAPACK.
MAGMA has been difficult to build, but I’m happy to say we have builds for OS X, Linux, and Windows!
Full disclosure: I work for Tech-X Corporation and I am the product manager for GPULib.