psilambda - Slashdot User

Comment Parallel computation libraries (Score 1) 137

by psilambda on Sunday July 18, 2010 @01:15PM (#32943524) Attached to: Why 'Gaming' Chips Are Moving Into the Server Room

If you wish for your computations to be parallel at a level higher than algorithm steps (i.e. you can build libraries upon libraries that are efficient parallel computation throughout the layers of libraries), then neither the CUDA driver or the CUDA runtime API (or OpenCL or DirectCompute) are very good. An example of this for CUDA is that even usage of the Fermi concurrent kernel execution feature is not generally possible using all (or even very many) CUDA kernels in a program by just using the CUDA APIs.

MPI (message passing interface) gives parallel computation at the clustering level and the Kappa Library gives you this at the library component level. If somebody knows about something other than MPI or Kappa that does this and is available for general use, I would be interested to hear about it.

Comment Re:CUDA (Score 2, Informative) 137

by psilambda on Thursday July 15, 2010 @09:24PM (#32922102) Attached to: Why 'Gaming' Chips Are Moving Into the Server Room

Indeed. With Cuda, DirectCompute, and OpenCL, nearly 100% of your code is boilerplate interfacing to the API. There needs to be a language where this stuff is a first-class citizen and not just something provided by an API.

If you use CUDA, OpenCL or DirectComputeX it is--try the Kappa library--it has its own scheduling language that make this much easier. The next version that is about to come out goes much further yet.

Comment Re:A whole new level of parallelism (Score 3, Interesting) 137

by psilambda on Thursday July 15, 2010 @09:19PM (#32922064) Attached to: Why 'Gaming' Chips Are Moving Into the Server Room

The article and everybody else are ignoring one large, valid use of GPUs in the data center--whether you call it business intelligence or OLAP--it needs to be in the data center and it needs some serious number crunching. There is not as much difference between this and scientific number crunching as most people might think. I have been involved in both crunching numbers for financials at a major multinational and had the privilege of being the first to process the first full genome (complete genetic sequence--terabytes of data) for a single individual and actually the genomic analysis was much more integer based than the financials. Based on my experience with both, I created the Kappa library for doing CUDA or OpenMP analysis in a datacenter--whether for business or scientific work.

Slashdot Top Deals