I’ve previously posted on the topics of CUDA and Larrabee. I continue to be intrigued by the possibilities that open up as multi-core GPU programming becomes available. For applications that need many threads this should present interesting opportunities. Why bother struggling to run your parallel application in the meager 4 or 8 cores of your CPU when you can offload the work to 32 cores?