Cheap GPU Accelerated Database System Competes With Top TPC-H Scores (nvidia.com)
An anonymous reader writes: Alenka is an open source SQL-lite database system that uses CUDA to offload all of the query processing to NVIDIA GPUs. Newly published results show that an $800 desktop system (with an NVIDIA GTX 580 and one 120GB SSD) outperforms a top-10 ranked $55,000 HP server (two quad-core 3GHz Xeons, 144GB of RAM, and twelve 60GB SSDs, running Microsoft SQL Server 2008) on one query, and offers comparable performance on another.
To achieve these results, the author (Anton) recently updated the code with a new version that includes a faster implementation of database JOIN. The code leverages the Thrust library for fast SORT, SELECT, and SET parallel algorithms. It also uses the CUDPP library to implement a parallel hash JOIN.
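For readers unfamiliar with the two building blocks named above, here is a minimal sequential sketch in Python of a build/probe hash join followed by a sort-based GROUP BY. It is not Alenka's code and does not use Thrust or CUDPP; the function names, table names, and columns are all hypothetical, and the real system would run the equivalent steps in parallel on the GPU.

```python
from itertools import groupby
from operator import itemgetter


def hash_join(build_rows, probe_rows, build_key, probe_key):
    """Build/probe hash join: index one table by key, then scan the other."""
    table = {}
    for row in build_rows:
        # Build phase: bucket every row of the (ideally smaller) table by its key.
        table.setdefault(row[build_key], []).append(row)
    joined = []
    for row in probe_rows:
        # Probe phase: look up each row of the other table; unmatched rows are dropped.
        for match in table.get(row[probe_key], []):
            joined.append({**match, **row})
    return joined


def sort_group_sum(rows, key, value):
    """Sort-based GROUP BY with SUM: sort by key, then reduce each run of equal keys."""
    ordered = sorted(rows, key=itemgetter(key))
    return {k: sum(r[value] for r in grp)
            for k, grp in groupby(ordered, key=itemgetter(key))}


# Hypothetical toy tables, loosely shaped like an orders/line-items pair.
orders = [{"o_id": 1, "priority": "HIGH"}, {"o_id": 2, "priority": "LOW"}]
items = [
    {"o_id": 1, "price": 10.0},
    {"o_id": 1, "price": 5.0},
    {"o_id": 2, "price": 7.5},
    {"o_id": 3, "price": 1.0},  # no matching order, so the join drops it
]

joined = hash_join(orders, items, "o_id", "o_id")
revenue = sort_group_sum(joined, "priority", "price")
print(revenue)
```

The sort-then-reduce shape is chosen here because it maps onto the kind of parallel sort and reduction primitives a GPU library provides; whether Alenka structures its GROUP BY this way is an assumption.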
While the codebase is not a complete implementation of SQL, it can execute several queries from TPC-H (an industry-standard data-warehousing benchmark). For Query 1 (SELECT, GROUP-BY), Alenka processes a 100GB dataset in 9.5 seconds, compared to 42.3 seconds on the HP system, making it roughly 4.5 times faster. For Query 3 (JOIN, GROUP-BY, SORT), Alenka takes 5.3 seconds, compared to 4.3 seconds for the HP system, so it is about one second slower.
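The timings above can be turned into speedup ratios with a few lines of arithmetic. This is only a back-of-the-envelope sketch using the figures quoted in this submission (the two query times and the two system prices); the helper name is hypothetical, and the numbers say nothing about the rest of the TPC-H suite.

```python
def speedup(baseline_seconds, candidate_seconds):
    """Return how many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds


# Query times quoted in the submission: HP server vs. Alenka desktop.
q1 = speedup(42.3, 9.5)  # above 1: the desktop is faster on Query 1
q3 = speedup(4.3, 5.3)   # below 1: the desktop is slower on Query 3

# System prices quoted in the submission.
price_ratio = 55000 / 800

print(f"Query 1 speedup: {q1:.2f}x")
print(f"Query 3 speedup: {q3:.2f}x")
print(f"HP server costs {price_ratio:.2f}x as much")
```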
It will be interesting to see if Alenka can offer similar results for the entire TPC-H benchmark suite, or if other database implementations can be accelerated by GPUs.
The source code for the Alenka system is available on GitHub.