Rongchai Wang
Oct 06, 2025 06:01
NVIDIA and IBM collaborate to combine GPU-native Velox with NVIDIA cuDF, enhancing knowledge analytics efficiency on platforms like Presto and Apache Spark.
As data-driven calls for develop, NVIDIA and IBM have partnered to reinforce knowledge analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration goals to ship important efficiency enhancements over conventional CPU-based methods by leveraging the excessive reminiscence bandwidth and thread depend of GPUs, in line with NVIDIA. These enhancements are notably useful for compute-heavy workloads involving a number of joins, complicated aggregations, and string processing.
Velox and cuDF: A Highly effective Mixture
The mixing of NVIDIA cuDF into the Velox execution engine permits for GPU-native question execution on widely-used platforms like Presto and Apache Spark. This open challenge goals to deal with efficiency bottlenecks, enabling real-time insights from large datasets. Velox acts as an middleman, translating question plans from methods like Presto and Spark into executable GPU pipelines powered by cuDF.
Accelerating Presto with GPU Energy
By transferring the complete Presto question plan to GPU, the combination goals to spice up execution velocity considerably. Enhancements to GPU operators resembling TableScan, HashJoin, and HashAggregations in Velox allow end-to-end GPU execution in Presto. Preliminary benchmarks present spectacular runtime reductions, with Presto on NVIDIA GPUs attaining runtimes considerably decrease than CPU counterparts.
Multi-GPU Execution for Enhanced Efficiency
The collaboration introduces a UCX-based Change operator, which helps the complete execution pipeline on GPUs, leveraging excessive bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup permits for substantial efficiency good points, with Presto on GPU showcasing greater than a sixfold speedup in knowledge alternate processes.
Hybrid Execution in Apache Spark
In Apache Spark, the combination with Apache Gluten and cuDF focuses on offloading compute-intensive question phases to GPUs, optimizing useful resource use in hybrid clusters. This technique permits for environment friendly use of GPU sources whereas sustaining CPU availability for different duties, leading to important efficiency enhancements.
Neighborhood Involvement and Future Prospects
The open-source nature of this challenge encourages neighborhood involvement, aiming to drive additional improvements throughout the information processing ecosystem. By implementing reusable GPU operators in Velox, the collaboration seeks to scale back duplication and simplify upkeep whereas accelerating numerous methods.
Picture supply: Shutterstock