The cuDF interop in the roadmap [1] will be huge for my workloads. XGBoost has t...

reactordev · 2025-11-20T11:17:43 1763637463

Can you explain how it’s faster? GPU memory is just a blob with an address. Is it because the loading algorithms for Vortex align better with XGBoost, or is it just plain uploading to the GPU?

robert3005 · 2025-11-20T15:56:59 1763654219

If you have a GPU-friendly format, you can send compressed data over PCI-E and then decompress it on the GPU. Your overall throughput increases because PCI-E bandwidth is the limiting factor of the overall system.
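
A rough back-of-the-envelope sketch of that argument, assuming transfer and decompression are pipelined so the slower stage sets the rate. The function name and every number below are made up for illustration; none of them are measurements of Vortex, Parquet, or any particular GPU.

```python
def effective_throughput(link_gbps: float, ratio: float, decompress_gbps: float) -> float:
    """Uncompressed GB/s landing in GPU memory for a pipelined transfer.

    link_gbps:        host-to-device link bandwidth in GB/s (assumed)
    ratio:            compression ratio, uncompressed / compressed (assumed)
    decompress_gbps:  GPU decompression rate in uncompressed GB/s (assumed)
    """
    if link_gbps <= 0 or ratio <= 0 or decompress_gbps <= 0:
        raise ValueError("all inputs must be positive")
    # Each compressed byte on the link expands to `ratio` bytes on the GPU,
    # but the GPU cannot emit data faster than it can decompress it.
    return min(link_gbps * ratio, decompress_gbps)


# Hypothetical figures: a 25 GB/s link, 3x compression, 200 GB/s decompression.
uncompressed_path = effective_throughput(25.0, 1.0, 200.0)
compressed_path = effective_throughput(25.0, 3.0, 200.0)
print(f"send raw:        {uncompressed_path} GB/s")
print(f"send compressed: {compressed_path} GB/s")
```

With these made-up inputs the link stays the bottleneck in both cases, so the gain is just the compression ratio. Once the ratio gets high enough, the decompression rate becomes the cap instead.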

reactordev · 2025-11-20T17:03:57 1763658237

That doesn’t explain how Vortex is faster. Yes, you should send compressed data to the GPU and let it decompress. You should maximize your PCI-E throughput to minimize latency in execution, but what does Vortex bring, other than "Parquet bad, Vortex good"?

kipukun · 2025-11-21T02:37:00 1763692620

XGBoost is just faster on the GPU, regardless of the file format. A sibling post also pointed out compression helping out on bandwidth.