Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»NVIDIA’s RAPIDS cuDF improves pandas performance by 30x on large datasets.
ADOPTION NEWS

NVIDIA’s RAPIDS cuDF improves pandas performance by 30x on large datasets.

By Crypto FlexsAugust 11, 20243 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA’s RAPIDS cuDF improves pandas performance by 30x on large datasets.
Share
Facebook Twitter LinkedIn Pinterest Email

Felix Pinkston
Aug 10, 2024 02:42

NVIDIA launches RAPIDS cuDF unified memory, improving Pandas performance up to 30x on large text-intensive datasets.





NVIDIA has unveiled new features in RAPIDS cuDF that significantly improve the performance of the pandas library when processing large text-intensive datasets. According to the NVIDIA Technical Blog, these improvements will allow data scientists to accelerate their workloads by up to 30x.

RAPIDS cuDF and Panda

RAPIDS is a collection of open source GPU-accelerated data science and AI libraries, and cuDF is a Python GPU DataFrame library designed for loading, combining, aggregating, and filtering data. pandas, a widely used data analysis and manipulation library in Python, has struggled with processing speed and efficiency as dataset sizes have grown, especially on CPU-only systems.

At GTC 2024, NVIDIA announced that RAPIDS cuDF can accelerate pandas by about 150x without any code changes. Google later announced that RAPIDS cuDF will be natively available in Google Colab, making it easier for data scientists to use.

Pushing the limits

User feedback on the initial release of cuDF highlighted some limitations, particularly with regard to the size and type of datasets that could benefit from acceleration.

  • To maximize acceleration, datasets must fit into GPU memory, which limits the data size and complexity of operations that can be performed.
  • Text-heavy data sets face limitations, with the original cuDF release only supporting a maximum of 2.1 billion characters per column.

To address these issues, the latest release of RAPIDS cuDF includes:

  • Up to 30x speedup on larger data sets and more complex workloads with optimized CUDA unified memory.
  • The number of characters in a column has been expanded from 2.1 billion to 2.1 billion rows of tabular text.

Accelerated data processing through unified memory

cuDF relies on CPU fallback to ensure a smooth experience. If memory requirements exceed GPU capacity, cuDF transfers data to CPU memory and uses pandas for processing. However, to avoid frequent CPU fallbacks, the data set should ideally fit into GPU memory.

With CUDA Unified Memory, cuDF can now scale pandas workloads beyond GPU memory. Unified Memory provides a single address space across CPUs and GPUs, enabling virtual memory allocations larger than the available GPU memory and migrating data as needed. This helps maximize performance, but datasets still need to be sized to fit GPU memory for maximum acceleration.

Benchmarks show that using cuDF for data joins on a 10GB dataset using a 16GB memory GPU can achieve up to 30x speedup compared to CPU-only pandas. This is a significant improvement, especially when handling datasets larger than 4GB, which previously faced performance issues due to GPU memory constraints.

Processing large-scale tabular text data

The original cuDF release’s 2.1 billion character per column limit presented challenges for large datasets. With the new release, cuDF can now handle tabular text data of up to 2.1 billion rows, making pandas a viable tool for data preparation in generative AI pipelines.

These improvements will make Pandas code run much faster, especially for text-heavy datasets like product reviews, customer service logs, or datasets with significant location or user ID data.

Get started

All of these features are available in RAPIDS 24.08 and can be downloaded from the RAPIDS Installation Guide. The Unified Memory feature is only supported on Linux-based systems.

Image source: Shutterstock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Bitcoin is at risk of liquidation of $1.4 billion if BTC rises to $80,000.

April 28, 2026

Polymarket Seeks $400 Million Raise to $15 Billion Valuation: Report

April 20, 2026

Ether risks a $1.7K retest as traders fail to overcome a key resistance area.

April 4, 2026
Add A Comment

Comments are closed.

Recent Posts

How to Connect OpenClaw with Binance for Live AI Trading (2026)

April 28, 2026

BitMart X $EAT Trade-to-Feed Competition To Pay Out $4.4M USDT To Traders In May 2026

April 28, 2026

ORBS) Reports Total Holdings Of Approximately $333 Million, Includes OpenAI, Beast Industries, More Than 11,000 ETH And Over 283 Million WLD Tokens

April 28, 2026

Core Scientific moves forward with 1.5GW AI data center campus in Texas

April 28, 2026

AxeCasino To Attend IGB L!VE 2026 Following Front-End Update Focused On Usability And Cross-Device Performance

April 28, 2026

Ondo Finance adds proxy voting for holders of $700 million worth of tokenized shares.

April 28, 2026

Bitcoin is at risk of liquidation of $1.4 billion if BTC rises to $80,000.

April 28, 2026

MBitmine Immersion Technologies Reports ETH Holdings Of 5.078M Tokens, Total Assets At $13.3B

April 28, 2026

Harvey AI opens Dallas office, expands legal AI presence

April 28, 2026

Nexus AiCOS Defines “Proofs Of Behavior” As The On-Chain Credit Standard On Base

April 27, 2026

Digital ledger technology explained: a guide for crypto

April 27, 2026

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

How to Connect OpenClaw with Binance for Live AI Trading (2026)

April 28, 2026

BitMart X $EAT Trade-to-Feed Competition To Pay Out $4.4M USDT To Traders In May 2026

April 28, 2026

ORBS) Reports Total Holdings Of Approximately $333 Million, Includes OpenAI, Beast Industries, More Than 11,000 ETH And Over 283 Million WLD Tokens

April 28, 2026
Most Popular

Bitcoin rewards platform Lolli raises $8 million in Series B funding to expand enterprise offering

December 15, 2023

Dogecoin Price Prediction: DOGE Falls 3% As Roaring Kitty Rally Fades Off, This Red Hot Meme Coin Offers Last Chance to Buy

May 15, 2024

Coinbase and MicroStrategy stocks surged as Bitcoin surpassed $72,000.

April 8, 2024
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2026 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.