Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»NVIDIA Explores RAPIDS cuVS IVF-PQ for Accelerated Vector Search
ADOPTION NEWS

NVIDIA Explores RAPIDS cuVS IVF-PQ for Accelerated Vector Search

By Crypto FlexsJuly 18, 20243 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA Explores RAPIDS cuVS IVF-PQ for Accelerated Vector Search
Share
Facebook Twitter LinkedIn Pinterest Email

Jack Anderson
18 July 2024 20:12

NVIDIA Researches RAPIDS cuVS IVF-PQ Algorithm to Improve Vector Search Performance with Compression and GPU Acceleration





In a detailed blog post, NVIDIA provides insight into the RAPIDS cuVS IVF-PQ algorithm, which aims to accelerate vector search by leveraging GPU technology and advanced compression techniques. This is the first part of a two-part series that follows our previous exploration of the IVF-Flat algorithm.

Introduction to the IVF-PQ Algorithm

In this blog post, we introduce IVF-PQ (Inverted File Index with Product Quantization), an algorithm designed to improve search performance and reduce memory usage by storing data in a compressed form. However, this method comes at the expense of accuracy, a trade-off that we will explore in more detail in the second part of the series.

IVF-PQ builds on the concept of IVF-Flat, which uses an inverted file index to limit search complexity to smaller subsets of data through clustering. Product Quantization (PQ) adds a compression layer by encoding the database vectors, making the process more efficient for large data sets.

Performance Benchmarks

NVIDIA shared benchmarks using the DEEP dataset, which contains 1 billion records and 96 dimensions, and is 360GiB in size. A typical IVF-PQ configuration compresses it to a 54GiB index without much impact on search performance, or as little as 24GiB with a slight slowdown. This compression allows the index to fit into GPU memory.

Comparing with the widely used CPU algorithm HNSW on a 100 million-item subset of the DEEP dataset, we show that cuVS IVF-PQ can significantly accelerate both index building and vector search.

Algorithm Overview

IVF-PQ follows a two-step process: coarse search and precise search. The coarse search is the same as IVF-Flat, but the precise search calculates the distance between the query point and the vector of the explored cluster, but the vector is stored in a compressed format.

This compression is achieved by PQ, which approximates vectors using two-stage quantization. This allows IVF-PQ to fit more data into GPU memory, improving memory bandwidth utilization and accelerating the search process.

Optimization and Performance

NVIDIA has implemented several optimizations in cuVS to ensure that the IVF-PQ algorithm performs efficiently on GPUs. These include:

  • Fuse tasks to reduce output size and optimize memory bandwidth utilization.
  • When possible, store lookup tables (LUTs) in GPU shared memory for faster access.
  • Use custom 8-bit floating point data types in LUTs for faster data conversion.
  • To optimize data transfer, data is aligned in 16-byte chunks.
  • Implement an “early stop” check to avoid unnecessary distance calculations.

NVIDIA benchmarks on a 100 million-size dataset show that IVF-PQ outperforms IVF-Flat, delivering up to 3-4x more queries per second for larger batch sizes.

conclusion

IVF-PQ is a powerful ANN search algorithm that leverages clustering and compression to improve search performance and throughput. The first part of the NVIDIA blog series provides a comprehensive overview of how the algorithm works and its benefits on GPU platforms. For more detailed performance tuning recommendations, NVIDIA encourages readers to explore the second part of the series.

For more information, visit the NVIDIA Technology Blog.

Image source: Shutterstock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Stellar (XLM) Highlights the Superiority of Native Tokenization in Securities

May 6, 2026

Bitcoin is at risk of liquidation of $1.4 billion if BTC rises to $80,000.

April 28, 2026

Polymarket Seeks $400 Million Raise to $15 Billion Valuation: Report

April 20, 2026
Add A Comment

Comments are closed.

Recent Posts

Washington DC Summit As Real Estate Tokenization Enters Its Next Phase

May 15, 2026

Could BNB price fall above $750 if a double bottom pattern forms?

May 15, 2026

MEXC’s First USD1 Event Concludes With Over 160K Participants & $2.4 Billion In Futures Trading Volume

May 15, 2026

Eightco Holdings Inc. Updates Strategic Exposure Across AI, Digital Identity, Creator Economy

May 15, 2026

MapleStory Universe Marks One Year Of Live Ops, Surpasses 150M On-chain Transactions, Entering MSU 2.0 Phase

May 14, 2026

Base58Labs officially launches cryptocurrency arbitrage platform

May 14, 2026

MEXC Confirms Strong Asset Backing In Hacken-Audited May 2026 Proof Of Reserves Report

May 14, 2026

New Tokens Average At 2,341%, TradFi Futures Volume Climbs 55%: MEXC April Report

May 14, 2026

Cloudbet Expands Provably Fair Casino With 21 New Titles And 13 Originals

May 14, 2026

JPMorgan leverages both Ethereum and Solana for separate reasons for its institutional cash stack.

May 14, 2026

Tiny Bermuda chooses Stellar for its entire financial operations.

May 14, 2026

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

Washington DC Summit As Real Estate Tokenization Enters Its Next Phase

May 15, 2026

Could BNB price fall above $750 if a double bottom pattern forms?

May 15, 2026

MEXC’s First USD1 Event Concludes With Over 160K Participants & $2.4 Billion In Futures Trading Volume

May 15, 2026
Most Popular

Play-to-Earn Games Trailblazer Tamadoge Reveals $TAMA V2 On Polygon

January 31, 2024

The Conflux (CFX) v2.4.1 upgrade focuses on RPC and storage improvements.

November 29, 2024

IOTA moves more than 40% after announcing $100 million ecosystem foundation.

November 30, 2023
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2026 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.