NVIDIA unveils NCCL 2.23 with enhanced features for improved GPU communication

Ted Hirokawa
January 31, 2025 06:38

NVIDIA’s NCCL 2.23 release introduces a new scaling algorithm, accelerated initialization, and a profiler plugin API to optimize inter-GPU and multi-node communication for AI and HPC applications.

The latest release of the NVIDIA Collective Communications Library (NCCL), version 2.23, introduces improvements aimed at optimizing inter-GPU and multi-node communication, which is essential for artificial intelligence (AI) and high-performance computing (HPC) applications. According to NVIDIA, these improvements are designed to increase the efficiency and scalability of parallel computing.

Release highlights and features

The NCCL 2.23 release introduces several major features:

Parallel Aggregated Trees (PAT) algorithm: A new algorithm for ReduceScatter and AllGather operations that provides logarithmic scaling, improving performance for small and medium-sized messages.

Accelerated initialization: Improves initialization performance, including the option to use in-band networking for bootstrap communication and a new ncclCommInitRankScalable API.

Intra-node user buffer registration: Reduces memory subsystem pressure and improves communication overlap, yielding better performance.

New profiler plugin API: Provides API hooks for measuring fine-grained NCCL performance and improving diagnostics.

PAT algorithm and initialization improvements

Inspired by the Bruck algorithm, the PAT algorithm minimizes buffering requirements, enabling efficient communication across networks of various sizes. This improvement is particularly advantageous for large language model training, where pipeline and tensor parallelism are important.
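The article does not describe PAT's internals, so the following is only a minimal sketch of the classic Bruck-style allgather pattern that PAT is said to be inspired by. It is a single-process simulation, not NCCL code, and the function name bruck_allgather is hypothetical. It illustrates why the number of communication rounds grows logarithmically with the number of ranks.

```python
def bruck_allgather(blocks):
    """Simulate a Bruck-style allgather over len(blocks) ranks.

    Returns (result, steps): result[r] is the list rank r ends up with,
    and steps is the number of communication rounds used.
    """
    p = len(blocks)
    # buf[r] holds the blocks rank r has gathered so far, starting with its own.
    buf = [[b] for b in blocks]
    steps = 0
    dist = 1
    while dist < p:
        # Only as many blocks as are still missing need to be transferred.
        count = min(dist, p - dist)
        # Rank r receives from rank (r + dist) % p. Take a snapshot first so
        # every exchange in this round uses the state from before the round.
        incoming = [buf[(r + dist) % p][:count] for r in range(p)]
        for r in range(p):
            buf[r].extend(incoming[r])
        dist *= 2
        steps += 1
    # buf[r][i] originated on rank (r + i) % p; rotate back into rank order.
    result = []
    for r in range(p):
        ordered = [None] * p
        for i, block in enumerate(buf[r]):
            ordered[(r + i) % p] = block
        result.append(ordered)
    return result, steps


if __name__ == "__main__":
    gathered, rounds = bruck_allgather(["a", "b", "c", "d", "e"])
    print(gathered[0], rounds)
```

In this simulation the distance doubles every round, so 5 ranks finish in 3 rounds and 8 ranks also finish in 3, instead of the P - 1 rounds a simple ring would need.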

The ncclCommInitRankScalable API allows multiple unique IDs to be used for scalable initialization, relieving the bottlenecks associated with all-to-one communication patterns in large-scale jobs.
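As a rough illustration of why several unique IDs help, the toy model below counts how many ranks would contact each bootstrap root. It is not NCCL code: the function name bootstrap_load is hypothetical, and the contiguous, near-equal assignment of ranks to roots is an assumption made for this sketch. NCCL's actual mapping may differ.

```python
def bootstrap_load(nranks, n_ids):
    """Count how many ranks would contact each bootstrap root.

    Assumption for illustration only: ranks are split into contiguous,
    near-equal blocks, one block per unique ID.
    """
    if n_ids < 1 or n_ids > nranks:
        raise ValueError("n_ids must be between 1 and nranks")
    load = [0] * n_ids
    for rank in range(nranks):
        # Map each rank to one of the n_ids roots.
        root = rank * n_ids // nranks
        load[root] += 1
    return load


if __name__ == "__main__":
    # One unique ID: every rank contacts the same root.
    print(max(bootstrap_load(1024, 1)))
    # Eight unique IDs: the same ranks are spread over eight roots.
    print(max(bootstrap_load(1024, 8)))
```

Under this assumption, the busiest root in a 1024-rank job handles 1024 connections with one ID and 128 with eight IDs.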

Intra-node user buffer registration

NCCL 2.23 supports intra-node user buffer registration, which optimizes data transfers over NVLink and PCIe. The feature uses user buffers that are registered with NCCL, including buffers registered automatically during CUDA graph capture, to reduce overhead and improve performance.

Profiler plugin API

The new profiler plugin API addresses the growing demand for domain-specific monitoring tools as GPU clusters scale. By enabling profiling of NCCL events, the API helps detect performance anomalies and optimize resource allocation.

Conclusion

By introducing these features, NVIDIA’s NCCL 2.23 promises to improve the performance and scalability of GPU communication, making the library more useful across AI and HPC workloads. For more information about these updates, visit the official NVIDIA blog.

Image source: Shutterstock