Crypto Flexs
ADOPTION NEWS

NVIDIA’s EMBark revolutionizes training large-scale recommender systems.

By Crypto Flexs | November 25, 2024 | 2 Mins Read

Ted Hisokawa
November 21, 2024 02:40

NVIDIA introduces EMBark, which optimizes the embedding process to power deep learning recommendation models and significantly increases training efficiency for large-scale systems.





In an effort to increase the efficiency of large-scale recommender systems, NVIDIA introduced EMBark, a new approach that aims to optimize the embedding process of deep learning recommendation models. According to NVIDIA, recommender systems play a central role in the Internet industry, and training them efficiently is a critical task for many companies.

Challenges of training recommendation systems

Deep learning recommendation models (DLRMs) often incorporate billions of ID features and require robust training solutions. GPU-based training frameworks such as NVIDIA Merlin HugeCTR and TorchRec have improved DLRM training by using GPU memory to hold large-scale ID feature embeddings. However, as the number of GPUs increases, the communication overhead of the embedding stage becomes a bottleneck, sometimes accounting for more than half of the total training overhead.
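To see why that share matters, a back-of-the-envelope estimate in the style of Amdahl's law helps. The sketch below is not taken from NVIDIA's material; the function name and both parameters are hypothetical, and it assumes communication does not overlap with computation.

```python
def step_speedup(comm_fraction: float, comm_reduction: float) -> float:
    """Estimate the whole-step speedup when only communication gets faster.

    comm_fraction  -- share of step time spent on embedding communication (0..1)
    comm_reduction -- factor by which communication time shrinks (> 0)
    Both names are illustrative assumptions, not EMBark parameters.
    """
    if not 0.0 <= comm_fraction <= 1.0:
        raise ValueError("comm_fraction must be between 0 and 1")
    if comm_reduction <= 0.0:
        raise ValueError("comm_reduction must be positive")
    # The non-communication share is unchanged; the communication share
    # is divided by the reduction factor.
    return 1.0 / ((1.0 - comm_fraction) + comm_fraction / comm_reduction)


if __name__ == "__main__":
    # Communication is half the step and is cut in half.
    print(round(step_speedup(0.5, 2.0), 3))
    # Upper bound: communication is half the step and vanishes entirely.
    print(step_speedup(0.5, float("inf")))
```

Under these assumptions, a workload that spends half of each step on communication cannot gain more than 2x from communication savings alone, which gives a rough sense of scale for speedups in this area.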

EMBark’s innovative approach

EMBark, presented at RecSys 2024, addresses these challenges with a flexible 3D sharding strategy and communication compression techniques, which aim to balance the load during training and reduce embedding communication time. The EMBark system has three core components: embedding clusters, a flexible 3D sharding scheme, and a sharding planner.

Embedding clusters

Embedding clusters promote efficient training by grouping similar features and applying a compression strategy tailored to each group. EMBark categorizes clusters into data-parallel (DP), reduction-based (RB), and unique-based (UB) types, each suited to different training scenarios.
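The article does not spell out how the reduction-based and unique-based strategies compress communication. One plausible reading, offered here only as an assumption, is that a unique-based cluster deduplicates feature IDs before exchanging them, while a reduction-based cluster pools embedding rows locally and sends only the pooled vector. The helper names below are hypothetical and do not come from EMBark.

```python
from typing import List, Sequence, Tuple


def unique_compress(ids: Sequence[int]) -> Tuple[List[int], List[int]]:
    """Deduplicate IDs, keeping first-seen order.

    Returns (unique_ids, inverse) with unique_ids[inverse[i]] == ids[i],
    so only unique_ids needs to be exchanged (assumed UB behaviour).
    """
    position = {}
    unique_ids: List[int] = []
    inverse: List[int] = []
    for feature_id in ids:
        if feature_id not in position:
            position[feature_id] = len(unique_ids)
            unique_ids.append(feature_id)
        inverse.append(position[feature_id])
    return unique_ids, inverse


def reduce_pool(rows: Sequence[Sequence[float]]) -> List[float]:
    """Sum equal-length embedding rows into one pooled vector
    (assumed RB behaviour: send one vector instead of many rows)."""
    return [sum(column) for column in zip(*rows)]


if __name__ == "__main__":
    unique_ids, inverse = unique_compress([7, 3, 7, 7, 3])
    print(unique_ids, inverse)
    print(reduce_pool([[1.0, 2.0], [3.0, 4.0]]))
```

In this reading, deduplication pays off when a few hot IDs dominate a batch, and local pooling pays off when each sample looks up many rows that are summed anyway.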

Flexible 3D sharding method

This innovative scheme allows precise control of workload balancing across GPUs by leveraging 3D tuples to represent each shard. This flexibility addresses imbalance issues found in traditional sharding methods.
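The article does not say what the three tuple elements are. A minimal sketch, assuming the tuple identifies a table, a row partition, and a column partition, could look like the following; the `Shard` type and both helpers are hypothetical illustrations rather than EMBark's API.

```python
from typing import Dict, List, NamedTuple, Tuple

Range = Tuple[int, int]  # half-open [start, stop)


class Shard(NamedTuple):
    table: int      # which embedding table (assumed meaning)
    row_shard: int  # which slice of the table's rows (assumed meaning)
    col_shard: int  # which slice of the embedding dimension (assumed meaning)


def split_range(length: int, parts: int) -> List[Range]:
    """Split [0, length) into `parts` contiguous, near-equal ranges."""
    base, extra = divmod(length, parts)
    bounds: List[Range] = []
    start = 0
    for part in range(parts):
        size = base + (1 if part < extra else 0)
        bounds.append((start, start + size))
        start += size
    return bounds


def shard_table(table: int, num_rows: int, dim: int,
                row_parts: int, col_parts: int) -> Dict[Shard, Tuple[Range, Range]]:
    """Map every shard tuple of one table to its (row range, column range)."""
    rows = split_range(num_rows, row_parts)
    cols = split_range(dim, col_parts)
    return {
        Shard(table, j, k): (rows[j], cols[k])
        for j in range(row_parts)
        for k in range(col_parts)
    }


if __name__ == "__main__":
    for shard, extent in shard_table(0, 10, 8, 2, 2).items():
        print(shard, extent)
```

Splitting along rows and columns independently is what lets a single very large table be spread across several GPUs instead of landing on one of them.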

Sharding planner

The sharding planner uses a greedy search algorithm to determine the optimal sharding strategy for a given hardware and embedding configuration, which in turn speeds up training.
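The article gives no details of the planner's search. As a generic illustration of greedy placement, and not EMBark's actual algorithm, the sketch below uses the classic longest-processing-time heuristic: place the most expensive shard first, always on the GPU with the least load so far. The cost values and names are hypothetical.

```python
import heapq
from typing import Dict, Hashable, List, Tuple


def greedy_place(shard_costs: Dict[Hashable, float],
                 num_gpus: int) -> Tuple[Dict[Hashable, int], List[float]]:
    """Assign shards to GPUs greedily; return (placement, per-GPU load)."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    # Min-heap of (current load, gpu index): the least-loaded GPU pops first.
    heap = [(0.0, gpu) for gpu in range(num_gpus)]
    heapq.heapify(heap)
    placement: Dict[Hashable, int] = {}
    # Visit shards from most to least expensive.
    ordered = sorted(shard_costs.items(), key=lambda item: item[1], reverse=True)
    for shard, cost in ordered:
        load, gpu = heapq.heappop(heap)
        placement[shard] = gpu
        heapq.heappush(heap, (load + cost, gpu))
    loads = [0.0] * num_gpus
    for shard, gpu in placement.items():
        loads[gpu] += shard_costs[shard]
    return placement, loads


if __name__ == "__main__":
    costs = {"a": 5.0, "b": 4.0, "c": 3.0, "d": 2.0}  # made-up shard costs
    placement, loads = greedy_place(costs, num_gpus=2)
    print(placement, loads)
```

A greedy heuristic like this is fast and usually well balanced, but in general it does not guarantee the best possible assignment.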

Performance and evaluation

The efficiency of EMBark was tested on NVIDIA DGX H100 nodes, demonstrating significant improvements in training throughput. Across a variety of DLRM models, EMBark achieves an average 1.5x increase in training speed, with some configurations being up to 1.77x faster than existing methods.

EMBark significantly improves the efficiency of large-scale recommender system models by strengthening the embedding process, setting a new standard for deep learning recommender systems. To get more detailed insight into EMBark’s performance, you can view its research paper.


