Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»NVIDIA surpasses 1,000 TPS/users with llama 4 Maverick and Blackwell GPUS.
ADOPTION NEWS

NVIDIA surpasses 1,000 TPS/users with llama 4 Maverick and Blackwell GPUS.

By Crypto FlexsMay 23, 20253 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA surpasses 1,000 TPS/users with llama 4 Maverick and Blackwell GPUS.
Share
Facebook Twitter LinkedIn Pinterest Email

Lawrence Zenga
May 23, 2025 02:10

NVIDIA uses the BLACKWELL GPUS and LLAMA 4 Maverick to achieve the world’s record reasoning speed of 1,000 TPS/users to set new standards for AI model performance.





NVIDIA has set up a new benchmark with AI performance, breaking LLAMA 4 Maverick Model and Blackwell GPU to break 1,000 tokens (TPS) per user barrier. This achievement has been independently verified by artificial analysis of AI benchmarking service, and significant milestones in the speed of LLM (Lange Language Model) reasoning.

Technology development

This breakthrough has been achieved in a single NVIDIA DGX B200 node equipped with eight NVIDIA BLACKWELL GPUs that can handle more than 1,000 tp per user in LLAMA 4 MAVERICK, an 800 million parameter model. Due to this performance, Blackwell is an optimal hardware for deploying LLAMA 4 to maximize throughput or minimize atmospheric time.

Optimization

NVIDIA has completely utilized the Blackwell GPU by using TensOrt-Llm to implement extensive software optimization. The company also trained a speculative decoding draft model using the EAGLE-3 technology, resulting in a four-fold increase compared to the previous baseline. This improvement maintains response accuracy while improving performance and uses the FP8 data type for gemms and professional mixing to ensure the accuracy that can be compared with BF16 metrics.

The importance of low standby time

In the generated AI application, throughput balance and waiting time are important. In the case of important applications that require quick decision -making, NVIDIA’s BLACKWELL GPU is excellent by minimizing the delay time as shown in the TPS/user record. The function of hardware that handles high throughput and low standby time is ideal for various AI tasks.

CUDA kernel and speculation decoding

NVIDIA optimized the CUDA kernel for the work of Gemms, MoE and stocks to maximize performance by using spatial partitioning and efficient memory data rods. Dumping decoding was used to accelerate the speed of LLM reasoning using a smaller and faster draft model proven by smaller Target LLM. This approach increases significant speed, especially when the prediction of the draft model is correct.

Programming method dependency launch

To further improve performance, NVIDIA has reduced GPU idle time between continuous CUDA kernels using PDL (Programmatic Dependent Lunch). This technique allows you to run the kernel to improve the GPU usage rate and remove the performance interval.

The performance of NVIDIA emphasizes leadership in the field of AI infrastructure and data center technology, setting a new standard for the speed and efficiency of the AI ​​model deployment. Innovation of the Blackwell architecture and software optimization continues to react with possible boundaries of AI performance and guarantee real -time user experience and powerful AI applications.

For more information, visit the NVIDIA official blog.

Image Source: Shutter Stock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Hong Kong regulators have set a sustainable finance roadmap for 2026-2028.

January 30, 2026

ETH has recorded a negative funding rate, but is ETH under $3K discounted?

January 22, 2026

AAVE price prediction: $185-195 recovery target in 2-4 weeks

January 6, 2026
Add A Comment

Comments are closed.

Recent Posts

A sharp drop in spot trading volume triggered a significant Bitcoin correction, with Anchor Mining standing out amidst market turmoil with a stable daily return of $3,656.

February 2, 2026

Brevis and BNB Chain Expand Privacy Infrastructure Partnership –

February 2, 2026

LabGemTraders Launches FairCarats FCAR Utility Vouchers, Private Sales Coming Soon

February 1, 2026

How high can $SHIB go in the next cryptocurrency rally?

January 31, 2026

Onre Tokenized Pool Audit Summary

January 31, 2026

NFT sales drop 38% due to weakening cryptocurrency market

January 31, 2026

The cryptocurrency veteran is back with caricatures, privacy apps, and Gasless L2.

January 30, 2026

Ethereum leverage remains at an all-time high. What happens next?

January 30, 2026

Hong Kong regulators have set a sustainable finance roadmap for 2026-2028.

January 30, 2026

Bybit Unveils 2026 Vision As “The New Financial Platform,” Expanding Beyond Exchange Into Global Financial Infrastructure

January 30, 2026

How to Claim Vault12 Promo Code FALLOUT26 for Android and iOS

January 29, 2026

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

A sharp drop in spot trading volume triggered a significant Bitcoin correction, with Anchor Mining standing out amidst market turmoil with a stable daily return of $3,656.

February 2, 2026

Brevis and BNB Chain Expand Privacy Infrastructure Partnership –

February 2, 2026

LabGemTraders Launches FairCarats FCAR Utility Vouchers, Private Sales Coming Soon

February 1, 2026
Most Popular

Trader names one altcoin as his 2024 ‘AI Bet’ and updates outlook for two additional crypto assets.

May 7, 2024

Encryption stocks were reduced, and IPOs punched in tariffs.

April 5, 2025

MEXC Launches Year-End Golden Era Showdown With 2,000g Gold Bar And BTC From 10 Million USDT Prize Pool

November 26, 2025
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2026 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.