Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • HACKING
  • SLOT
  • CASINO
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • HACKING
  • SLOT
  • CASINO
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»NVIDIA Unveils BigVGAN v2: Pioneering Zero-Shot Waveform Audio Generation
ADOPTION NEWS

NVIDIA Unveils BigVGAN v2: Pioneering Zero-Shot Waveform Audio Generation

By Crypto FlexsSeptember 11, 20243 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA Unveils BigVGAN v2: Pioneering Zero-Shot Waveform Audio Generation
Share
Facebook Twitter LinkedIn Pinterest Email

Jack Anderson
September 6, 2024 11:03

NVIDIA’s BigVGAN v2 sets a new standard for zero-shot waveform audio generation, delivering state-of-the-art quality at up to 3x faster synthesis speeds.





NVIDIA has announced the release of BigVGAN v2, a groundbreaking generative AI model for zero-shot waveform audio generation, according to the NVIDIA Technical Blog. The new model represents a significant improvement in speed and quality, establishing it as the state-of-the-art solution in audio generation AI.

BigVGAN: A universal neural vocoder

BigVGAN is a general-purpose neural vocoder designed to synthesize audio waveforms from Mel spectrograms. The model uses a fully synthetic architecture with multiple upsampling blocks and residual augmented synthesis layers. The main feature is an anti-aliasing multi-periodical composition (AMP) module that is optimized to generate high-frequency and periodic sound waves, reducing artifacts in the process.

Improvements in BigVGAN v2

BigVGAN v2 introduces several improvements over its predecessors.

  • Cutting edge audio quality Across a variety of measurement criteria and audio types.
  • Up to 3x faster synthesis speed Through optimized CUDA kernels.
  • Pre-trained checkpoints For a variety of audio configurations.
  • Supports sampling rates up to 44kHzContains the highest frequencies that humans can hear.

Generate all the sounds in the world

Waveform audio generation is essential to virtual worlds and has been a major focus of research. BigVGAN v2 overcomes previous limitations by providing high-quality audio with improved fine details. Trained using NVIDIA A100 Tensor Core GPUs and a dataset 100x larger than its predecessor, BigVGAN v2 can generate high-quality sound waves in a variety of domains, including speech, environmental sounds, and music.

Reaching the highest frequency sound that the human ear can detect

Previous models were limited to sampling rates between 22kHz and 24kHz. BigVGAN v2 extends this range to 44kHz, capturing the entire human auditory spectrum. This allows the model to reproduce a wide range of soundscapes, from powerful drums to the crisp cymbals of music.

Faster synthesis using custom CUDA kernels

BigVGAN v2 also provides accelerated synthesis speeds, achieving up to 3x faster inference than the original BigVGAN using custom CUDA kernels. These kernels enable audio waveform generation up to 240x faster than real-time on a single NVIDIA A100 GPU.

Audio quality results

BigVGAN v2 demonstrates superior audio quality for speech and general audio compared to previous models, and achieves similar results to Descript Audio Codec at 44kHz sampling rate. This demonstrates the model’s ability to generate high-quality waveforms for a wide range of audio types.

conclusion

NVIDIA’s BigVGAN v2 sets a new standard for audio synthesis, achieving state-of-the-art quality across all audio types and covering the full range of human hearing. The model’s synthesis speed is now up to 3x faster, making it highly efficient for a wide range of audio configurations.

For more details, see the BigVGAN v2 model card on GitHub.

Image source: Shutterstock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

The best Solana depin project to form the future -Part 2

September 8, 2025

Ether Lee (ETH) tests major support for $ 4,453 after the highest rejection.

August 31, 2025

Bitcoin analysts bet on $ 200K after hints of Fed.

August 23, 2025
Add A Comment

Comments are closed.

Recent Posts

BitMine Immersion (BMNR) Announces Crypto And Cash Holdings Of $10.8 Billion, ETH Holdings Exceeding 2.151 Million

September 15, 2025

How SWLMiner Could Help You Get The IPhone 17 Air

September 15, 2025

Metabuses are increasing again -Records in August +13k NFT users

September 15, 2025

Rabby Wallet integrates XRPL EVM chain with peersyst

September 15, 2025

Stop Dreaming About The Lottery. Join H Mining And Start Earning!

September 14, 2025

Web3 EXEC warns that the US dollar Stablecoin end game is not priced.

September 14, 2025

Binance’s new Defi Initiative sparked Rollish Momentum, and BNB hit a new ATH of more than $ 900.

September 13, 2025

Top 5 Crypto PR Agencies to Scale Your Blockchain Project in Europe

September 13, 2025

The price of Etherrium surges beyond $ 4,500. -Main level for monitoring more profits

September 12, 2025

BNBCapital Emerges As Top Immutable DeFi Protocol With 239% Returns And Zero Admin Functions

September 12, 2025

MEXC Enhances Futures Trading With Multi-Asset Margin Mode Across 14 Tokens

September 12, 2025

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

BitMine Immersion (BMNR) Announces Crypto And Cash Holdings Of $10.8 Billion, ETH Holdings Exceeding 2.151 Million

September 15, 2025

How SWLMiner Could Help You Get The IPhone 17 Air

September 15, 2025

Metabuses are increasing again -Records in August +13k NFT users

September 15, 2025
Most Popular

Coinbase will delist wrapped Bitcoin as judge rejects BiT Global request

December 19, 2024

Top 4 accounting firms switch to Ethereum for blockchain-based business contracts

April 17, 2024

Sei Blockchain is now live on the Dune Analytics Platform.

July 24, 2024
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2025 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.