Crypto Flexs
ADOPTION NEWS

TEAL, Introducing Training-Free Activation Sparsity to Improve LLM Efficiency

By Crypto Flexs · September 1, 2024 · 3 Mins Read

Jack Anderson
September 1, 2024 08:34

TEAL offers a training-free approach to activation sparsity that significantly improves the efficiency of large language models (LLMs) with minimal degradation.





TEAL (Training-Free Activation Sparsity in LLMs) has emerged as a groundbreaking approach to improving the efficiency of large language models (LLMs) without additional training. According to together.ai, the method applies magnitude pruning to hidden states throughout the model, achieving 40-50% activation sparsity with minimal degradation. Because fewer weights need to be transferred to on-chip memory, this addresses the memory-bound nature of LLM inference and translates into 1.53-1.8x wall-clock speedups in single-batch decoding.
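As a rough illustration of magnitude pruning on a hidden state, the sketch below zeroes the smallest-magnitude entries of a single vector until a target sparsity is reached. The function name `magnitude_prune` and the sample values are hypothetical, and the per-vector cut-off is an assumption made for simplicity; the article does not say how TEAL chooses its thresholds.

```python
def magnitude_prune(hidden, sparsity):
    """Zero out the `sparsity` fraction of entries with the smallest magnitude.

    Hypothetical helper for illustration only, not TEAL's actual code.
    """
    if not 0.0 <= sparsity < 1.0:
        raise ValueError("sparsity must be in [0, 1)")
    k = int(len(hidden) * sparsity)  # number of entries to drop
    if k == 0:
        return list(hidden)
    # Sort indices by magnitude; the first k are the entries to zero.
    order = sorted(range(len(hidden)), key=lambda i: abs(hidden[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else v for i, v in enumerate(hidden)]


# Made-up hidden state with a few large entries and many small ones.
hidden_state = [0.02, -1.30, 0.45, -0.07, 2.10, 0.01, -0.60, 0.15, -0.03, 0.90]
pruned = magnitude_prune(hidden_state, 0.5)
print(pruned)
```

Half of the entries are set to zero, while the large-magnitude entries that carry most of the signal are left untouched.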

Background

LLMs are known for their enormous size, which makes inference challenging, mainly because of the limited speed at which parameters can be transferred from device memory to registers. Various techniques such as quantization, weight sparsity, and speculative decoding have been developed to address this ‘memory wall’. Activation sparsity, which exploits zero values in hidden states, is a less explored method that avoids transferring unnecessary weight channels during decoding.
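To see why zero activations save memory traffic, consider a matrix-vector product: a column of the weight matrix that is multiplied by a zero activation contributes nothing and never needs to be read. The sketch below is a plain-Python illustration under that assumption, with hypothetical function names and toy values; it is not a GPU kernel.

```python
def dense_matvec(W, x):
    """y = W @ x, touching every column of W."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in W]


def sparse_matvec(W, x):
    """Same product, but skips every column j whose activation x[j] is zero.

    Returns the result and the number of weight columns actually read.
    """
    active = [j for j, x_j in enumerate(x) if x_j != 0.0]
    y = [sum(row[j] * x[j] for j in active) for row in W]
    return y, len(active)


# Toy 3x4 weight matrix and an activation vector with 50% zeros.
W = [[1.0, 2.0, 3.0, 4.0],
     [0.5, -1.0, 0.0, 2.0],
     [2.0, 0.0, -3.0, 1.0]]
x = [0.0, 2.0, 0.0, -1.0]

y_sparse, columns_read = sparse_matvec(W, x)
print(y_sparse, "columns read:", columns_read, "of", len(x))
```

Both functions return the same output, but the sparse version reads only the columns paired with nonzero activations.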

Older models like OPT-175B exhibit high activation sparsity, allowing significant speedups with methods like DejaVu. However, newer models like LLaMA have moved to SwiGLU variants, making these methods difficult to apply. Recent studies have attempted to ‘recover’ models that exhibit activation sparsity, but these approaches require extensive retraining on large datasets.

Motivating Study: Distributional Properties of Activations in LLMs

Studies have shown that the hidden states of LLMs exhibit outliers, are zero-centered, and have similar distribution shapes across layers. Specifically, the states before the MLP and attention blocks are Gaussian-shaped, and the intermediate states are Laplacian-shaped. This suggests that many low-magnitude activations can be pruned with negligible model degradation, a notion also observed in other studies such as CATS.
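If the activations really followed ideal zero-centered distributions, the magnitude cut-off for a target sparsity could be written in closed form. The sketch below assumes an exact Gaussian with standard deviation `sigma` and an exact Laplacian with scale `scale`; both function names are hypothetical, and a real calibration would more likely rely on empirical quantiles than on these idealized formulas.

```python
import math
from statistics import NormalDist


def gaussian_threshold(sigma, sparsity):
    """Cut-off t with P(|x| <= t) = sparsity for x ~ N(0, sigma^2).

    Uses P(|x| <= t) = 2 * Phi(t / sigma) - 1.
    """
    if not 0.0 < sparsity < 1.0:
        raise ValueError("sparsity must be in (0, 1)")
    return sigma * NormalDist().inv_cdf((1.0 + sparsity) / 2.0)


def laplacian_threshold(scale, sparsity):
    """Cut-off t with P(|x| <= t) = sparsity for x ~ Laplace(0, scale).

    Uses P(|x| <= t) = 1 - exp(-t / scale).
    """
    if not 0.0 < sparsity < 1.0:
        raise ValueError("sparsity must be in (0, 1)")
    return -scale * math.log(1.0 - sparsity)


for p in (0.25, 0.40, 0.50):
    print(p, gaussian_threshold(1.0, p), laplacian_threshold(1.0, p))
```

Any activation whose magnitude falls below the returned threshold would be set to zero, which under the stated distributional assumption removes the requested fraction of entries on average.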

TEAL

TEAL sparsifies every tensor in the model, achieving near-zero degradation at 25% sparsity and minimal degradation at 40% sparsity. At 50% sparsity, Llama-3 variants show slightly more degradation than the older Llama-2 and Mistral variants. TEAL outperforms CATS because it sparsifies every tensor and sparsifies through the input, which yields lower error.

Hardware-Aware Speedup

To benchmark real-world speedups, TEAL is integrated with GPT-Fast, achieving speedups of up to 1.53x and 1.8x at 40% and 50% sparsity, respectively. Although the kernel is already faster than cuBLAS at 0% sparsity, there is still room for further optimization.
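For context, the reported figures can be compared with a simple upper bound. The sketch below assumes a deliberately simplified model that is not taken from the source: decoding time is entirely spent transferring weights, and every weight matrix sees the same activation sparsity, so skipping a fraction s of the weights gives at most a 1 / (1 - s) speedup.

```python
def ideal_speedup(sparsity):
    """Upper bound on speedup if runtime scales with weights transferred.

    Simplifying assumption: ignores attention, KV-cache traffic, and
    kernel overheads, so real speedups should land below this bound.
    """
    if not 0.0 <= sparsity < 1.0:
        raise ValueError("sparsity must be in [0, 1)")
    return 1.0 / (1.0 - sparsity)


# Speedups reported in the article, keyed by activation sparsity.
reported = {0.40: 1.53, 0.50: 1.80}

for s, measured in reported.items():
    bound = ideal_speedup(s)
    print(f"{s:.0%} sparsity: ideal {bound:.2f}x, reported {measured:.2f}x, "
          f"{measured / bound:.0%} of ideal")
```

Under this model the reported speedups reach roughly 90% of the idealized bound at both sparsity levels.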

Compatibility with quantization

TEAL is also compatible with quantization, another technique for efficient LLM inference. Combining activation sparsity with quantization opens up a new regime for transferring memory to GPU registers, leading to faster inference.
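A back-of-the-envelope estimate shows how the two techniques compound. The sketch below assumes a hypothetical 8-billion-parameter model and hypothetical bit widths, and it ignores quantization metadata such as scales and zero points; none of these numbers come from the source.

```python
def weight_bytes_per_token(n_params, bits_per_weight, activation_sparsity):
    """Approximate weight bytes read per decoded token.

    Assumes skipped activations remove a matching fraction of weight reads.
    """
    return n_params * bits_per_weight / 8 * (1.0 - activation_sparsity)


N_PARAMS = 8e9  # hypothetical model size, for illustration only

dense_fp16 = weight_bytes_per_token(N_PARAMS, 16, 0.0)
sparse_int4 = weight_bytes_per_token(N_PARAMS, 4, 0.5)

print(f"16-bit, dense:        {dense_fp16 / 1e9:.1f} GB per token")
print(f"4-bit, 50% sparsity:  {sparse_int4 / 1e9:.1f} GB per token")
print(f"reduction:            {dense_fp16 / sparse_int4:.0f}x")
```

Quantization shrinks each weight and activation sparsity reduces how many weights are read, so in this simplified model the savings multiply.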

Application

The most immediate application of TEAL is to accelerate inference in resource-constrained edge settings, especially in single-batch scenarios. It also enables inference providers like Together AI, which hosts over 100 open-source models on large fleets of GPUs, to serve their models more efficiently.


