Crypto Flexs

StripedHyena-7B: Next-generation AI architecture for improved performance and efficiency

By Crypto Flexs | January 4, 2024 | 2 Mins Read

Recent advances in AI have been heavily shaped by the Transformer architecture, a key component of large models in fields as diverse as language, vision, audio, and biology. However, the cost of the Transformer’s attention mechanism grows quadratically with sequence length, which limits its use on long sequences. Even sophisticated models such as GPT-4 are subject to this limitation.
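To make the long-sequence limitation concrete, here is a minimal sketch that counts the query-key scores full self-attention computes at a given sequence length. The function name and the chosen lengths are illustrative assumptions, not taken from any model's code. Causal masking roughly halves the count, but the growth remains quadratic.

```python
# Illustrative only: full self-attention compares every query position
# with every key position, so the score matrix has seq_len * seq_len entries.
def attention_score_entries(seq_len: int, n_heads: int = 1) -> int:
    """Number of query-key scores computed by full self-attention."""
    return n_heads * seq_len * seq_len


if __name__ == "__main__":
    base = attention_score_entries(32_000)
    for seq_len in (32_000, 64_000, 128_000):
        entries = attention_score_entries(seq_len)
        # Doubling the sequence length quadruples the number of scores.
        print(f"{seq_len:>7} tokens: {entries:.3e} scores per head "
              f"({entries // base}x the 32k count)")
```

Going from 32k to 128k tokens multiplies the sequence length by 4 but the number of scores by 16, which is why long contexts are costly for attention-only models.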

Breakthrough Advances with StripedHyena

To address these issues, Together Research recently open-sourced StripedHyena, a language model built on a new architecture optimized for long contexts. StripedHyena can handle up to 128,000 tokens and has shown improvements over the Transformer architecture in both training and inference performance. It is described as the first alternative model to match the best open-source Transformer models on both short and long contexts.

StripedHyena’s Hybrid Architecture

StripedHyena uses a hybrid architecture that combines multi-head, grouped-query attention with gated convolutions arranged in Hyena blocks. This design differs from traditional decoder-only Transformer models. Within the Hyena blocks, each convolution can be represented as a state-space model or as a truncated filter, which allows decoding with constant memory. The resulting architecture offers lower latency, faster decoding, and higher throughput than Transformers.
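The state-space representation mentioned above can be illustrated with a toy example. The sketch below is not StripedHyena's implementation: it assumes a small diagonal state-space model with hypothetical parameters `a`, `b`, and `c`, and checks that a step-by-step recurrence with a fixed-size state produces the same outputs as a causal convolution with the equivalent long filter.

```python
# Toy diagonal state-space model (all parameter values are made up):
#   state_t = a * state_{t-1} + b * u_t      (elementwise)
#   y_t     = sum_i c[i] * state_t[i]
# Its equivalent convolution filter is h[k] = sum_i c[i] * a[i]**k * b[i].

def ssm_filter(a, b, c, length):
    """Materialize the first `length` taps of the equivalent filter."""
    return [sum(ci * (ai ** k) * bi for ai, bi, ci in zip(a, b, c))
            for k in range(length)]


def causal_conv(h, u):
    """Direct causal convolution: y_t = sum_{k=0..t} h[k] * u[t-k]."""
    return [sum(h[k] * u[t - k] for k in range(t + 1)) for t in range(len(u))]


def recurrent_decode(a, b, c, u):
    """Same outputs, one token at a time, keeping only len(a) state values."""
    state = [0.0] * len(a)
    outputs = []
    for u_t in u:
        state = [ai * si + bi * u_t for ai, bi, si in zip(a, b, state)]
        outputs.append(sum(ci * si for ci, si in zip(c, state)))
    return outputs


if __name__ == "__main__":
    a, b, c = [0.9, 0.5], [1.0, 0.5], [0.3, -0.2]
    u = [1.0, 0.0, 2.0, -1.0, 0.5]
    conv_out = causal_conv(ssm_filter(a, b, c, len(u)), u)
    rec_out = recurrent_decode(a, b, c, u)
    print(all(abs(p - q) < 1e-9 for p, q in zip(conv_out, rec_out)))
```

The convolution form needs the whole input history at every step, while the recurrent form keeps only the state vector, so its memory does not grow with the number of generated tokens.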

Improved training and inference efficiency

StripedHyena speeds up end-to-end training by more than 30%, 50%, and 100% relative to conventional Transformers on 32k, 64k, and 128k token sequences, respectively. In terms of memory efficiency, it reduces memory usage during autoregressive generation by over 50% compared to Transformers.
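One way to see where memory goes during autoregressive generation is to estimate the key-value (KV) cache that attention layers keep for every past token. The sketch below uses hypothetical model dimensions (32 layers, 8 KV heads, head dimension 128, 16-bit values) and an assumed hybrid in which half of the layers use attention. None of these figures are StripedHyena's published configuration, and the small fixed-size state of the convolutional layers is ignored.

```python
# Illustrative estimate of KV-cache size; all model dimensions are assumptions.
def kv_cache_bytes(n_attn_layers: int, n_kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_value: int = 2) -> int:
    """Bytes needed to cache keys and values for one sequence."""
    # Factor 2: one tensor for keys and one for values in each attention layer.
    return 2 * n_attn_layers * n_kv_heads * head_dim * seq_len * bytes_per_value


if __name__ == "__main__":
    seq_len = 131_072  # 128k tokens
    full = kv_cache_bytes(n_attn_layers=32, n_kv_heads=8, head_dim=128,
                          seq_len=seq_len)
    # Hypothetical hybrid: only half of the layers are attention layers.
    hybrid = kv_cache_bytes(n_attn_layers=16, n_kv_heads=8, head_dim=128,
                            seq_len=seq_len)
    print(f"attention-only: {full / 2**30:.1f} GiB")
    print(f"hybrid:         {hybrid / 2**30:.1f} GiB")
```

Because the cache grows linearly with both context length and the number of attention layers, replacing some attention layers with constant-memory layers shrinks it proportionally under these assumptions.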

Performance compared with attention

StripedHyena builds on work that substantially narrows the quality gap with attention at scale: Hyena-style layers reach similar perplexity and downstream performance at lower computational cost, even without being mixed with attention.

Applications beyond language processing

StripedHyena’s versatility extends to image recognition. The researchers tested the approach as a replacement for attention in Vision Transformers (ViT) and reported comparable accuracy on an image classification task using the ImageNet-1k dataset.

StripedHyena represents an important advance in AI architecture, providing a more efficient alternative to Transformer models, especially for long sequences. Its hybrid structure and its improved training and inference performance make it a promising tool for a wide range of applications in language and vision processing.

