Crypto Flexs
BLOCKCHAIN NEWS

StripedHyena-7B: Next-generation AI architecture for improved performance and efficiency

By Crypto Flexs | January 4, 2024 | 2 Mins Read

Recent advances in AI have been shaped largely by the Transformer architecture, a key component of large models in fields as diverse as language, vision, audio, and biology. However, the cost of the Transformer's attention mechanism grows quadratically with sequence length, which limits its use on long sequences. Even sophisticated models such as GPT-4 are subject to this limitation.
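To make the scaling concrete, here is a minimal sketch that counts the query-key scores a single full self-attention head computes. The function name and the sequence lengths are illustrative assumptions, not figures from the StripedHyena release.

```python
def attention_score_entries(seq_len: int) -> int:
    """Number of query-key scores computed by one full self-attention head."""
    # Every token attends to every token, so the score matrix is seq_len x seq_len.
    return seq_len * seq_len


# Illustrative lengths only: doubling the sequence quadruples the work.
for n in (1_000, 2_000, 32_000, 128_000):
    print(f"{n:>7} tokens -> {attention_score_entries(n):,} scores per head")
```

Causal masking roughly halves this count, but the growth with sequence length remains quadratic.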

Breakthrough Advances with StripedHyena

To address these issues, Together Research recently open sourced StripedHyena, a language model built on a new architecture optimized for long contexts. StripedHyena can handle up to 128,000 tokens and has demonstrated improvements over the Transformer architecture in both training and inference performance. It is presented as the first alternative model to be competitive with the best open source Transformer models in both short and long contexts.

StripedHyena’s Hybrid Architecture

StripedHyena uses a hybrid architecture that combines multi-head, grouped-query attention with gated convolutions arranged in Hyena blocks. This design differs from traditional decoder-only Transformer models. The convolutions in the Hyena blocks can be represented as state-space models or as truncated filters, which allows them to be decoded with constant memory. As a result, the architecture offers lower latency, faster decoding, and higher throughput than Transformers.
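The idea of decoding a convolution through a state-space model can be illustrated with a small sketch. It assumes a toy diagonal state-space model with hand-picked coefficients `a`, `b`, and `c` (all hypothetical), and it leaves out gating, attention, and learned filters entirely, so it is not Together's implementation. It only shows that a causal convolution over the whole history and a token-by-token recurrence with a fixed-size state produce the same outputs.

```python
from typing import List


def conv_filter(a: List[float], b: List[float], c: List[float], length: int) -> List[float]:
    """Impulse response h[k] = sum_i c[i] * a[i]**k * b[i] of a diagonal SSM."""
    return [sum(ci * ai**k * bi for ai, bi, ci in zip(a, b, c)) for k in range(length)]


def causal_conv(h: List[float], u: List[float]) -> List[float]:
    """Direct causal convolution: each output reads the entire input history."""
    return [sum(h[k] * u[t - k] for k in range(t + 1)) for t in range(len(u))]


def recurrent_decode(a: List[float], b: List[float], c: List[float], u: List[float]) -> List[float]:
    """Same outputs, one token at a time, keeping only a fixed-size state."""
    state = [0.0] * len(a)
    outputs = []
    for u_t in u:
        # State update x_t = a * x_{t-1} + b * u_t, applied per mode.
        state = [ai * si + bi * u_t for ai, si, bi in zip(a, state, b)]
        # Readout y_t = c . x_t
        outputs.append(sum(ci * si for ci, si in zip(c, state)))
    return outputs


# Hypothetical coefficients and inputs, chosen only for illustration.
a, b, c = [0.9, 0.5], [1.0, 0.5], [0.3, -0.2]
u = [1.0, 0.0, 2.0, -1.0, 0.5]

y_conv = causal_conv(conv_filter(a, b, c, len(u)), u)
y_rec = recurrent_decode(a, b, c, u)
print(all(abs(p - q) < 1e-9 for p, q in zip(y_conv, y_rec)))
```

The recurrent form stores only `len(a)` numbers regardless of how many tokens have been generated, which is the sense in which decoding uses constant memory.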

Improved training and memory efficiency

In end-to-end training on sequences of 32k, 64k, and 128k tokens, StripedHyena is more than 30%, 50%, and 100% faster, respectively, than conventional Transformers. In terms of memory efficiency, it reduces memory usage during autoregressive generation by over 50% compared to Transformers.
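One way to see why a hybrid design can save memory during generation is to estimate the key-value cache that attention layers must keep. The sketch below uses the standard cache-size formula with a hypothetical configuration (layer counts, head counts, and 16-bit values are assumptions, not StripedHyena's published settings) and ignores the small fixed-size state of the convolutional blocks.

```python
def kv_cache_bytes(attention_layers: int, kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_value: int = 2) -> int:
    """Bytes needed to cache keys and values for one sequence."""
    # Factor 2 accounts for storing both keys and values in every attention layer.
    return 2 * attention_layers * kv_heads * head_dim * seq_len * bytes_per_value


SEQ_LEN = 128_000  # long-context length mentioned in the article

# Hypothetical models: one with 32 attention layers, one where half of them
# are replaced by blocks that keep only a constant-size state.
all_attention = kv_cache_bytes(32, 8, 128, SEQ_LEN)
hybrid = kv_cache_bytes(16, 8, 128, SEQ_LEN)

print(f"all-attention cache: {all_attention / 2**30:.2f} GiB")
print(f"hybrid cache:        {hybrid / 2**30:.2f} GiB")
```

Because the cache grows linearly with both sequence length and the number of attention layers, removing attention layers shrinks it proportionally under these assumptions.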

Performance compared with attention

The Hyena operators underlying StripedHyena significantly narrow the quality gap with attention at scale, delivering similar perplexity and downstream performance at a lower computational cost, without having to be hybridized with attention.

Applications beyond language processing

StripedHyena's versatility extends to image recognition. The researchers tested the approach as a replacement for attention in Vision Transformers (ViT) and reported comparable accuracy on an image classification task using the ImageNet-1k dataset.

StripedHyena represents an important advancement in AI architecture, providing a more efficient alternative to Transformer models, especially when processing long sequences. Its hybrid structure and its improved training and inference performance make it a promising tool for a wide range of applications in language and vision processing.

