Crypto Flexs

StripedHyena-7B: Next-generation AI architecture for improved performance and efficiency

By Crypto Flexs | January 4, 2024 | 2 Mins Read

Recent advances in AI have been strongly shaped by the Transformer architecture, a key component of large models in fields as diverse as language, vision, audio, and biology. However, the cost of the Transformer's attention mechanism grows quadratically with sequence length, which limits its use on long sequences. Even sophisticated models such as GPT-4 are subject to this limitation.
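To make the scaling concern concrete, the short Python sketch below counts the pairwise attention scores that one layer of plain (dense) self-attention computes. The function name and the sample lengths are illustrative assumptions, and real systems often use optimizations that this count ignores.

```python
def attention_score_entries(seq_len: int, num_heads: int = 1) -> int:
    """Pairwise scores computed by one dense self-attention layer.

    Every token is scored against every token, so the count grows
    with the square of the sequence length (assumes no sparsity or
    causal-mask savings).
    """
    return num_heads * seq_len * seq_len


# Illustrative sequence lengths only; not taken from any benchmark.
for n in (4_000, 32_000, 128_000):
    print(f"{n:>7} tokens -> {attention_score_entries(n):,} scores per head")
```

Going from 32,000 to 128,000 tokens multiplies the sequence length by 4 but the number of scores by 16, which is why long contexts are costly for attention-only models.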

Breakthrough Advances with StripedHyena

To address these issues, Together Research recently open-sourced StripedHyena, a language model built on a new architecture optimized for long contexts. StripedHyena can handle up to 128,000 tokens and has shown improvements over the Transformer architecture in both training and inference performance. It is the first alternative architecture to match the performance of the best open-source Transformer models on both short and long contexts.

StripedHyena’s Hybrid Architecture

StripedHyena uses a hybrid architecture that combines multi-head, grouped-query attention with gated convolutions arranged in Hyena blocks. This design differs from traditional decoder-only Transformer models. Because the convolutions can be represented as state-space models or as truncated filters, Hyena blocks can decode with constant memory. The architecture offers lower latency, faster decoding, and higher throughput than Transformers.
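The link between a convolution and a state-space model can be sketched with a toy example. The Python below is a minimal illustration under strong assumptions: a single scalar state with hypothetical parameters `a`, `b`, `c`, and no gating. It is not StripedHyena's implementation; it only shows that a long convolution whose filter comes from a linear state-space model can be evaluated one token at a time while storing a single state value.

```python
def ssm_filter(a, b, c, length):
    """Impulse response h[k] = c * a**k * b of a one-state linear SSM."""
    return [c * (a ** k) * b for k in range(length)]


def causal_conv(u, h):
    """Direct causal convolution y[t] = sum_k h[k] * u[t - k].

    Each output needs the full input history up to step t.
    """
    return [sum(h[k] * u[t - k] for k in range(t + 1)) for t in range(len(u))]


def recurrent_decode(u, a, b, c):
    """Same outputs, computed step by step with one stored state value."""
    x = 0.0
    outputs = []
    for u_t in u:
        x = a * x + b * u_t      # state update: x[t] = a * x[t-1] + b * u[t]
        outputs.append(c * x)    # readout:      y[t] = c * x[t]
    return outputs


# Hypothetical toy values chosen for illustration.
u = [1.0, 2.0, 0.0, -1.0]
a, b, c = 0.5, 1.0, 2.0

y_conv = causal_conv(u, ssm_filter(a, b, c, len(u)))
y_rec = recurrent_decode(u, a, b, c)
print(y_conv)
print(y_rec)
```

Both paths produce the same sequence, but the recurrent form does not need to keep earlier inputs around, which is the property that makes constant-memory decoding possible.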

Improved training and inference efficiency

StripedHyena is more than 30%, 50%, and 100% faster than conventional Transformers in end-to-end training on sequences of 32k, 64k, and 128k tokens, respectively. In terms of memory efficiency, it reduces memory usage during autoregressive generation by more than 50% compared to Transformers.
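One reason a hybrid model can use less memory during generation is that every attention layer must cache keys and values for all previous tokens. The sketch below applies the standard key-value cache size estimate to hypothetical layer counts and head sizes. These numbers are assumptions for illustration only and are not StripedHyena's actual configuration or measured results.

```python
def kv_cache_bytes(seq_len, attn_layers, kv_heads, head_dim, bytes_per_value=2):
    """Estimated key-value cache size for one sequence.

    The factor 2 covers keys and values, stored for every cached token
    in every attention layer. bytes_per_value=2 assumes 16-bit storage.
    """
    return 2 * attn_layers * kv_heads * head_dim * seq_len * bytes_per_value


# Hypothetical configurations, not taken from any published model.
all_attention = kv_cache_bytes(32_000, attn_layers=32, kv_heads=32, head_dim=128)
half_attention = kv_cache_bytes(32_000, attn_layers=16, kv_heads=32, head_dim=128)

print(f"all-attention cache:  {all_attention / 1e9:.1f} GB")
print(f"half-attention cache: {half_attention / 1e9:.1f} GB")
```

The cache grows linearly with the number of cached tokens and with the number of attention layers, so replacing some attention layers with layers that keep a fixed-size state shrinks it proportionally. Grouped-query attention reduces it further by lowering `kv_heads`.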

Performance compared with attention

StripedHyena substantially narrows the quality gap with attention at scale, delivering similar perplexity and downstream performance at lower computational cost and without the need for mixed (hybridized) attention.

Applications beyond language processing

StripedHyena’s versatility extends to image recognition. The researchers tested it as a replacement for attention in Vision Transformers (ViT) and found similar accuracy on an image classification task using the ImageNet-1k dataset.

StripedHyena represents an important advance in AI architecture, offering a more efficient alternative to Transformer models, especially for long sequences. Its hybrid structure and improved training and inference performance make it a promising tool for a wide range of applications in language and vision processing.

Image source: Shutterstock
