Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»AMD Launches AMD-135M: Innovation in Small Language Models
ADOPTION NEWS

AMD Launches AMD-135M: Innovation in Small Language Models

By Crypto FlexsSeptember 28, 20242 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
AMD Launches AMD-135M: Innovation in Small Language Models
Share
Facebook Twitter LinkedIn Pinterest Email

Louisa Crawford
September 28, 2024 07:13

AMD unveiled the AMD-135M, the first compact language model with speculative decoding to improve AI model efficiency and performance.





In a significant advancement in artificial intelligence, AMD announced the release of AMD-135M, its first small language model (SLM). According to AMD.com, this new model aims to address some of the limitations faced by large language models (LLMs) such as GPT-4 and Llama while providing specialized functionality.

AMD-135M: The first AMD small language model

Part of the Llama family, the AMD-135M is AMD’s pioneering effort in SLM. The model was trained from scratch using the AMD Instinct™ MI250 accelerator and 670 billion tokens. The training process resulted in two different models: AMD-Llama-135M and AMD-Llama-135M-code. The former was pre-trained with regular data, while the latter was fine-tuned with an additional 20 billion tokens specifically for code data.

prior training: AMD-Llama-135M was trained for 6 days using 4 MI250 nodes. The AMD-Llama-135M code, a code-centric variant, required an additional four days for fine-tuning.

All associated training code, datasets, and model weights are open source, allowing developers to reproduce the model and contribute to the training of other SLMs and LLMs.

Optimization through speculative decoding

One notable advancement in AMD-135M is the use of speculative decoding. Existing autoregressive approaches for large-scale language models often have low memory access efficiency because each forward pass produces only a single token. Speculative decoding solves this problem by using a small draft model to generate candidate tokens and then verifying them with a larger target model. This method allows generating multiple tokens per forward pass, significantly improving memory access efficiency and inference speed.

Accelerate inference performance

AMD tested the performance of the AMD-Llama-135M code with a draft model of CodeLlama-7b on a variety of hardware configurations, including MI250 accelerators and Ryzen™ AI processors. The results show that inference performance is significantly improved when using speculative decoding. This enhancement establishes an end-to-end workflow for training and inference on selected AMD platforms.

next steps

AMD aims to foster innovation within the AI ​​community by providing open source reference implementations. The company encourages developers to explore and contribute to new areas of AI technology.

For more information about the AMD-135M, visit the full technology blog on AMD.com.

Image source: Shutterstock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Google unveils Gemini Omni and Gemini 3.5 Flash AI models

May 30, 2026

These three Bitcoin charts say BTC price will recover to $82,000.

May 22, 2026

Stellar (XLM) Highlights the Superiority of Native Tokenization in Securities

May 6, 2026
Add A Comment

Comments are closed.

Recent Posts

World Cup 2026 Prediction Markets Now Live On Whale.io With $90K In Prizes

June 10, 2026

Chris Jericho To Join And Co-Create Official Community Traits For Kokopi Koalas™ NFT Collection

June 9, 2026

Bancor reduced its stable fee to 0.001%. Can BNT bounce back?

June 9, 2026

Neura Closes Strategic Funding Round And Partnerships To Build Emotional AI With Persistent, User-Owned Memory

June 9, 2026

Phemex Kicks Off $7 Million Ultimate Championship, Bringing Trading Competition To Football Season

June 9, 2026

MEXC Prediction Markets Launches Combo To Enable Multi-Event Combination Trading

June 9, 2026

ZIGChain expands on-chain access by integrating Ondo tokenized stocks and ETFs.

June 8, 2026

Bitmine Immersion Technologies (BMNR) Announces ETH Holdings Reach 5.54 Million Tokens, And Total Crypto And Total Cash Holdings Of $9.6 Billion

June 8, 2026

MapleStory Universe Opens MSU Space And Launches Global Game Jam Competition As Part Of MSU 2.0 Expansion

June 8, 2026

Why is UK Financial Ltd’s trillion-dollar ERC-3643 conversion attracting major platforms?

June 7, 2026

Bybit Launches IPO Express, Becoming One Of First Centralized Crypto Exchanges To Offer Tokenized IPO Access, Starting With SpaceX

June 7, 2026

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

World Cup 2026 Prediction Markets Now Live On Whale.io With $90K In Prizes

June 10, 2026

Chris Jericho To Join And Co-Create Official Community Traits For Kokopi Koalas™ NFT Collection

June 9, 2026

Bancor reduced its stable fee to 0.001%. Can BNT bounce back?

June 9, 2026
Most Popular

NVIDIA AI Summit emphasizes safety of autonomous driving technology

October 14, 2024

Solana surpasses Ethereum and Tron in stablecoin trading volume.

January 3, 2024

Proven cryptocurrency legitimacy: Almost 100% of on-chain cryptocurrencies are legal.

January 22, 2024
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2026 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.