AMD Launches AMD-135M: Innovation in Small Language Models

Louisa Crawford
September 28, 2024 07:13

AMD unveiled the AMD-135M, the first compact language model with speculative decoding to improve AI model efficiency and performance.

In a significant advancement in artificial intelligence, AMD announced the release of AMD-135M, its first small language model (SLM). According to AMD.com, this new model aims to address some of the limitations faced by large language models (LLMs) such as GPT-4 and Llama while providing specialized functionality.

AMD-135M: The first AMD small language model

Part of the Llama family, the AMD-135M is AMD’s pioneering effort in SLM. The model was trained from scratch using the AMD Instinct™ MI250 accelerator and 670 billion tokens. The training process resulted in two different models: AMD-Llama-135M and AMD-Llama-135M-code. The former was pre-trained with regular data, while the latter was fine-tuned with an additional 20 billion tokens specifically for code data.

prior training: AMD-Llama-135M was trained for 6 days using 4 MI250 nodes. The AMD-Llama-135M code, a code-centric variant, required an additional four days for fine-tuning.

All associated training code, datasets, and model weights are open source, allowing developers to reproduce the model and contribute to the training of other SLMs and LLMs.

Optimization through speculative decoding

One notable advancement in AMD-135M is the use of speculative decoding. Existing autoregressive approaches for large-scale language models often have low memory access efficiency because each forward pass produces only a single token. Speculative decoding solves this problem by using a small draft model to generate candidate tokens and then verifying them with a larger target model. This method allows generating multiple tokens per forward pass, significantly improving memory access efficiency and inference speed.

Accelerate inference performance

AMD tested the performance of the AMD-Llama-135M code with a draft model of CodeLlama-7b on a variety of hardware configurations, including MI250 accelerators and Ryzen™ AI processors. The results show that inference performance is significantly improved when using speculative decoding. This enhancement establishes an end-to-end workflow for training and inference on selected AMD platforms.

next steps

AMD aims to foster innovation within the AI community by providing open source reference implementations. The company encourages developers to explore and contribute to new areas of AI technology.

For more information about the AMD-135M, visit the full technology blog on AMD.com.

Image source: Shutterstock

AMD Launches AMD-135M: Innovation in Small Language Models

AAVE Price Prediction: $100 is the wall. Factors that can destroy or bury a wall include:

Multicoin Capital has made its first Hyperliquid ecosystem investment in Trasia, an Asia-focused trading platform.

Polymarket Probability Price The probability that the United States will invade Iran before 2027 is 16.5%.

9 legendary cryptocurrencies you need to know

MEXC Lists Grvt (GRVT) with $60,000 Worth of GRVT and 10,000 USDT in Airdrop+ Rewards

MEXC Ventures Supports Alpha Arena’s APAC Debut at Coinfest Bali

Tria Returns More Than $600,000 to the Community That Helped Build Its Ecosystem

Bybit Launches New DCA Challenge with Up to 55,000 USDT in Rewards for BTC, ETH and XAUT Auto-Investing

MEXC Integrates World-Check to Fortify Institutional Grade Compliance Architecture

Bybit Introduces Finloop’s FUIDL backed by an AAA-rated Money Market Fund

Canton’s Decentralized App Layer Launches, Backed by $1M+ Foundation Grant

1inch launches Aqua to the public, introducing the first shared liquidity layer for DeFi

Zcash price prediction for 2026: Will $ZEC reach $500 or fall to $200?

ORBS) Announces its Participation in World Foundation’s $52.5M funding round as World Shifts From Building the Network to Scaling Utility

Top Insights

9 legendary cryptocurrencies you need to know

MEXC Lists Grvt (GRVT) with $60,000 Worth of GRVT and 10,000 USDT in Airdrop+ Rewards

MEXC Ventures Supports Alpha Arena’s APAC Debut at Coinfest Bali

Most Popular

Chainlink takes on Dogecoin on key indicators as mysterious whale pushes LINK upwards.

MANA v. AXS – Identify projects that currently have crowd support

50% of XRP supply sent to Bitfinex? what actually happened

AMD Launches AMD-135M: Innovation in Small Language Models

AMD-135M: The first AMD small language model

Optimization through speculative decoding

Accelerate inference performance

next steps

Related Posts