AssemblyAI has announced the release of a new C# .NET SDK designed to facilitate audio transcription and analysis for developers using .NET languages such as C#, VB.NET, and F#. According to AssemblyAI, the SDK aims to simplify the use of AssemblyAI’s advanced Speech AI models.
Key Features and Objectives
The SDK was developed with several key goals in mind:
- Provide an intuitive interface to all AssemblyAI models and functions using idiomatic C#.
- Ensure compatibility with multiple frameworks, including .NET 6.0, .NET Framework 4.6.2, and .NET Standard 2.0 and later.
- Minimize dependencies to avoid version conflicts and the need for binding redirects.
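The SDK is distributed through NuGet. Assuming the package id AssemblyAI (the name AssemblyAI publishes under on NuGet), adding it to a project is a single command:

dotnet add package AssemblyAI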
Transcribe audio files
One of the main features of the SDK is audio transcription. Developers can transcribe audio files asynchronously or in real time. Below is an example of how to transcribe an audio file.
using AssemblyAI;
using AssemblyAI.Transcripts;
var client = new AssemblyAIClient("YOUR_API_KEY");
var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
{
    AudioUrl = "https://storage.googleapis.com/aai-docs-samples/nbc.mp3"
});
transcript.EnsureStatusCompleted();
Console.WriteLine(transcript.Text);
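EnsureStatusCompleted() throws if the transcription did not finish successfully. If you prefer to handle failures without an exception, here is a minimal sketch, assuming the Transcript model exposes Status and Error properties as in AssemblyAI's other SDKs:

if (transcript.Status == TranscriptStatus.Error)
{
    // Assumption: Error carries the failure reason reported by the API
    Console.WriteLine($"Transcription failed: {transcript.Error}");
}
else
{
    Console.WriteLine(transcript.Text);
}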
For local files, you can also perform transcription using similar code.
await using var stream = new FileStream("./nbc.mp3", FileMode.Open);
var transcript = await client.Transcripts.TranscribeAsync(
    stream,
    new TranscriptOptionalParams
    {
        LanguageCode = TranscriptLanguageCode.EnUs
    }
);
transcript.EnsureStatusCompleted();
Console.WriteLine(transcript.Text);
Real-time audio transcription
The SDK also supports real-time audio transcription using Streaming Speech-to-Text, which is particularly useful for applications that require immediate processing of audio data.
using AssemblyAI.Realtime;
await using var transcriber = new RealtimeTranscriber(new RealtimeTranscriberOptions
{
    ApiKey = "YOUR_API_KEY",
    SampleRate = 16_000
});

transcriber.PartialTranscriptReceived.Subscribe(transcript =>
{
    Console.WriteLine($"Partial: {transcript.Text}");
});

transcriber.FinalTranscriptReceived.Subscribe(transcript =>
{
    Console.WriteLine($"Final: {transcript.Text}");
});
await transcriber.ConnectAsync();
// Pseudocode: GetAudio stands in for whatever captures audio chunks, e.g. from a microphone
GetAudio(async (chunk) => await transcriber.SendAudioAsync(chunk));
await transcriber.CloseAsync();
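To make the GetAudio placeholder concrete, one option is a third-party capture library such as NAudio, which is not part of the AssemblyAI SDK. The sketch below assumes NAudio's WaveInEvent and records 16 kHz, 16-bit, mono PCM to match the sample rate configured above:

using NAudio.Wave;

// Hedged sketch: capture microphone audio with NAudio and forward each buffer to the transcriber
var waveIn = new WaveInEvent
{
    WaveFormat = new WaveFormat(16_000, 16, 1) // 16 kHz, 16-bit, mono PCM
};
waveIn.DataAvailable += async (_, args) =>
{
    // Copy only the bytes actually recorded into this buffer before sending
    var chunk = new byte[args.BytesRecorded];
    Array.Copy(args.Buffer, chunk, args.BytesRecorded);
    await transcriber.SendAudioAsync(chunk);
};
waveIn.StartRecording();
// Call waveIn.StopRecording() before closing the transcriber.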
Leveraging LeMUR for LLM Applications
The SDK integrates with LeMUR, enabling developers to build large language model (LLM) applications on top of speech data. For example, you can ask an LLM to summarize a transcript:
var lemurTaskParams = new LemurTaskParams
{
    Prompt = "Provide a brief summary of the transcript.",
    TranscriptIds = [transcript.Id],
    FinalModel = LemurModel.AnthropicClaude3_5_Sonnet
};
var response = await client.Lemur.TaskAsync(lemurTaskParams);
Console.WriteLine(response.Response);
Audio Intelligence Models
The SDK also includes native support for AssemblyAI's Audio Intelligence models, which let you enable features such as sentiment analysis directly from the transcription request.
var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
{
    AudioUrl = "https://storage.googleapis.com/aai-docs-samples/nbc.mp3",
    SentimentAnalysis = true
});

foreach (var result in transcript.SentimentAnalysisResults!)
{
    Console.WriteLine(result.Text);
    Console.WriteLine(result.Sentiment); // POSITIVE, NEUTRAL, or NEGATIVE
    Console.WriteLine(result.Confidence);
    Console.WriteLine($"Timestamp: {result.Start} - {result.End}");
}
For more information, visit the official AssemblyAI blog.