Add live audio transcription streaming support to Foundry Local C# SDK by rui-ren · Pull Request #485 · microsoft/Foundry-Local

rui-ren · 2026-03-05T18:29:47Z

Here's the updated PR description based on the latest changes (renamed types, CoreInterop routing fix, mermaid updates):

Title: Add live audio transcription streaming support to Foundry Local C# SDK

Description:

Adds real-time audio streaming support to the Foundry Local C# SDK, enabling live microphone-to-text transcription via ONNX Runtime GenAI's StreamingProcessor API (Nemotron ASR).

The existing OpenAIAudioClient only supports file-based transcription. This PR introduces LiveAudioTranscriptionSession that accepts continuous PCM audio chunks (e.g., from a microphone) and returns partial/final transcription results as an async stream.

What's included

New files

src/OpenAI/LiveAudioTranscriptionClient.cs — Streaming session with StartAsync(), AppendAsync(), GetTranscriptionStream(), StopAsync()
src/OpenAI/LiveAudioTranscriptionTypes.cs — LiveAudioTranscriptionResult and CoreErrorResponse types

Modified files

src/OpenAI/AudioClient.cs — Added CreateLiveTranscriptionSession() factory method
src/Detail/ICoreInterop.cs — Added StreamingRequestBuffer struct, StartAudioStream, PushAudioData, StopAudioStream interface methods
src/Detail/CoreInterop.cs — Routes audio commands through existing execute_command / execute_command_with_binary native entry points (no separate audio exports needed)
src/Detail/JsonSerializationContext.cs — Registered LiveAudioTranscriptionResult for AOT compatibility
test/FoundryLocal.Tests/Utils.cs — Updated to use CreateLiveTranscriptionSession()

Documentation

API surface

var audioClient = await model.GetAudioClientAsync();
var session = audioClient.CreateLiveTranscriptionSession();

session.Settings.SampleRate = 16000;
session.Settings.Channels = 1;
session.Settings.Language = "en";

await session.StartAsync();

// Push audio from microphone callback
await session.AppendAsync(pcmBytes);

// Read results as async stream
await foreach (var result in session.GetTranscriptionStream())
{
    Console.Write(result.Text);
}

await session.StopAsync();

Design highlights

Internal push queue — Bounded Channel<T> serializes audio pushes from any thread (safe for mic callbacks) with backpressure
Retry policy — Transient native errors retried with exponential backoff (3 attempts); permanent errors terminate the session
Settings freeze — Audio format settings are snapshot-copied at StartAsync() and immutable during the session
Cancellation-safe stop — StopAsync always calls native stop even if cancelled, preventing native session leaks
Dedicated session CTS — Push loop uses its own CancellationTokenSource, decoupled from the caller's token
Routes through existing exports — StartAudioStream and StopAudioStream route through execute_command; PushAudioData routes through execute_command_with_binary — no new native entry points required

Core integration (neutron-server)

The Core side (AudioStreamingSession.cs) uses StreamingProcessor + Generator + Tokenizer + TokenizerStream from onnxruntime-genai to perform real-time RNNT decoding. The native commands (audio_stream_start/push/stop) are handled as cases in NativeInterop.ExecuteCommandManaged / ExecuteCommandWithBinaryManaged.

Verified working

✅ SDK build succeeds (0 errors)
✅ GenAI StreamingProcessor pipeline verified with WAV file (correct transcript)
✅ Core TranscribeChunk byte[] PCM path matches reference float[] path exactly
✅ Full E2E simulation: SDK Channel + JSON serialization + session management (32 partial + 1 final result)
✅ Live microphone test: 67s real-time transcription through SDK → Core → GenAI
✅ Full SDK → Core → GenAI E2E with locally built Core DLL and GenAI NuGet 0.13.0-dev

vercel · 2026-03-05T18:29:52Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
foundry-local	Ready	Preview, Comment	Mar 25, 2026 8:45am

Copilot

Pull request overview

Adds a new C# SDK API for live/streaming audio transcription sessions (push PCM chunks, receive incremental/final text results) and includes a Windows microphone demo sample.

Changes:

Introduces LiveAudioTranscriptionSession + result/error types for streaming ASR over Core interop.
Extends Core interop to support audio stream start/push/stop (including binary payload routing).
Adds a samples/cs/LiveAudioTranscription demo project and updates the audio client factory API.

Reviewed changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
sdk_v2/cs/test/FoundryLocal.Tests/Utils.cs	Replaced prior test utilities with ad-hoc top-level streaming harness code (currently breaks test build).
sdk_v2/cs/test/FoundryLocal.Tests/ModelTests.cs	Adds trailing blank lines (formatting noise).
sdk_v2/cs/src/OpenAI/LiveAudioTranscriptionTypes.cs	Adds `LiveAudioTranscriptionResult` and a structured Core error type.
sdk_v2/cs/src/OpenAI/LiveAudioTranscriptionClient.cs	Adds `LiveAudioTranscriptionSession` implementation (channels, retry, stop semantics).
sdk_v2/cs/src/OpenAI/AudioClient.cs	Adds `CreateLiveTranscriptionSession()` and removes the public file streaming transcription API.
sdk_v2/cs/src/Detail/JsonSerializationContext.cs	Registers new audio streaming types for source-gen JSON.
sdk_v2/cs/src/Detail/ICoreInterop.cs	Adds interop structs + methods for audio stream start/push/stop.
sdk_v2/cs/src/Detail/CoreInterop.cs	Implements binary command routing via `execute_command_with_binary` and start/stop routing via `execute_command`.
sdk_v2/cs/src/AssemblyInfo.cs	Adds `InternalsVisibleTo("AudioStreamTest")`.
samples/cs/LiveAudioTranscription/README.md	Documentation for the live transcription demo sample.
samples/cs/LiveAudioTranscription/Program.cs	Windows microphone demo using NAudio + new session API.
samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj	Adds sample project dependencies and references the SDK project (path currently incorrect).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

sdk/cs/src/Detail/JsonSerializationContext.cs

samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj

sdk/cs/test/FoundryLocal.Tests/ModelTests.cs

sdk_v2/cs/test/FoundryLocal.Tests/Utils.cs

sdk/cs/src/OpenAI/AudioClient.cs

sdk/cs/src/OpenAI/LiveAudioTranscriptionClient.cs

sdk_v2/cs/src/OpenAI/LiveAudioTranscriptionClient.cs

sdk_v2/cs/src/AssemblyInfo.cs

sdk_v2/cs/test/FoundryLocal.Tests/Utils.cs

samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj

hanbitmyths · 2026-03-20T22:15:26Z

Is there any reason to add zip and dlls at LiveAudioTranscription?

rui-ren · 2026-03-20T22:19:43Z

Is there any reason to add zip and dlls at LiveAudioTranscription?

Yes, I am deleting this. It was caused by the rename a lot of folders and mistakenly upload the test folder etc,, let me revert it back.

sdk/cs/src/OpenAI/AudioClient.cs

sdk/cs/src/OpenAI/LiveAudioTranscriptionClient.cs

sdk/cs/src/OpenAI/AudioClient.cs

support audio streaming-csharp

c045bf3

delete dll mock test

3970936

vercel bot deployed to Preview March 5, 2026 21:50 View deployment

update core api

ef2e9e0

vercel bot deployed to Preview March 5, 2026 23:52 View deployment

ruiren_microsoft added 2 commits March 10, 2026 18:09

update sdk

535b735

update the api

f5bd916

vercel bot deployed to Preview March 13, 2026 01:53 View deployment

rename LiveAudioTranscription

6d067e0

vercel bot deployed to Preview March 13, 2026 19:17 View deployment

Merge branch 'main' into ruiren/audio-streaming-support-sdk

eb6598d

vercel bot deployed to Preview March 13, 2026 19:18 View deployment

rui-ren changed the title ~~Add real-time audio streaming support (Microphone ASR) - c#~~ Add live audio transcription streaming support to Foundry Local C# SDK Mar 13, 2026

fix: add missing using directives for EnumeratorCancellation and Channel

6dee740

vercel bot deployed to Preview March 13, 2026 20:22 View deployment

update test

b89e1bd

vercel bot deployed to Preview March 13, 2026 20:27 View deployment

rui-ren requested review from baijumeswani and kunal-vaishnavi March 13, 2026 20:29

e2e test

eb9f282

Copilot AI review requested due to automatic review settings March 18, 2026 03:42

vercel bot deployed to Preview March 18, 2026 03:43 View deployment

Copilot started reviewing on behalf of rui-ren March 18, 2026 03:43 View session

Copilot AI reviewed Mar 18, 2026

View reviewed changes

update for test

5e98119

vercel bot deployed to Preview March 18, 2026 03:50 View deployment

kunal-vaishnavi reviewed Mar 20, 2026

View reviewed changes

samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj Outdated Show resolved Hide resolved

kunal-vaishnavi reviewed Mar 20, 2026

View reviewed changes

samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj Outdated Show resolved Hide resolved

kunal-vaishnavi reviewed Mar 20, 2026

View reviewed changes

samples/cs/LiveAudioTranscription/LiveAudioTranscription.csproj Outdated Show resolved Hide resolved

merge the main

ed9e350

rui-ren force-pushed the ruiren/audio-streaming-support-sdk branch from bb68897 to ed9e350 Compare March 20, 2026 23:02

vercel bot deployed to Preview March 20, 2026 23:03 View deployment

kunal-vaishnavi reviewed Mar 21, 2026

View reviewed changes

sdk/cs/src/OpenAI/AudioClient.cs Outdated Show resolved Hide resolved

kunal-vaishnavi reviewed Mar 21, 2026

View reviewed changes

sdk/cs/src/OpenAI/LiveAudioTranscriptionClient.cs Show resolved Hide resolved

kunal-vaishnavi reviewed Mar 21, 2026

View reviewed changes

sdk/cs/src/OpenAI/LiveAudioTranscriptionClient.cs Show resolved Hide resolved

update response type

0cac7f3

vercel bot deployed to Preview March 23, 2026 00:00 View deployment

fix nenad

06dc45c

vercel bot deployed to Preview March 23, 2026 00:27 View deployment

kunal-vaishnavi reviewed Mar 23, 2026

View reviewed changes

sdk/cs/src/OpenAI/AudioClient.cs Show resolved Hide resolved

ruiren_microsoft added 2 commits March 24, 2026 10:14

add unitest

709788c

update the ci core package

24aacb1

vercel bot deployed to Preview March 24, 2026 17:31 View deployment

update the ci core package

eeb34b8

vercel bot deployed to Preview March 24, 2026 17:39 View deployment

Update Core version to 0.9.0 in JS install script

57ce460

vercel bot deployed to Preview March 24, 2026 19:32 View deployment

update the npkg

10bbcb8

vercel bot deployed to Preview March 25, 2026 02:55 View deployment

Trigger CI re-run

a02e381

vercel bot deployed to Preview March 25, 2026 03:15 View deployment

Update package versions

c17a74d

vercel bot deployed to Preview March 25, 2026 08:19 View deployment

Temporarily use different FL Core WinML version

fc0fa6e

vercel bot deployed to Preview March 25, 2026 08:35 View deployment

Use ORT nightly feed for getting ORT GenAI in JS builds

5678587

vercel bot deployed to Preview March 25, 2026 08:45 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add live audio transcription streaming support to Foundry Local C# SDK#485

Add live audio transcription streaming support to Foundry Local C# SDK#485
rui-ren wants to merge 24 commits intomainfrom
ruiren/audio-streaming-support-sdk

rui-ren commented Mar 5, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 5, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hanbitmyths commented Mar 20, 2026

Uh oh!

rui-ren commented Mar 20, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

rui-ren commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's included

New files

Modified files

Documentation

API surface

Design highlights

Core integration (neutron-server)

Verified working

Uh oh!

vercel bot commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hanbitmyths commented Mar 20, 2026

Uh oh!

rui-ren commented Mar 20, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

rui-ren commented Mar 5, 2026 •

edited

Loading

vercel bot commented Mar 5, 2026 •

edited

Loading