Realtime Client Proposal by tarekgh · Pull Request #1 · tarekgh/extensions

tarekgh · 2025-11-21T23:40:12Z

Realtime Client Abstraction Proposal

This proposal outlines a unified abstraction for Realtime model clients within the Microsoft.Extensions.AI.Abstractions library. The goal is to provide a consistent and provider-agnostic interface for interacting with Realtime AI systems, making it easier for developers to integrate, use, and switch between different model implementations.

Realtime models typically operate over bidirectional streaming, enabling clients to send input and receive output concurrently. This allows the model server to begin generating results while still processing incoming data.

Interacting with realtime models usually involves creating a session or connection through which input and output are exchanged as streams. Sessions may maintain state across multiple interactions, enabling richer, more context-aware responses.

Realtime models can accept various types of input, such as text, audio, or images and typically produce output in the form of text or audio.

Proposed Interface

IRealtimeClient

Defined in IRealtimeClient.cs

This is the primary interface for interacting with realtime models. Applications use it to create sessions and manage realtime connections. Below is an example of how application code might look:

// OpenAIRealtimeClient is an example implementation of IRealtimeClient
IRealtimeClient realtimeClient = new OpenAIRealtimeClient(apiKey);

IRealtimeSession

Defined in IRealtimeSession.cs

After creating an IRealtimeClient, you can use it to create an instance of IRealtimeSession, which represents an individual session with the realtime model. A session enables sending input and receiving output as streams. Here’s an example of how the application code might look:

IRealtimeSession realtimeSession = await realtimeClient.CreateSessionAsync();

The application can then send input and receive output through the session using a mechanism similar to the example shown below:

// Create channel for client messages
clientMessageChannel = Channel.CreateUnbounded<RealtimeClientMessage>();
streamingCancellationTokenSource = new CancellationTokenSource();

// Start background task to process server messages
_ = Task.Run(async () =>
{
    try
    {
        await foreach (var serverMessage in realtimeSession.GetStreamingResponseAsync(
            clientMessageChannel.Reader.ReadAllAsync(streamingCancellationTokenSource.Token),
            streamingCancellationTokenSource.Token))
        {
            ProcessServerMessage(serverMessage); // application method to handle server messages
        }
    }
    catch (OperationCanceledException)
    {
        // Expected when cancelling
    }
    catch (Exception ex)
    {
        HandleError(ex); // application method to handle errors
    }
}, streamingCancellationTokenSource.Token);

The application sends messages to the model by creating instances of RealtimeClientMessage and writing them to the clientMessageChannel. RealtimeClientMessage is a base type with specialized derived types representing the different message categories supported by realtime models.

Similarly, the application receives RealtimeServerMessage instances from the model. This is also a base type with multiple derived message types representing messages the server may emit.

The abstraction defines the following common client and server message types, with the expectation that more can be added over time.

Client Message Types

Type	Purpose
RealtimeClientConversationItemCreateMessage	Creates a new conversation item (text, audio, image, etc.).
RealtimeClientInputAudioBufferAppendMessage	Appends audio data to an existing audio buffer in the conversation.
RealtimeClientInputAudioBufferCommitMessage	Commits the appended audio data in the buffer for processing.
RealtimeClientResponseCreateMessage	Requests generation of a new response within the conversation.

Server Message Types

Type	Purpose
RealtimeServerErrorMessage	Represents an error sent by the model server.
RealtimeServerInputAudioTranscriptionMessage	Represents messages related to input audio transcription results.
RealtimeServerOutputTextAudioMessage	Represents generated output text and audio from the model.
RealtimeServerResponseCreatedMessage	Indicates a new response item has been created on the server.

Here are examples of how to send messages to the model:

//
// Send a message to create a new audio conversation item
//
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientInputAudioBufferAppendMessage(
    audioContent: new DataContent($"data:audio/pcm;base64,{Convert.ToBase64String(resampledAudio)}")));
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientInputAudioBufferCommitMessage());

//
// Create conversation item with text
//
var contentItem = new RealtimeContentItem(new TextContent(text), id: null, role: ChatRole.User);
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientConversationItemCreateMessage(item: contentItem));
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientResponseCreateMessage());

//
// Create conversation item with image
//
var contentItem = new RealtimeContentItem(new DataContent($"data:{mimeType};base64,{base64Image}"), id: null, role: ChatRole.User);
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientConversationItemCreateMessage(item: contentItem));
await clientMessageChannel.Writer.WriteAsync(new RealtimeClientResponseCreateMessage());

Here are examples of receiving messages from the model:

private void ProcessServerMessage(RealtimeServerMessage serverMessage)
{
    Invoke(new Action(() =>
    {
        switch (serverMessage)
        {
            case RealtimeServerOutputTextAudioMessage audioMessage:
                if (audioMessage.Type == RealtimeServerMessageType.OutputAudioDelta && audioMessage.Text != null)
                {
                    // PlayAudioChunk(audioMessage.Text);
                }
                else if (audioMessage.Type == RealtimeServerMessageType.OutputAudioTranscriptionDelta && audioMessage.Text != null)
                {
                    // WriteTranscript(audioMessage.Text);
                }
                else if (audioMessage.Type == RealtimeServerMessageType.OutputAudioTranscriptionDone)
                {
                    // output transcription is done
                }
                break;

            case RealtimeServerInputAudioTranscriptionMessage transcriptionMessage:
                if (transcriptionMessage.Type == RealtimeServerMessageType.InputAudioTranscriptionCompleted &&
                    transcriptionMessage.Transcription != null)
                {
                    // WriteUserInputTranscription($"You: {transcriptionMessage.Transcription}\n");
                }
                break;

            case RealtimeServerErrorMessage errorMessage:
                // WriteError($"Error: {errorMessage.Error?.Message}");
                break;

            case RealtimeServerResponseCreatedMessage responseMessage:
                if (responseMessage.Usage != null)
                {

                    // WriteUsage($"Usage - Input: {responseMessage.Usage.InputTokenCount}, Output: {responseMessage.Usage.OutputTokenCount}\n");
                }
                break;
        }
   }));
}

Important Notes

The abstraction includes a mechanism for updating session settings after creation. This is done through the UpdateAsync method on the IRealtimeSession interface. The RealtimeSessionOptions type represents the updatable settings, and providers may extend this type to include provider-specific options.
The abstraction is designed to be extensible. Clients can create RealtimeClientMessage instances that contain raw data for scenarios not covered by the predefined message types. The RealtimeClientMessage.RawRepresentation property supports this behavior.
Likewise, servers can send RealtimeServerMessage instances with raw data for scenarios outside the predefined message types. Applications can access this through the RealtimeServerMessage.RawRepresentation property.
Several additional types were introduced, such as LogProbability, NoiseReductionOptions, PromptTemplate, RealtimeAudioFormat, SemanticVoiceActivityDetection, ServerVoiceActivityDetection, ToolChoiceMode, and TranscriptionOptions. If we choose to keep some or all of these, they may need to be moved into more appropriate folders. For now, they are grouped together to simplify review.
The proposal contains the types RealtimeClientMessageType and RealtimeServerMessageType enumerations to categorize the different message types. I am on the fence about including these; they may not be necessary.
Many implementation details across these types can be discussed during the review process.
Naming is often challenging; the current names reflect initial choices and can be revised as needed.
XML documentation has been added for public types and members, though there is room for refinement.
An example implementation based on this abstraction is available in the OpenAIRealtimeClient file. This is not part of the proposal but serves to validate the design. The implementation is not fully complete but supports the major scenarios. A demo application is also provided to illustrate practical usage.
This proposal does not address related topics such as delegate wrappers, dependency injection extensions, or telemetry support. These can be added later after the core abstraction is finalized.
As I am still ramping up in this area, there may be gaps or missing details in the proposal. I would greatly appreciate any feedback or suggestions to help refine and improve it.

ericstj · 2025-11-24T17:52:15Z

+/// <summary>Represents a real-time client.</summary>
+/// <remarks>This interface provides methods to create and manage real-time sessions.</remarks>
+[Experimental("MEAI001")]
+public interface IRealtimeClient : IDisposable


It seems very intentional that Chat is omitted from the name. Is this consistent with the way the models/providers are exposing real-time support?

Right, I don't think the Chat term is used much with the realtime models. Here is snippet example of one of the python's implemenatin:

from openai_realtime import RealtimeClient client = RealtimeClient(api_key="...", model="gpt-realtime") client.send_text("Hello") for event in client: print(event)

ericstj · 2025-11-24T17:54:42Z

+    /// <summary>
+    /// Gets or sets the type of audio. For example, "audio/pcm".
+    /// </summary>
+    public string Type { get; set; }


Elsewhere we call this MediaType

Just wondering if the MediaType is broader and not scoped to audio formats?

ericstj · 2025-11-24T18:31:42Z

This all looks very different from the existing chat client APIs, is it necessarily so? I was imagining there would be an onramp from existing APIs to real-time. As it is, it seems a complete rewrite?

tarekgh · 2025-11-24T18:42:58Z

This all looks very different from the existing chat client APIs, is it necessarily so? I was imagining there would be an onramp from existing APIs to real-time. As it is, it seems a complete rewrite?

Yes, it is different because the chat client is designed around a traditional request–response model. As mentioned in the description, realtime models operate over bidirectional streaming, allowing clients to send input and receive output at the same time. This enables the server to start generating results while it is still processing incoming data.

For example, Realtime models support Voice Activity Detection (VAD), which can determine when to begin responding even before the entire audio stream is received. Think of it as a natural voice conversation: you can interrupt the model while it is speaking, and the server can start responding as soon as it detects a brief pause in your speech.

If you have a better approach for supporting Realtime models, I’d be happy to explore it.

ericstj · 2025-11-24T20:23:39Z

I'm just trying to think about how this fits into existing samples / templates and how we teach people about it. It sounds like it's not a "grow into" type of technology but instead an "early fork".

If an early fork, then I'd still expect most of the different tasks we already have to be possible in real-time. I wonder if it would help to look at a table of the tasks, which types are involved for Chat and which types are involved for Realtime. In some cases maybe we can use similar or same types (Options, Content?), but in others we might need new types. I would expect Realtime to be a superset of functionality in most cases. Is it ever possible to derive from those existing types?

Additionally - we have infrastructure around IChatClient - like function calling - how does that work in RealTime and can we reuse the infrastructure we already have?

tarekgh · 2025-11-24T20:55:59Z

In some cases maybe we can use similar or same types (Options, Content?), but in others we might need new types

That’s exactly the approach I followed in the proposal. I reused existing types wherever they naturally fit, such as ErrorContent, AITool, AIContent, UsageDetails, etc. I also introduced several types that can apply to both chat and realtime models, including LogProbability, PromptTemplate, ToolChoiceMode, and TranscriptionOptions. We can certainly decide to delay adding some of these, but I’ve noticed that newer OpenAI models like GPT-5.1 are already using similar concepts, so it felt appropriate to include them.

The proposal also defines realtime-specific types, such as VoiceActivityDetection, ServerVoiceActivityDetection, SemanticVoiceActivityDetection, and RealtimeSessionOptions, to cover functionality unique to realtime scenarios.

Additionally - we have infrastructure around IChatClient - like function calling - how does that work in RealTime and can we reuse the infrastructure we already have?

I believe the use of AITool and function calls should work the same way with Realtime models. For function calls, the process ultimately extracts the function name and parameters from our types, and then sends that information to the model.

ericstj · 2025-11-24T21:00:32Z

I believe the use of AITool and function calls should work the same way with Realtime models. For function calls, the process ultimately extracts the function name and parameters from our types, and then sends that information to the model.

Can you have a look at how function calling works today in https://github.com/dotnet/extensions/blob/e7fac9d9885b12ea2aacf75875802cc4571ee2ca/src/Libraries/Microsoft.Extensions.AI/ChatCompletion/FunctionInvokingChatClient.cs. We have these and other infrastructure built up around IChatClient - are we going to need parallel types around IRealtimeSession?

tarekgh · 2025-11-24T21:14:28Z

Can you have a look at how function calling works today in https://github.com/dotnet/extensions/blob/e7fac9d9885b12ea2aacf75875802cc4571ee2ca/src/Libraries/Microsoft.Extensions.AI/ChatCompletion/FunctionInvokingChatClient.cs. We have these and other infrastructure built up around IChatClient - are we going to need parallel types around IRealtimeSession?

I’ll look into that. My initial thought is that we shouldn’t need a parallel infrastructure; function calls could be handled like any other conversation item (similar to text, audio, etc.). The client would initiate a function call through a client message using FunctionCallContent, and the model would return the results in a server message. That said, I’m not an expert yet, so I may be overlooking something. I’ll spend more time reviewing this area.

tarekgh · 2025-11-25T15:37:50Z

Here are the open issues currently being tracked in the proposal:

Supporting similar mechanism as FunctionInvokingChatClient with the ChatClient. Look more to the other tools like MCP in general too. Note, Realtime models are full duplex streaming which differ than ChatClient in this matter.
Need to have DelegatingRealtimeSession?
Do we need a Realtime builder as we have it with the ChtClientBuilder? should the builder be for the Session instead?
For consistency, use IList instead of IEnumerable in the places like IEnumerable or similar places.
Add a server message corresponds to response.output_item.added which can be used to handle the tool calls.
Should we have object? GetService(Type serviceType, object? serviceKey = null); on IRealtimeSession?

I’ll continue adding items to the list as we explore the proposal in more depth.

stephentoub

Have you tried implementing this on multiple providers?

stephentoub · 2025-11-27T02:57:34Z

+
+    /// <summary>Gets a value indicating whether the session is currently connected.</summary>
+    /// <returns><see langword="true"/> if the session is connected; otherwise, <see langword="false"/>.</returns>
+    bool IsConnected { get; }


Why is this needed? How does it get used?

I've removed it. Initially, I thought it might be useful if having a session object to check the connection status, but I've removed it for now until we find a need for it.

stephentoub · 2025-11-27T02:58:49Z

+    /// <param name="updates">The sequence of real-time messages to send.</param>
+    /// <param name="cancellationToken">A token to cancel the operation.</param>
+    /// <returns>The response messages generated by the session.</returns>
+    IAsyncEnumerable<RealtimeServerMessage> GetStreamingResponseAsync(


Can I call this multiple times on the same IRealtimeSession instance?

No, this method cannot be called concurrently on the same session instance. The provider's implementation should throw an exception if multiple calls are attempted. However, calling it sequentially should be fine, though I don't anticipate this being a common use case. I have added a remark to the docs for that.

stephentoub · 2025-11-27T02:59:34Z

+/// The log of the model’s confidence in generating a token. Higher values mean the token was more likely according to the model.
+/// </summary>
+[Experimental("MEAI001")]
+public class LogProbability


Is this something that developers need in the 90% case? Is this generic across all providers?

This is typically used by AI engineers for guardrails and testing. It helps with confidence scoring or probability distribution between tokens. I’m going to remove it from the proposal for now until we identify a need for it, at which point we can reintroduce it.

By the way, Log Probability is supported by other providers, such as Gemini Flash models. However, there are differences in the supported fields between providers. For example, OpenAI uses bytes to generate the result, whereas Gemini uses token IDs instead.

stephentoub · 2025-11-27T03:00:34Z

+/// Represents a reusable prompts that you can use in requests, rather than specifying the content of prompts in code.
+/// </summary>
+[Experimental("MEAI001")]
+public class PromptTemplate


This doesn't seem specific to real-time. We should think through whether we need a representation for this that applies to IChatClient as well, for example. Or if it's actually needed at all... with IChatClient, devs that need this from the underlying provider can break glass, using RawRepresentationFactory or similar.

I removed it for now. We can bring it back later if we need to.

stephentoub · 2025-11-27T03:03:09Z

+/// This is used to identify the purpose of the message being sent to the model.
+/// </summary>
+[Experimental("MEAI001")]
+public enum RealtimeClientMessageType


How does this relate to the concrete subtypes like RealtimeClientInputAudioBufferAppendMessage ?

I had initially to be able of using the same subtype for multiple events, but this is not the case any more so, I have removed RealtimeClientMessageType.

stephentoub · 2025-11-27T03:04:24Z

+    /// <summary>
+    /// Gets or sets the tool choice mode for the response.
+    /// </summary>
+    /// <remarks>
+    /// If FunctionToolName or McpToolName is specified, this value will be ignored.
+    /// </remarks>
+    public ToolChoiceMode? ToolChoiceMode { get; set; }
+
+    /// <summary>
+    /// Gets or sets the name of the function tool to use for the response.
+    /// </summary>
+    /// <remarks>
+    /// If specified, the ToolChoiceMode, McpToolName, and McpToolServerLabel values will be ignored.
+    /// </remarks>
+    public string? FunctionToolName { get; set; }
+
+    /// <summary>
+    /// Gets or sets the name of the MCP tool to use for the response.
+    /// </summary>
+    /// <remarks>
+    /// If specified, the MCP tool server label will also be required.
+    /// </remarks>
+    public string? McpToolName { get; set; }
+
+    /// <summary>
+    /// Gets or sets the label of the MCP tool server to use for the response.
+    /// </summary>
+    /// <remarks>
+    /// If specified, the MCP tool name will also be required.
+    /// </remarks>
+    public string? McpToolServerLabel { get; set; }


Why are these needed? Can't these be modeled using the same AITool-derived types we already have?

stephentoub · 2025-11-27T03:05:20Z

+    /// <summary>
+    /// Gets or sets the content of the conversation item.
+    /// </summary>
+    public AIContent Content { get; set; }


It's always one and only one?

You are correct, this should be an array of contents. I'll fix that.

stephentoub · 2025-11-27T03:06:00Z

+/// Used with the <see cref="RealtimeServerMessageType.Error"/>.
+/// </remarks>
+[Experimental("MEAI001")]
+public class RealtimeServerErrorMessage : RealtimeServerMessage


Could this just be ErrorContent as part of another message?

I prefer to keep this as a separate error message because it includes additional properties, such as ErrorEventId and Parameter, which cannot necessarily be added to other messages or to the ErrorContent itself. Please let me know if you still feel differently.

stephentoub · 2025-11-27T03:07:27Z

+    /// This property is used only when having audio and text tokens. Otherwise InputTokenCount is sufficient.
+    /// </remarks>
+    [Experimental("MEAI001")]
+    public long? InputTextTokenCount { get; set; }


These are important enough to model like this rather than as part of AdditionalCounts?

@stephentoub I don't know where we draw the line for adding such new property or just using the AdditionalCounts? Recently, I am seeing we have added CachedInputTokenCount, why decided as a new property and not a part of AdditionalCounts? I am fine either way but will be good if we have a clear guidance when using AdditionalCounts and when add a new property.

Commonality across providers. Individual providers have been putting cached token information into AdditonalCounts for a while, but now that most providers have this notion and expose it, we elevated it to public property.

The new properties look supported in multiple providers. The difference is in Gemini Flash models it is derived from modality. Would that be enough to introduce the properties or just add them to the AdditonalCounts for now and we can consider exposing them later if we want to?

OpenAI:

"usage": { "input_token_details": { "text_tokens": 100, "audio_tokens": 400 } }

Gemini:

"usage_metadata": { "prompt_tokens_details": [ { "modality": "TEXT", "token_count": 100 }, { "modality": "AUDIO", "token_count": 400 } ] }

stephentoub · 2025-11-27T03:08:43Z

    <ForceLatestDotnetVersions>true</ForceLatestDotnetVersions>
    <MinCodeCoverage>n/a</MinCodeCoverage>
    <MinMutationScore>n/a</MinMutationScore>
+    <NoWarn>$(NoWarn);MEAI001</NoWarn> <!-- Added to suppress MEAI001 warnings thrown because of experimental usage added to UsageDetails type -->


I assume this is because of the source generator? Everyone else would get those same warnings as well. The experimental properties on serializable types should be marked as [JsonIgnore] as long as they're experimental.

Thanks. Using [JsonIgnore] fixed the issue.

tarekgh · 2026-01-12T23:02:53Z

Have you tried implementing this on multiple providers?

My plan is to implement this for the Gemini Flash 2.5 model. I'll update the proposal according to the finding from this implementation.

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> Co-authored-by: jozkee <16040868+jozkee@users.noreply.github.com> Co-authored-by: David Cantú <dacantu@microsoft.com> Co-authored-by: Stephen Toub <stoub@microsoft.com>

…tnet#7236) * Initial plan * Update ModelContextProtocol packages to 0.7.0-preview.1 Co-authored-by: jeffhandley <1031940+jeffhandley@users.noreply.github.com> * Update template package version to align with ModelContextProtocol 0.7.0-preview.1 Co-authored-by: jeffhandley <1031940+jeffhandley@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: jeffhandley <1031940+jeffhandley@users.noreply.github.com>

…7.1 (dotnet#7237) * Initial plan * Update Agent Framework package versions to 1.0.0-preview.260127.1 Co-authored-by: jeffhandley <1031940+jeffhandley@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: jeffhandley <1031940+jeffhandley@users.noreply.github.com>

…conventions (dotnet#7240) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

…et#7250) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

* Bring new cpu.requests formula from Kubernetes * Fix tests * Fix tests * Fix tests * Fix tests * Fix test * Fix assert

…397 (dotnet#7247) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

…gistration (dotnet#7255) Move _changeTokenRegistration.Dispose() outside the lock to avoid deadlock. CancellationTokenRegistration.Dispose() blocks waiting for any in-flight callback to complete, but the callback (RefreshAsync) tries to acquire the same lock, causing a deadlock. The fix captures the registration reference while holding the lock, then disposes it after releasing the lock. Applied to both RefreshAsyncInternal and DisposeAsync methods.

…th inverted polarity (dotnet#7262) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> Co-authored-by: Stephen Toub <stoub@microsoft.com>

…nt (dotnet#7261) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: halter73 <54385+halter73@users.noreply.github.com> Co-authored-by: Stephen Toub <stoub@microsoft.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

* Initial plan * Add ReasoningOptions to ChatOptions with OpenAI implementation Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> * Address code review feedback: add Clone method and document ExtraHigh limitation Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> * Address PR feedback: remove experimental attributes, make Clone internal, simplify ToOpenAIChatReasoningEffortLevel Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> * Address PR feedback: update docs, remove enum values, add tests Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> * Rename ReasoningOutput.Detailed to ReasoningOutput.Full Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

dotnet#7266) Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

… content (dotnet#7267) * Update OpenAIResponsesChatClient to handle streaming code interpreter content Right now it outputs it but in a bulk fashion only at the end of the response item. This makes it yield the deltas instead. * Dedup code block

… to 1.39 (dotnet#7274) * Initial plan * Update OpenTelemetry semantic convention version comments from 1.38 to 1.39 Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

…206.3 (dotnet#7277) On relative base path root Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Templating , Microsoft.DotNet.Helix.Sdk From Version 9.0.0-beta.26070.1 -> To Version 9.0.0-beta.26106.3 Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>

* Remove [Experimental] attribute from IChatReducer * Annotate APIs that use experimental OpenAI APIs. Remove prerelease label. * Fix typo Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Remove project-wide OpenAI experimental suppressions. Finish annotating. * Use granular constants for openai experimental diagnostics * Update API baselines * Remove unused const * Remove redundant [Experimental] attributes for OpenAI Responses members * Update ApiChief baselines for MEAI --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…nt to JSON serialization infrastructure (dotnet#7275) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

… API (dotnet#7231) * new api surface, first test iteration * fix tests * add experimental attributes * Update test/Libraries/Microsoft.Extensions.Http.Diagnostics.Tests/Latency/HttpClientLatencyTelemetryExtensionsTest.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * move and remove tests * fix warnings * update xml documentation * additional tests coverage * update tests and xml documentation --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

tarekgh · 2026-02-11T00:39:38Z

Closing this as we opened the official PR at dotnet#7285

tarekgh marked this pull request as draft November 21, 2025 23:57

tarekgh self-assigned this Nov 22, 2025

ericstj reviewed Nov 24, 2025

View reviewed changes

stephentoub reviewed Nov 27, 2025

View reviewed changes

tarekgh force-pushed the RealtimeClientProposal branch from 79edc89 to 6f78967 Compare January 7, 2026 19:13

tarekgh force-pushed the RealtimeClientProposal branch 3 times, most recently from ecd8e3f to 01b7a96 Compare January 27, 2026 21:15

Copilot AI and others added 6 commits January 29, 2026 15:28

HtmlEncode the JSON data before embedding. (dotnet#7238)

e73bed3

Fix token metric unit to use UCUM format {token} (dotnet#7241)

140e405

Add server tool call support to OpenTelemetryChatClient per semantic …

032abd9

…conventions (dotnet#7240) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

tarekgh force-pushed the RealtimeClientProposal branch from 4982428 to 32b1f43 Compare February 2, 2026 18:25

Copilot AI and others added 6 commits February 2, 2026 13:46

Preserve extra JSON schema properties in ToolJson serialization (dotn…

32be2da

…et#7250) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Bring new cpu.requests formula from Kubernetes (dotnet#7239)

096975b

* Bring new cpu.requests formula from Kubernetes * Fix tests * Fix tests * Fix tests * Fix tests * Fix test * Fix assert

Update M.E.AI changelogs with recent changes (dotnet#7242)

41797f1

Fix DataUriParser to default to text/plain;charset=US-ASCII per RFC 2…

99b3272

…397 (dotnet#7247) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot AI and others added 13 commits February 4, 2026 22:55

Fix OpenAI responses streaming to preserve encrypted reasoning content (

013d9fd

dotnet#7266) Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update MinorVersion from 3 to 4 (dotnet#7268)

8625d67

Update ApiChief script to use the net10.0 artifacts (dotnet#7280)

1925fde

Update AI changelogs for 10.3.0 (dotnet#7282)

9974fbf

Realtime Client Proposal

4c04354

tarekgh force-pushed the RealtimeClientProposal branch from dbfa327 to 4c04354 Compare February 11, 2026 00:19

tarekgh closed this Feb 11, 2026

Conversation

tarekgh commented Nov 21, 2025

Realtime Client Abstraction Proposal

Proposed Interface

IRealtimeClient

IRealtimeSession

Client Message Types

Server Message Types

Important Notes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tarekgh Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ericstj commented Nov 24, 2025

Uh oh!

tarekgh commented Nov 24, 2025

Uh oh!

ericstj commented Nov 24, 2025

Uh oh!

tarekgh commented Nov 24, 2025

Uh oh!

ericstj commented Nov 24, 2025

Uh oh!

tarekgh commented Nov 24, 2025

Uh oh!

tarekgh commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stephentoub left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tarekgh Jan 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tarekgh Nov 24, 2025 •

edited

Loading

tarekgh commented Nov 25, 2025 •

edited

Loading

tarekgh Jan 7, 2026 •

edited

Loading