
feat(audio): add tracking for audio transcriptions in OpenAI client #400

Open
JasonLovesDoggo wants to merge 4 commits into PostHog:master from JasonLovesDoggo:feat/support-transcribe

Conversation


@JasonLovesDoggo JasonLovesDoggo commented Jan 2, 2026

This adds support for tracking transcriptions from OpenAI. It does this via a new event, $ai_transcription, which follows the pattern of embeddings. I figure that audio -> text is different enough from text-to-text to deserve its own event.

Confirmed it worked in my own testing. Feel free to impersonate and view https://us.posthog.com/project/254263/events/2e8ded5c-acd2-45b4-b10f-7a85a438ffaa/2026-01-02T15%3A02%3A00.007000-05%3A00

Copilot AI review requested due to automatic review settings January 2, 2026 19:47

greptile-apps bot commented Jan 2, 2026

Greptile's behavior is changing!

From now on, if a review finishes with no comments, we will not post an additional "statistics" comment to confirm that our review found nothing to comment on. However, you can confirm that we reviewed your changes in the status check section.

This feature can be toggled off in your Code Review Settings by deselecting "Create a status check for each PR".


Copilot AI left a comment


Pull request overview

This PR adds tracking support for OpenAI audio transcriptions via a new $ai_transcription event. The implementation follows the existing pattern used for embeddings, treating audio-to-text as a distinct operation from text-to-text transformations.

  • Introduces WrappedAudio and WrappedTranscriptions classes for both sync and async OpenAI clients
  • Captures transcription metadata including model, input file name, output text, latency, and optional properties like language and audio duration
  • Supports privacy mode, groups, and custom properties consistent with other AI tracking features
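
For illustration, the delegation-plus-capture shape described above can be sketched roughly as follows. This is a hypothetical simplification, not the PR's actual implementation (the real WrappedAudio and WrappedTranscriptions classes live in posthog/ai/openai/openai.py and carry many more properties):

```python
import time


class WrappedTranscriptions:
    """Hypothetical sketch of a delegating wrapper around the OpenAI
    transcriptions resource: it forwards create() to the real client
    and captures a $ai_transcription event with basic metadata."""

    def __init__(self, transcriptions, capture_fn):
        self._transcriptions = transcriptions  # e.g. client.audio.transcriptions
        self._capture = capture_fn             # e.g. posthog.capture

    def create(self, **kwargs):
        start = time.time()
        result = self._transcriptions.create(**kwargs)
        # Capture metadata only; property names here are illustrative.
        self._capture(
            event="$ai_transcription",
            properties={
                "$ai_model": kwargs.get("model"),
                "$ai_latency": time.time() - start,
            },
        )
        return result
```

The wrapper keeps the upstream API surface unchanged, so callers still invoke `create()` exactly as they would on the plain OpenAI client.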

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| posthog/ai/openai/openai.py | Adds WrappedAudio and WrappedTranscriptions classes to track transcription usage in the sync OpenAI client |
| posthog/ai/openai/openai_async.py | Adds async versions of WrappedAudio and WrappedTranscriptions to track transcription usage in the async OpenAI client |
| posthog/test/ai/openai/test_openai.py | Adds comprehensive test coverage for transcription tracking, including basic usage, duration tracking, the language parameter, groups, privacy mode, and async support |


JasonLovesDoggo (Author) commented:

cc @andrewm4894

rafaeelaudibert (Member) commented:

We've updated our release process. We require sampo now. Please rebase on master and check the README to understand what needs to be done.

@rafaeelaudibert rafaeelaudibert requested a review from a team February 19, 2026 03:20
andrewm4894 (Member) commented:

@PostHog/team-llm-analytics this is an interesting one as it adds a new $ai_transcription event so maybe needs some thought/discussion

JasonLovesDoggo (Author) commented:

> We've updated our release process. We require sampo now. Please rebase on master and check README to understand what should be done.

Gotcha, happy to add this once I get the goahead that this can move forward

carlos-marchal-ph (Contributor) commented:

Hi Jason! Carlos from the PostHog LLM analytics team here. Appreciate the PR, that's certainly a sensible way to go about it. Unfortunately I'm leaning towards shelving this for the moment. There are some internal reasons for it:

  • We are currently implementing multimodal support. The current architecture leans towards treating all input/output to LLMs as potentially multimodal, which also supports mixed content by default.
  • We are still discussing how we'd like to handle potentially large payloads in AI events. Audio might not be the worst offender here, but in line with the above we want a solution that can ingest large blobs (hi-res images, potentially even videos at some point). This might impact how we ingest them from the SDK side, so we don't want to commit to anything right now.
  • In line with the above, we are trying to centralise as much of the event post-processing as possible to our backend. This makes it easier to control for us, and also avoids having to duplicate features across the different SDKs.

In the meantime, you should be able to get this working without forking by writing a small helper that calls client.audio.transcriptions.create() on the standard OpenAI client and then uses posthog.capture() to send a custom event with the properties you care about (model, latency, etc.). We'll be sure to credit you if we end up using part/all of your code moving forward. And again, thanks a lot for contributing!
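
The suggested workaround might look something like the sketch below. The helper names and the property keys are assumptions for illustration; only `client.audio.transcriptions.create()` and `posthog.capture()` come from the comment above:

```python
import time
import uuid


def build_transcription_properties(model, latency, input_file=None,
                                   output_text=None, language=None,
                                   privacy_mode=False):
    """Assemble event properties; the output text is dropped in privacy
    mode, and unset optional fields are omitted. Key names are illustrative,
    not an official PostHog schema."""
    props = {
        "$ai_model": model,
        "$ai_latency": latency,
        "$ai_input_file": input_file,
        "$ai_language": language,
    }
    if not privacy_mode:
        props["$ai_output"] = output_text
    return {k: v for k, v in props.items() if v is not None}


def transcribe_with_tracking(client, posthog, audio_file, model="whisper-1",
                             distinct_id=None, privacy_mode=False, **kwargs):
    """Call the standard OpenAI client, then send a custom event via
    posthog.capture(). `client` is an openai.OpenAI instance and `posthog`
    the posthog module (or a Posthog client)."""
    start = time.time()
    result = client.audio.transcriptions.create(model=model, file=audio_file,
                                                **kwargs)
    posthog.capture(
        distinct_id=distinct_id or str(uuid.uuid4()),
        event="$ai_transcription",
        properties=build_transcription_properties(
            model=model,
            latency=time.time() - start,
            input_file=getattr(audio_file, "name", None),
            output_text=getattr(result, "text", None),
            privacy_mode=privacy_mode,
        ),
    )
    return result
```

Keeping the property assembly in a separate pure function makes the tracking logic easy to unit-test without touching the OpenAI or PostHog APIs.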

JasonLovesDoggo (Author) commented:

> Hi Jason! Carlos from the PostHog LLM analytics team here. Appreciate the PR, that's certainly a sensible way to go about it. Unfortunately I'm leaning towards shelving this for the moment. There's some internal reasons for it:
>
> • We are currently implementing multimodal support. The current architecture leans towards treating all input/output to LLMs as potentially multimodal, which also supports mixed content by default.
> • We are still discussing how we'd like to handle potentially large payloads in AI events. Audio might not be the worst offender here, but in line with the above we want a solution that can ingest large blobs (hi-res images, potentially even videos at some point). This might impact how we ingest them from the SDK side, so we don't want to commit to anything right now.
> • In line with the above, we are trying to centralise as much of the event post-processing as possible to our backend. This makes it easier to control for us, and also avoids having to duplicate features across the different SDKs.
>
> In the meantime, you should be able to get this working without forking by writing a small helper that calls client.audio.transcriptions.create() on the standard OpenAI client and then uses posthog.capture() to send a custom event with the properties you care about (model, latency, etc.). We'll be sure to credit you if we end up using part/all of your code moving forward. And again, thanks a lot for contributing!

All good, looking forward to this!
