
feat(openai): update model YAMLs [bot]#1146

Merged
harshiv-26 merged 3 commits into main from bot/update-openai-20260522-021441 on May 22, 2026

Conversation

@models-bot
Contributor

models-bot (Bot) commented May 22, 2026

Auto-generated by poc-agent for provider openai.


Note

Low Risk
Low risk: updates are limited to model registry YAML metadata (pricing/limits/modalities/sources) and README counts, with no executable code changes.

Overview
Updates the model registry with refreshed metadata across providers: DeepInfra gemma-4-26B-A4B-it pricing adjustments and gemma-4-31B-it now advertising video input; Google Vertex gemini-3.1-flash-lite token limits bumped to 65536 and an additional source link added.

Adds new Qwen entries for OpenRouter (qwen/qwen3.7-max) and Together AI (Qwen/Qwen3.7-Max), and expands existing configs (e.g., OpenRouter inclusionai/ling-2.6-1t provisioning set; Together Qwen3.6-35B-A3B-FP8 adds tool_choice plus image/video modalities; Together gemma-4-E2B-it adds sources/status). README provider model counts are updated to reflect the new models.

Reviewed by Cursor Bugbot for commit 8a3df67. Bugbot is set up for automated code reviews on this repo.

@github-actions
Contributor

/test-models

@harshiv-26
Collaborator

Gateway test results

  • Total: 3
  • Passed: 0
  • Failed: 1
  • Validation failed: 0
  • Errored: 0
  • Skipped: 2
  • Success rate: 0.0%
Provider  Model                   Scenarios
openai    gpt-realtime-2          skipped: skip-check
openai    gpt-realtime-translate  skipped: skip-check
openai    gpt-realtime-whisper    failure: params
Failures (1)

openai/gpt-realtime-whisper — params (failure)

Error
Traceback (most recent call last):
  File "/tmp/tmpf3t1zdxq/snippet.py", line 9, in <module>
    response = client.audio.transcriptions.create(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_utils/_utils.py", line 286, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/resources/audio/transcriptions.py", line 333, in create
    return self._post(  # type: ignore[return-value]
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1259, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/openai/_base_client.py", line 1047, in request
    raise self._make_status_error_from_response(err.response) from None
openai.NotFoundError: Error code: 404 - {'status': 'failure', 'message': 'openai error: Invalid URL (POST /v1/audio/transcriptions)', 'error': {'message': 'openai error: Invalid URL (POST /v1/audio/transcriptions)', 'type': 'invalid_request_error', 'code': '404'}, 'provider': 'openai'}
Code snippet
# OpenAI-compatible STT (works with OpenAI and Groq)
from pathlib import Path
from openai import OpenAI

client = OpenAI(api_key="***", base_url="https://internal.devtest.truefoundry.tech/api/llm/api/llm")

audio_path = Path(__file__).parent / "hello.wav"
with open(audio_path, "rb") as audio_file:
    response = client.audio.transcriptions.create(
        model="test-v2-openai/gpt-realtime-whisper",
        file=audio_file,
    )

_stt_output = response.text
print(_stt_output)

assert _stt_output, "VALIDATION FAILED: Transcription returned empty text"
assert len(_stt_output.strip()) > 0, f"VALIDATION FAILED: Transcription is blank: '{_stt_output}'"
Skipped (2)

openai/gpt-realtime-2 — skip-check (skipped)

Skip reason
unsupported mode 'realtime'

openai/gpt-realtime-translate — skip-check (skipped)

Skip reason
unsupported mode 'realtime'

@github-actions
Contributor

/test-models

@harshiv-26
Collaborator

Gateway test results

  • Total: 3
  • Passed: 0
  • Failed: 0
  • Validation failed: 0
  • Errored: 0
  • Skipped: 3
  • Success rate: 0.0%
Provider  Model                   Scenarios
openai    gpt-realtime-2          skipped: skip-check
openai    gpt-realtime-translate  skipped: skip-check
openai    gpt-realtime-whisper    skipped: skip-check
Skipped (3)

openai/gpt-realtime-2 — skip-check (skipped)

Skip reason
unsupported mode 'realtime'

openai/gpt-realtime-translate — skip-check (skipped)

Skip reason
unsupported mode 'unsupported'

openai/gpt-realtime-whisper — skip-check (skipped)

Skip reason
unsupported mode 'unsupported'
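
The change between the first and second runs (gpt-realtime-whisper going from a 404 on /v1/audio/transcriptions to a skip) is consistent with a mode check that runs before any request is sent. A minimal sketch of such a check follows. All names in it are hypothetical (`SUPPORTED_MODES`, `skip_reason`, and every mode string other than those quoted in the skip reasons), and the actual harness code is not shown in this thread.

```python
from typing import Optional

# Hypothetical set of modes the test harness can exercise (not from the repo).
SUPPORTED_MODES = {"chat", "embedding", "audio_transcription"}


def skip_reason(mode: str) -> Optional[str]:
    """Return a skip reason for a mode the harness cannot test, else None."""
    if mode in SUPPORTED_MODES:
        return None
    return f"unsupported mode '{mode}'"


# Modes as reported in the skip reasons of the second run.
reported_modes = {
    "openai/gpt-realtime-2": "realtime",
    "openai/gpt-realtime-translate": "unsupported",
    "openai/gpt-realtime-whisper": "unsupported",
}

for model, mode in reported_modes.items():
    reason = skip_reason(mode)
    status = f"skipped: {reason}" if reason else "run"
    print(f"{model}: {status}")
```

Checking the mode first means a realtime-only model is never sent to an endpoint that cannot serve it, so the report shows a skip instead of a provider-side 404.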

harshiv-26 enabled auto-merge (squash) May 22, 2026 09:27
@github-actions
Contributor

/test-models

harshiv-26 merged commit 230f541 into main May 22, 2026
8 checks passed
harshiv-26 deleted the bot/update-openai-20260522-021441 branch May 22, 2026 09:28
@harshiv-26
Collaborator

Gateway test results

  • Total: 3
  • Passed: 0
  • Failed: 0
  • Validation failed: 0
  • Errored: 0
  • Skipped: 3
  • Success rate: 0.0%
Provider  Model                   Scenarios
openai    gpt-realtime-2          skipped: skip-check
openai    gpt-realtime-translate  skipped: skip-check
openai    gpt-realtime-whisper    skipped: skip-check
Skipped (3)

openai/gpt-realtime-2 — skip-check (skipped)

Skip reason
unsupported mode 'realtime'

openai/gpt-realtime-translate — skip-check (skipped)

Skip reason
unsupported mode 'unsupported'

openai/gpt-realtime-whisper — skip-check (skipped)

Skip reason
unsupported mode 'unsupported'
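
Both all-skipped runs report a success rate of 0.0% with no failures. One plausible reading, stated here as an assumption since the thread does not show the formula, is that the rate is passed divided by total, with skipped tests counted in the total. A sketch of that arithmetic with a hypothetical `summarize` helper:

```python
from collections import Counter


def summarize(outcomes):
    """Tally outcomes; the rate formula (passed / total) is an assumption."""
    counts = Counter(outcomes)
    total = len(outcomes)
    # Guard against an empty run to avoid dividing by zero.
    rate = 100.0 * counts["passed"] / total if total else 0.0
    return {
        "total": total,
        "passed": counts["passed"],
        "failed": counts["failed"],
        "skipped": counts["skipped"],
        "success_rate": round(rate, 1),
    }


# Outcomes matching the first run and the two later runs in this thread.
first_run = summarize(["skipped", "skipped", "failed"])
later_runs = summarize(["skipped", "skipped", "skipped"])
print(first_run)
print(later_runs)
```

Under this reading, a run with one pass and one skip would report 50.0%, so skips lower the rate even though nothing failed.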
