[Feature]: Support OpenAI speech-to-text interface `v1/audio/[transcriptions,translations]` #12130

mgoin · 2025-01-16T20:12:22Z

🚀 The feature, motivation and pitch

Now that we have support for Whisper (#11280), we should consider implementing OpenAI's explicit speech-to-text API. The documentation is here: https://platform.openai.com/docs/guides/speech-to-text

Example of `v1/audio/transcriptions`

from openai import OpenAI

client = OpenAI()

# Open the audio in binary mode; the context manager closes the file afterwards.
with open("/path/to/file/audio.mp3", "rb") as audio_file:
    transcription = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
    )

print(transcription.text)

Example of `v1/audio/translations`

from openai import OpenAI

client = OpenAI()

# The translations endpoint takes non-English audio and returns English text.
with open("/path/to/file/german.mp3", "rb") as audio_file:
    translation = client.audio.translations.create(
        model="whisper-1",
        file=audio_file,
    )

print(translation.text)
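
For anyone looking at what a server would need to accept: as far as I understand the OpenAI API, both SDK calls above are `multipart/form-data` POSTs carrying at least a `file` and a `model` field. Below is a rough, standard-library-only sketch that builds such a request without sending it. The helper name `build_stt_request` and the `http://localhost:8000` base URL are made up for illustration; they are not part of vLLM or the OpenAI SDK.

```python
import io
import urllib.request
import uuid


def build_stt_request(base_url: str, task: str, audio: bytes,
                      filename: str, model: str) -> urllib.request.Request:
    """Build (but do not send) a multipart POST for a speech-to-text route.

    Hypothetical helper for illustration only.
    """
    if task not in ("transcriptions", "translations"):
        raise ValueError(f"unknown task: {task}")

    boundary = uuid.uuid4().hex
    body = io.BytesIO()

    # Text field: the model name.
    body.write(f"--{boundary}\r\n".encode())
    body.write(b'Content-Disposition: form-data; name="model"\r\n\r\n')
    body.write(model.encode() + b"\r\n")

    # File field: the raw audio bytes.
    body.write(f"--{boundary}\r\n".encode())
    body.write(
        f'Content-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\n'.encode()
    )
    body.write(b"Content-Type: application/octet-stream\r\n\r\n")
    body.write(audio + b"\r\n")

    # Closing boundary ends the multipart body.
    body.write(f"--{boundary}--\r\n".encode())

    url = f"{base_url.rstrip('/')}/v1/audio/{task}"
    return urllib.request.Request(
        url,
        data=body.getvalue(),
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


# Assumed local server address; nothing is sent over the network here.
req = build_stt_request(
    "http://localhost:8000", "transcriptions", b"fake-audio", "audio.mp3", "whisper-1"
)
print(req.get_method(), req.full_url)
```

Passing `"translations"` as the task targets the second route with the same body shape, which is why the two endpoints could plausibly share most of a server-side implementation.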

Alternatives

No response

Additional context

No response

Before submitting a new issue...

Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Temirulan · 2025-01-28T01:57:37Z

I believe this PR is related to this feature request

mgoin added the feature request label Jan 16, 2025

DarkLight1337 added the help wanted label Jan 17, 2025
