OpenAI

Whisper

Whisper is a general-purpose speech recognition model developed by OpenAI and made available via the OpenAI API under the model ID whisper-1. It was trained on a large dataset of diverse audio, enabling it to handle a wide range of accents, background noise conditions, and technical vocabulary. What distinguishes Whisper is its multitask design: it can perform not only speech-to-text transcription but also speech translation into English and automatic language identification within a single model. Whisper is well suited for developers building transcription pipelines, subtitle generation tools, voice interfaces, or any application that requires converting spoken audio into structured text. It supports multilingual input, making it useful for global applications where audio may arrive in different languages. The model accepts common audio formats and returns transcriptions or translations as plain text or with optional timestamps.

Unknown N/A context N/A output
Speech Transcription Speech Translation Language Identification Timestamp Output Audio Format Support

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

OpenAI

Input Context Window

The number of tokens supported by the input context window.

N/A tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

OpenAI API

Modalities

Types of data this model can process.

Text Audio

What is Whisper

A fuller summary of positioning, capabilities, and source-specific details for Whisper.

Whisper is a general-purpose speech recognition model developed by OpenAI and made available via the OpenAI API under the model ID whisper-1. It was trained on a large dataset of diverse audio, enabling it to handle a wide range of accents, background noise conditions, and technical vocabulary. What distinguishes Whisper is its multitask design: it can perform not only speech-to-text transcription but also speech translation into English and automatic language identification within a single model.

Whisper is well suited for developers building transcription pipelines, subtitle generation tools, voice interfaces, or any application that requires converting spoken audio into structured text. It supports multilingual input, making it useful for global applications where audio may arrive in different languages. The model accepts common audio formats and returns transcriptions or translations as plain text or with optional timestamps.

Capabilities

What Whisper supports

AI

Speech Transcription

Converts spoken audio into written text, supporting a wide range of languages, accents, and audio quality levels.

AI

Speech Translation

Translates spoken audio from supported non-English languages directly into English text in a single pass.

AI

Language Identification

Automatically detects the language spoken in an audio file without requiring the caller to specify it in advance.

AI

Timestamp Output

Optionally returns word- or segment-level timestamps alongside transcribed text, useful for subtitle and caption generation.

AUD

Audio Format Support

Accepts multiple common audio formats including mp3, mp4, mpeg, mpga, m4a, wav, and webm via the API.

Pricing for Whisper

Primary API pricing shown in the same “quick compare” spirit as the reference page.

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

OpenAI API

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Community discussion

What people think about Whisper

Whisper discussions are most active in r/pics, r/aww, r/SonicTheHedgehog. The strongest match in this snapshot has 96547 upvotes and 1854 comments.

r/whisper 35 upvotes 9 comments January 5, 2026
Bye, whisper-like apps...

So I was there at the start of whisper when it was a pure shower thought app, reconnected during the dark days where it was a scam and hookup app, and stayed until the end where it was a lifeless husk of an app that only worked intermittently.

They were fun days, talked to a lot of fun, fucked up and some disturbed people, and had a lot of fun talking to Nigerian scammers on the phone.

I migrated to some of the replacements and I've had some fun. I'm not going to badmouth them, they're all doing a decent job at recreating the experience under very tricky circumstances, it's not easy to monetise AND innovate AND keep everyone safe, users moan a lot and expect the moon on a stick without parting with a penny.

But, let's be brutally honest, Whisper at the end were the dregs of the community that made it special. These new apps are competing for the few of those dregs that didn't give up the dream. I'm sure there are a few good people on there, but for every good one, there are a hundred posting the same horny shit 20 times a day, not authentically engaging with the community, just there to DM any woman with a pulse or post ragebait.

I've noticed that at the end of 2025 some of the good folks have stopped posting altogether, as I've looked over the past few days there have been no interesting posts to engage with. The reanimated corpse is bereft of life, however much it wants to pretend that more American politic ragebait shows genuine engaged users.

So I just wanted to say goodbye to the idea of Whisper. You were lightning in a bottle and built a fantastic community of freaks. You are sorely missed.

For those of you clinging to the dream, I wish you well. The replacement apps themselves aren't that terrible when you consider the things I said above. I'm sure with sustained effort and some new blood, the anonymous shower thought app can make a comeback for a new generation.

Thank you for reading my rant and my thoughts. To anyone out there that recalls talking to plonker, thank you for the good times and I wish you well x

Open Reddit thread

War perk -> Grim Retribution, Collosal Foe and Maligant Invasion

Everytime you get an whisper gain, you have a higher chance to get ambush. So if you complete a Whisper objective even if its 1 point, you can get ambushed at a high rate.

These ambush drop the rewards of the whsiper cache. What inside whisper cache now is gift of tree :).

So you can accelerate gettingthem

Open Reddit thread
r/SpicyRomanceBooks 12 upvotes 49 comments July 23, 2025
Whisper Stories anyone?

Saw an ad for this on FB. Has anyone checked it out yet? I’m an audio book only “reader”. Do I really need another subscription account? Worth it?

Open Reddit thread
View more discussions →
FAQ

Common questions about Whisper

What is the maximum audio file size Whisper accepts via the API?

The OpenAI API enforces a 25 MB file size limit per audio file submitted to the Whisper endpoint.

Does Whisper have a context window like text models?

Whisper is an audio model, not a text model, so it does not have a token-based context window. Audio inputs are processed in segments internally.

What languages does Whisper support for transcription?

Whisper supports transcription in dozens of languages. It was trained on multilingual audio data and can identify and transcribe many of the world's most widely spoken languages.

Can Whisper translate languages other than English into English?

Yes. Whisper's translation capability converts spoken audio in supported non-English languages into English text. Translation into languages other than English is not supported by the model.

How is Whisper priced on the OpenAI API?

Whisper is billed per minute of audio processed. Pricing details are published on OpenAI's pricing page and may change over time.

More models from OpenAI

Continue browsing adjacent models from the same provider.

← All AI Models