OpenAI

Whisper

Whisper is a general-purpose speech recognition model developed by OpenAI and made available via the OpenAI API under the model ID whisper-1. It was trained on a large dataset of diverse audio, enabling it to handle a wide range of accents, background noise conditions, and technical vocabulary. What distinguishes Whisper is its multitask design: it can perform not only speech-to-text transcription but also speech translation into English and automatic language identification within a single model. Whisper is well suited for developers building transcription pipelines, subtitle generation tools, voice interfaces, or any application that requires converting spoken audio into structured text. It supports multilingual input, making it useful for global applications where audio may arrive in different languages. The model accepts common audio formats and returns transcriptions or translations as plain text or with optional timestamps.

Speech Transcription, Speech Translation, Language Identification, Timestamp Output, Audio Format Support

Model Overview

Provider

The entity that provides this model.

OpenAI

Input Context Window

The number of tokens supported by the input context window.

N/A (Whisper takes audio input and has no token-based context window)

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

N/A

Open Source

Whether the model's code is available for public use.

Release Date

When the model was first released.

Unknown

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

OpenAI API

Modalities

Types of data this model can process.

Text, Audio

Capabilities

What Whisper supports

Speech Transcription

Converts spoken audio into written text, supporting a wide range of languages, accents, and audio quality levels.
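
A minimal sketch of a transcription request, assuming the official `openai` Python package (v1 or later) and an `OPENAI_API_KEY` environment variable. The helper name `transcribe_file` and its injectable `client` argument are illustrative and not part of the API.

```python
from pathlib import Path


def transcribe_file(path, client=None, language=None):
    """Send one audio file to the transcription endpoint and return its text.

    `client` is injectable so the helper can be exercised without network
    access; by default an OpenAI client is built from OPENAI_API_KEY.
    """
    if client is None:
        from openai import OpenAI  # assumes `pip install openai` (v1+)
        client = OpenAI()

    # Optional language hint (e.g. "en"); omit it to let the model detect.
    extra = {"language": language} if language else {}

    with Path(path).open("rb") as audio:
        result = client.audio.transcriptions.create(
            model="whisper-1",
            file=audio,
            **extra,
        )
    return result.text
```

Calling `transcribe_file("meeting.mp3")` would return the transcript as a string; the file name here is a placeholder.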

Speech Translation

Translates spoken audio from supported non-English languages directly into English text in a single pass.
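
Translation uses a separate endpoint from transcription. The sketch below assumes the official `openai` Python package (v1 or later); `translate_to_english` is a hypothetical helper name, and the `client` argument exists only so the function can be tested offline.

```python
def translate_to_english(path, client=None):
    """Translate non-English speech in an audio file into English text."""
    if client is None:
        from openai import OpenAI  # assumes `pip install openai` (v1+)
        client = OpenAI()

    with open(path, "rb") as audio:
        # The translations endpoint always targets English output.
        result = client.audio.translations.create(
            model="whisper-1",
            file=audio,
        )
    return result.text
```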

Language Identification

Automatically detects the language spoken in an audio file without requiring the caller to specify it in advance.
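
One way to read the detected language, sketched under the assumption that the `verbose_json` response format is requested through the official `openai` Python package. `detect_language` is a hypothetical helper, and the exact form of the returned value (language name or code) should be checked against the API reference.

```python
def detect_language(path, client=None):
    """Return the language the API reports for an audio file."""
    if client is None:
        from openai import OpenAI  # assumes `pip install openai` (v1+)
        client = OpenAI()

    with open(path, "rb") as audio:
        result = client.audio.transcriptions.create(
            model="whisper-1",
            file=audio,
            response_format="verbose_json",  # includes metadata beyond text
        )
    return result.language
```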

Timestamp Output

Optionally returns word- or segment-level timestamps alongside transcribed text, useful for subtitle and caption generation.
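
To illustrate how segment-level timestamps map onto subtitle cues, the sketch below converts a list of segments into SubRip (SRT) text. It assumes each segment has already been turned into a dict with `start` and `end` in seconds and a `text` string; the function names are illustrative.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Build SRT text from dicts with 'start', 'end' (seconds) and 'text'."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg["start"])
        end = srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # A blank line separates consecutive cues.
    return "\n".join(blocks)
```

The API can also return `srt` or `vtt` output directly through its `response_format` parameter, so a manual conversion like this is mainly useful when cues need to be merged, split, or restyled first.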

Audio Format Support

Accepts multiple common audio formats including mp3, mp4, mpeg, mpga, m4a, wav, and webm via the API.
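
A small pre-upload check based on the formats listed above. It only inspects the file extension, not the actual container or codec, and the names `SUPPORTED_EXTENSIONS` and `is_supported_audio` are illustrative.

```python
from pathlib import Path

# Extensions taken from the format list above.
SUPPORTED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}


def is_supported_audio(path):
    """Return True if the file extension is one of the listed formats."""
    extension = Path(path).suffix.lower().lstrip(".")
    return extension in SUPPORTED_EXTENSIONS
```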

Pricing for Whisper

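Whisper is billed per minute of audio processed, not per token, so the per-token price fields used for text models do not apply directly.

Input: $0.01 (shown on this listing under a per-million-token label; confirm the current per-minute rate on OpenAI's pricing page)

Output: N/A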

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Whisper API Reference (Documentation)

Whisper Model Card & Guide (Documentation)

Whisper Research Paper, arXiv (Research)

Whisper GitHub Repository (Open Source)

OpenAI Whisper Announcement (Announcements)

Official Website

Usage Policies

Enterprise privacy at OpenAI

OpenAI Status Page

AI tools related to Whisper

These tools are strongly connected to Whisper through direct product references, provider mentions, or explicit model mappings.

AI API

GoWhisper

GoWhisper is a privacy-focused, cross-platform desktop application designed for local audio transcription. It enables precise speech-to-text conversion while keeping your data secure. Key features include unlimited offline transcription, YouTube link processing, voice recording, and multiple export formats.

Free

AI Writing Assistants

Whisper

Whisper is a general-purpose speech recognition model from OpenAI. Trained on a vast and diverse audio dataset, this multi-task model handles multilingual speech recognition, speech translation, and language identification. By utilizing a Transformer sequence-to-sequence architecture, Whisper performs various speech processing tasks—including voice activity detection—as a sequence of predicted tokens. This approach allows a single model to replace multiple stages of a traditional speech-processing pipeline using special tokens to specify tasks.

Free

AI API

Recos

Recos is a web application that transcribes audio files into text using OpenAI's Whisper API. Users can either provide their own OpenAI API key or log in to use the platform's built-in credits. New users are granted 20 free credits, and the tool supports audio files up to 100MB.

Free

AI Assistant

Brilliant Labs

Brilliant Labs is an open-source ecosystem that develops pocket-sized AR glasses, such as Frame, integrated with generative AI. The platform provides developers and creatives with tools like arGPT—an iOS application for the Monocle AR glasses—to facilitate access to generative AI chat features and custom application development.

Free

FAQ

Common questions about Whisper

What is the maximum audio file size Whisper accepts via the API?

The OpenAI API enforces a 25 MB file size limit per audio file submitted to the Whisper endpoint.
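
A file can be checked against this limit before upload. The sketch below assumes "25 MB" means 25 × 1024 × 1024 bytes, which is an interpretation rather than a documented figure; `fits_upload_limit` is a hypothetical helper.

```python
import os

# Assumption: "25 MB" is read as 25 MiB; the API may count bytes differently.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024


def fits_upload_limit(path, limit=MAX_UPLOAD_BYTES):
    """Return True if the file at `path` is no larger than `limit` bytes."""
    return os.path.getsize(path) <= limit
```

Files that exceed the limit typically need to be compressed or split into shorter pieces before being submitted.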

Does Whisper have a context window like text models?

Whisper takes audio rather than text as input, so it has no token-based input context window in the sense used for text models. Internally, audio is processed in segments (30-second windows in the open-source model).

What languages does Whisper support for transcription?

Whisper supports transcription in dozens of languages. It was trained on multilingual audio data and can identify and transcribe many of the world's most widely spoken languages.

Can Whisper translate languages other than English into English?

Yes. Whisper's translation capability converts spoken audio in supported non-English languages into English text. Translation into languages other than English is not supported by the model.

How is Whisper priced on the OpenAI API?

Whisper is billed per minute of audio processed. Pricing details are published on OpenAI's pricing page and may change over time.
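
Per-minute billing makes cost estimation a simple multiplication. In the sketch below the rate is supplied by the caller, since the current figure has to come from the pricing page; billing granularity and rounding are assumptions, and `estimate_transcription_cost` is an illustrative name.

```python
def estimate_transcription_cost(duration_seconds, rate_per_minute):
    """Estimate the charge for one audio file under per-minute billing.

    `rate_per_minute` is a caller-supplied value taken from the provider's
    pricing page. Fractional minutes are billed proportionally here, which
    is an assumption about billing granularity.
    """
    if duration_seconds < 0 or rate_per_minute < 0:
        raise ValueError("duration and rate must be non-negative")
    minutes = duration_seconds / 60
    return round(minutes * rate_per_minute, 6)
```

For example, with a placeholder rate of 0.01 per minute, a 90-second clip comes to 0.015.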
