Real-Time Web Search
Grounds every response in live internet data retrieved at query time, rather than relying on static training knowledge alone.
Sonar is Perplexity AI's in-house text generation model, built on Meta's Llama 3.3 70B and optimized for web-grounded question answering. Released in January 2025, it retrieves live internet data at query time rather than relying solely on static training knowledge, and every response includes inline source citations for transparency. It supports a 128,000-token context window and runs at approximately 121 tokens per second using Cerebras wafer-scale inference. Sonar is designed for developers and businesses that need to embed fast, factual, and source-backed search capabilities into their own applications. It offers three search depth modes — High, Medium, and Low — allowing teams to balance thoroughness against response speed depending on their use case. On the SimpleQA benchmark, Sonar achieved an F-score of 0.773, reflecting its focus on factual accuracy. It is particularly well-suited for high-volume applications such as sales research tools, medical information platforms, and real-time in-meeting search features.
High-signal model metadata in a structured two-column overview table.
The entity that provides this model.
The routed model identifier exposed by upstream providers.
The number of tokens supported by the input context window.
The number of tokens that can be generated by the model in a single request.
Whether the model's code is available for public use.
When the model was first released.
When the model's knowledge was last updated.
The providers that offer this model. This is not an exhaustive list.
Types of data this model can process.
A fuller summary of positioning, capabilities, and source-specific details for Sonar.
Sonar is Perplexity AI's in-house text generation model, built on Meta's Llama 3.3 70B and optimized for web-grounded question answering. Released in January 2025, it retrieves live internet data at query time rather than relying solely on static training knowledge, and every response includes inline source citations for transparency. It supports a 128,000-token context window and runs at approximately 121 tokens per second using Cerebras wafer-scale inference.
Sonar is designed for developers and businesses that need to embed fast, factual, and source-backed search capabilities into their own applications. It offers three search depth modes — High, Medium, and Low — allowing teams to balance thoroughness against response speed depending on their use case. On the SimpleQA benchmark, Sonar achieved an F-score of 0.773, reflecting its focus on factual accuracy. It is particularly well-suited for high-volume applications such as sales research tools, medical information platforms, and real-time in-meeting search features.
Grounds every response in live internet data retrieved at query time, rather than relying on static training knowledge alone.
Automatically includes inline citations with each answer, linking responses directly to their source URLs for verifiability.
Supports a 128,000-token context window, enabling extended conversations and analysis of long documents within a single request.
Achieves approximately 121 tokens per second using Cerebras wafer-scale inference, enabling sub-second response times for high-volume workloads.
Offers High, Medium, and Low search depth modes so developers can tune the balance between answer thoroughness and response latency.
Available via the Sonar API, allowing developers to embed generative search directly into their own products without building retrieval infrastructure.
Primary API pricing shown in the same “quick compare” spirit as the reference page.
Additional usage-cost dimensions synced into the project for this model.
Places where this model is available, based on the synced detail-page metadata.
Endpoint-level provider data currently available for this model.
The configurable options currently documented for this model.
Determines whether or not a request to an online model should return citations.
Determines whether or not a request to an online model should return images.
Controls how much web information is retrieved. Higher context provides more comprehensive results but costs more per request.
Parameters currently listed by OpenRouter or the local catalog for this model.
Benchmark scores synced from the current model source and normalized into the local catalog.
| Benchmark | Score |
|---|---|
|
AIME 2024
American math olympiad problems
|
|
|
GPQA Diamond
PhD-level science questions (biology, physics, chemistry)
|
|
|
HLE
Questions that challenge frontier models across many domains
|
|
|
LiveCodeBench
Real-world coding tasks from recent competitions
|
|
|
MATH-500
Undergraduate and competition-level math problems
|
|
|
MMLU-Pro
Expert knowledge across 14 academic disciplines
|
|
|
SciCode
Scientific research coding and numerical methods
|
Official model cards, release notes, docs, and other references synced from the source page.
Sonar discussions are most active in r/DispatchAdHoc, r/todayilearned, r/worldnews. Top Reddit threads cluster around benchmark and model-comparison threads.
The strongest match in this snapshot has 74036 upvotes and 375 comments.
Here are my arguments:
* **Thematically, it makes sense:** Coupé was one of the first Z-Teamers you actually meet, and she's very friendly to Robert; it doesn't really make sense to me that Robert would fire one of the first people he's friendly towards
* **It makes more sense to me that Sonar would join the Red Ring:** He's a far-right nutjob who takes cult-like fandom seriously. I could see him falling in line with Shroud's manipulation and money over Coupé, who has a level of professionalism with how she does things
* **Fight scene looks cooler:** Giant bat boss, what's not to like about it?
* **Coupé scene mocking Blonde Blazer is weird:** IDK, hearing "Ok mom, she's bluffing" from Coupé feels awkward, she has goofy moments, but she's still a serious character. Then you have Sonar, a goofy character that blows a raspberry at Blazer, which I feel fits way better.
IDK, just my personal take.
I just want a quality headset with quality SOFTWARE. Which company (Logitech, Razer, HyperX etc) have the best software? And by "best" I mean simple, RELIABLE. I just want sound and mic for gaming, not too much to ask! (when not on the headset I have AudioEngine A5+ speakers)
I have an Arctis 7 headset which worked perfectly for years with the SteelSeries 'Engine' software, until the Sonar software turned up. Now it just seems to be a nightmare - constantly having to go in to the settings because it's not detecting my headset, chatmix balance is bugged, and other miscellaneous stuff. I'm on a Windows 11 PC with modern hardware.
Think about growing up as a bat from the neck up. Yeah, people with powers seem to be generally accepted in the Dispatch world, but Sonar didn’t have privilege of looking like a normal human like most of the Z-team (sans Golem/Malevola). Even if he wasn’t born with his powers and got them around puberty, I can’t imagine he didn’t get shit from his peers over his appearance. Not to mention occasionally turning full bat monster. And maybe that led him to use his smarts as a defense mechanism and turn to drugs to cope.
Sonar supports a context window of 128,000 tokens, which allows for extended conversations and analysis of lengthy documents in a single request.
Sonar retrieves live web data at query time, so its answers are not limited to a static training cutoff. The model itself was launched in January 2025, and its underlying Llama 3.3 70B base has its own training data cutoff, but real-time search supplements this with current information.
Pricing details for Sonar via the Sonar API are available on Perplexity's official API overview page at sonar.perplexity.ai. Sonar is also used to power Perplexity's free consumer tier.
Sonar is built on Meta's Llama 3.3 70B and has been optimized by Perplexity AI for web-grounded, real-time question answering with citation support.
On the SimpleQA benchmark, which tests factual accuracy in language models, Sonar achieved an F-score of 0.773.
Continue browsing adjacent models from the same provider.