Reka

Reka Edge Deprecated

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...

Mar 20, 2026 16.4K context 128,000 tokens output
Text Image Video Tools Structured Output

Model Overview

High-signal model metadata in a structured two-column overview table.

Provider

The entity that provides this model.

Reka

Model ID

The routed model identifier exposed by upstream providers.

rekaai/reka-edge

Input Context Window

The number of tokens supported by the input context window.

16.4K tokens

Maximum Output Tokens

The number of tokens that can be generated by the model in a single request.

128,000 tokens tokens

Open Source

Whether the model's code is available for public use.

No

Release Date

When the model was first released.

Mar 20, 2026 2 months ago

Knowledge Cut-off Date

When the model's knowledge was last updated.

Unknown

API Providers

The providers that offer this model. This is not an exhaustive list.

Reka

Modalities

Types of data this model can process.

Text Image Video

What is Reka Edge Deprecated

A fuller summary of positioning, capabilities, and source-specific details for Reka Edge Deprecated.

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...

Capabilities

What Reka Edge Deprecated supports

JSON

Structured Outputs

Structured output settings are exposed through OpenRouter for schema-driven or format-controlled responses.

TL

Tool Calling

Tool invocation and tool selection are supported in the routed OpenRouter interface for this model.

MM

Multimodal I/O

This model accepts text input, image input, video input and returns text output.

CTX

Large Context Window

OpenRouter currently lists a context window of 16.4K with up to 128,000 tokens maximum output tokens.

Pricing for Reka Edge Deprecated

Primary API pricing shown in the same “quick compare” spirit as the reference page.

Price Comparison

Additional usage-cost dimensions synced into the project for this model.

maxTemperature 1
maxResponseSize 128,000 tokens

API Access & Providers

Places where this model is available, based on the synced detail-page metadata.

Reka

Provider Endpoints

Endpoint-level provider data currently available for this model.

Reka

Max output: 16,384 1d uptime: 100.0% Supported params: 11 Implicit caching: No

Resources & Documentation

Official model cards, release notes, docs, and other references synced from the source page.

Community discussion

What people think about Reka Edge Deprecated

Reka Edge Deprecated discussions are most active in r/LocalLLaMA, r/StableDiffusion, r/artificial. Top Reddit threads cluster around benchmark and model-comparison threads.

The strongest match in this snapshot has 197 upvotes and 7 comments.

r/LocalLLaMA 77 upvotes 30 comments March 11, 2026
RekaAI/reka-edge-2603 · Hugging Face

**Reka Edge** is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding, video analysis, object detection, and agentic tool-use.

[https://reka.ai/news/reka-edge-frontier-level-edge-intelligence-for-physical-ai](https://reka.ai/news/reka-edge-frontier-level-edge-intelligence-for-physical-ai)

Open Reddit thread
r/LocalLLaMA 31 upvotes 7 comments April 23, 2026
Reka Edge 2603 multimodal support has been merged into llama.cpp

Hi r/LocalLLaMA! I work at Reka and organized our [AMA](https://www.reddit.com/r/LocalLLaMA/comments/1s3eih5/ama_with_the_reka_ai_team/) last month. Some of y'all have asked for llama.cpp support - this is a follow-up to let you know that Reka Edge 2603 is now supported upstream in llama.cpp.

To get started:

* Use the Reka Edge 2603 [weights](https://huggingface.co/RekaAI/reka-edge-2603) from the HF repo
* Run the [GGUF conversion script](https://huggingface.co/RekaAI/reka-edge-2603/blob/main/convert_reka_vlm_to_gguf.py) from the llama.cpp repo root
* (optional) Use the[ quantization script](https://huggingface.co/RekaAI/reka-edge-2603/blob/main/quantize_reka_q4_last8_q8.sh) for the text decoder

One note: the model does not currently support reasoning, so run llama-server with \`--reasoning off\`. Happy hacking!

Open Reddit thread
r/LocalLLaMA 21 upvotes 32 comments March 25, 2026
AMA with the Reka AI team

https://preview.redd.it/3q803tkzr7rg1.png?width=1024&format=png&auto=webp&s=392a4324bdd55a31d22689f8e0dd9d591683ddfc

Dear [r/LocalLLaMA](https://www.reddit.com/r/LocalLLaMA/), greetings from the Reka AI team!

We're a research lab with a focus on creating models that are useful for physical, real-world use cases. We're looking forward to hosting our first AMA and chatting about our latest model, our research direction, and anything else under the sun. We've just released our Reka Edge vision language model and we're looking to add new capabilities to generate and act in the physical world in our next model. Let us know what you'd like to see from us!

Joining us for the AMA are the research leads for our latest Reka Edge model:

* [u/MattiaReka](https://www.reddit.com/user/MattiaReka/)
* [u/Puzzled-Appeal-6478](https://www.reddit.com/user/Puzzled-Appeal-6478/)
* [u/donovan\_agi](https://www.reddit.com/user/donovan_agi/)

And [u/Available\_Poet\_6387](https://www.reddit.com/user/Available_Poet_6387/) who works on API and inference.

We'll be here on Wednesday, 25th March from 10am to 12pm PST, and will continue to answer questions async after the AMA is over. You can reach us on [Discord](https://link.reka.ai/discord) and check us out at [our website](https://reka.ai/), [playground](https://app.reka.ai), or [clipping app](https://creator.reka.ai/).

>Aaand that's a wrap! Thank you for all your questions - we enjoyed learning about your cat flap use cases and picked up some Polish along the way. Please continue to post questions - we'll continue to monitor this page and reply when we can. We look forward to sharing more news of future developments like GGUF and quantized versions, and upcoming models. Feel free to reach out to us on [Discord](https://link.reka.ai/discord) or on [X](https://x.com/RekaAILabs)!

Open Reddit thread
r/StableDiffusion 197 upvotes 7 comments February 16, 2024
This week in AI - all the Major AI developments in a nutshell

1. **Meta AI** introduces ***V-JEPA*** (Video Joint Embedding Predictive Architecture), a method for teaching machines to understand and model the physical world by watching videos. Meta AI releases a collection of V-JEPA vision models trained with a feature prediction objective using self-supervised learning. The models are able to understand and predict what is going on in a video, even with limited information \[[*Details*](https://ai.meta.com/blog/v-jepa-yann-lecun-ai-model-video-joint-embedding-predictive-architecture/) | [*GitHub*](https://github.com/facebookresearch/jepa)\].
2. **Open AI** introduces ***Sora***, a text-to-video model that can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions \[[*Details + sample videos*](https://openai.com/sora)[ ](https://openai.com/sora)| [*Report*](https://openai.com/research/video-generation-models-as-world-simulators)\].
3. **Google** announces their next-generation model, **Gemini 1.5,** that uses a new [Mixture-of-Experts](https://arxiv.org/abs/1701.06538) (MoE) architecture. The first Gemini 1.5 model being released for early testing is ***Gemini 1.5 Pro*** with a context window of up to 1 million tokens, which is the longest context window of any large-scale foundation model yet. 1.5 Pro can perform sophisticated understanding and reasoning tasks for different modalities, including video and it performs at a similar level to 1.0 Ultra \[[*Details*](https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/#gemini-15) *|*[*Tech Report*](https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf)\].
4. Reka introduced **Reka Flash,** a new 21B multimodal and multilingual model trained entirely from scratch that is competitive with Gemini Pro & GPT 3.5 on key language & vision benchmarks. Reka also present a compact variant Reka Edge , a smaller and more efficient model (7B) suitable for local and on-device deployment. Both models are in public beta and available in [**Reka Playground** ](https://chat.reka.ai/chat)\[[*Details*](https://reka.ai/reka-flash-an-efficient-and-capable-multimodal-language-model)\].
5. **Cohere** For AI released ***Aya***, a new open-source, massively multilingual LLM & dataset to help support under-represented languages. Aya outperforms existing open-source models and covers 101 different languages – more than double covered by previous models \[[*Details*](https://cohere.com/research/aya)\].
6. **BAAI** released ***Bunny***, a family of lightweight but powerful multimodal models. Bunny-3B model built upon SigLIP and Phi-2 outperforms the state-of-the-art MLLMs, not only in comparison with models of similar size but also against larger MLLMs (7B), and even achieves performance on par with LLaVA-13B \[[*Details*](https://github.com/BAAI-DCAI/Bunny)\].
7. **Amazon** introduced a text-to-speech (TTS) model called ***BASE TTS*** (Big Adaptive Streamable TTS with Emergent abilities). BASE TTS is the largest TTS model to-date, trained on 100K hours of public domain speech data and exhibits “emergent” qualities improving its ability to speak even complex sentences naturally \[[*Details*](https://techcrunch.com/2024/02/14/largest-text-to-speech-ai-model-yet-shows-emergent-abilities/) | [*Paper*](https://assets.amazon.science/6e/82/1d037a4243c9a6cf4169895482d5/base-tts-lessons-from-building-a-billion-parameter-text-to-speech-model-on-100k-hours-of-data.pdf)\].
8. **Stability AI** released ***Stable Cascade*** in research preview, a new text to image model that is exceptionally easy to train and finetune on consumer hardware due to its three-stage architecture. Stable Cascade can also generate image variations and image-to-image generations. In addition to providing checkpoints and inference scripts, Stability AI has also released scripts for finetuning, ControlNet, and LoRA training \[[*Details*](https://stability.ai/news/introducing-stable-cascade)\].
9. **Researchers** from UC berkeley released ***Large World Model (LWM)***, an open-source general-purpose large-context multimodal autoregressive model, trained from LLaMA-2, that can perform language, image, and video understanding and generation. LWM answers questions about 1 hour long YouTube video even if GPT-4V and Gemini Pro both fail and can retriev facts across 1M context with high accuracy \[[*Details*](https://largeworldmodel.github.io/)\].
10. **GitHub** opens applications for the next cohort of ***GitHub Accelerator program*** with a focus on funding the people and projects that are building ***AI-based solutions*** under an open source license \[[*Details*](https://github.blog/2024-02-13-powering-advancements-of-ai-in-the-open-apply-now-to-github-accelerator)\].
11. **NVIDIA** released ***Chat with RTX***, a locally running (Windows PCs with specific NVIDIA GPUs) AI assistant that integrates with your file system and lets you chat with your notes, documents, and videos using open source models \[[*Details*](https://www.nvidia.com/en-us/ai-on-rtx/chat-with-rtx-generative-ai)\].
12. **Open AI** is testing ***memory with ChatGPT***, enabling it to remember things you discuss across all chats. ChatGPT's memories evolve with your interactions and aren't linked to specific conversations. It is being rolled out to a small portion of ChatGPT free and Plus users this week \[[*Details*](https://openai.com/blog/memory-and-new-controls-for-chatgpt)\].
13. **BCG X** released of ***AgentKit***, a LangChain-based starter kit (NextJS, FastAPI) to build constrained agent applications \[[*Details*](https://blog.langchain.dev/bcg-x-releases-agentkit-a-full-stack-starter-kit-for-building-constrained-agents/) | [*GitHub*](https://github.com/BCG-X-Official/agentkit)\].
14. **Elevenalabs**' Speech to Speech feature, launched in November, for voice transformation with control over emotions and delivery, is now ***multilingual*** and available in 29 languages \[[*Link*](https://elevenlabs.io/voice-changer)\]
15. **Apple** introduced ***Keyframer***, an LLM-powered animation prototyping tool that can generate animations from static images (SVGs). Users can iterate on their design by adding prompts and editing LLM-generated CSS animation code or properties \[[*Paper*](https://arxiv.org/pdf/2402.06071.pdf)\].
16. **Eleven Labs** launched a ***payout program*** for voice actors to earn rewards every time their voice clone is used \[[*Details*](https://elevenlabs.io/voice-actors)\].
17. **Azure OpenAI Service** announced Assistants API, new models for finetuning, new text-to-speech model and new generation of embeddings models with lower pricing \[[*Details*](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-openai-service-announces-assistants-api-new-models-for/ba-p/4049940)\].
18. **Brilliant Labs**, the developer of AI glasses, launched ***Frame***, the world’s first glasses featuring an integrated AI assistant, ***Noa***. Powered by an integrated multimodal generative AI system capable of running GPT4, Stability AI, and the Whisper AI model simultaneously, Noa performs real-world visual processing, novel image generation, and real-time speech recognition and translation. \[[*Details*](https://venturebeat.com/games/brilliant-labss-frame-glasses-serve-as-multimodal-ai-assistant/)\].
19. **Nous Research** released ***Nous Hermes 2 Llama-2 70B*** model trained on the Nous Hermes 2 dataset, with over 1,000,000 entries of primarily synthetic data \[[*Details*](https://huggingface.co/NousResearch/Nous-Hermes-2-Llama-2-70B)\].
20. **Open AI** in partnership with Microsoft Threat Intelligence, have disrupted five state-affiliated actors that sought to use AI services in support of malicious cyber activities \[[*Details*](https://openai.com/blog/disrupting-malicious-uses-of-ai-by-state-affiliated-threat-actors)\]
21. **Perplexity** partners with **Vercel**, opening AI search to developer apps \[[*Details*](https://venturebeat.com/ai/perplexity-partners-with-vercel-opening-ai-search-to-developer-apps/)\].
22. **Researchers** show that ***LLM agents can autonomously hack websites***, performing tasks as complex as blind database schema extraction and SQL injections without human feedback. The agent does not need to know the vulnerability beforehand \[[*Paper*](https://arxiv.org/html/2402.06664v1)\].
23. **FCC** makes AI-generated voices in unsolicited robocalls illegal \[[*Link*](https://www.msn.com/en-us/money/companies/fcc-bans-ai-voices-in-unsolicited-robocalls/ar-BB1hZoZ0)\].
24. **Slack** adds AI-powered search and summarization to the platform for enterprise plans \[[*Details*](https://techcrunch.com/2024/02/14/slack-brings-ai-fueled-search-and-summarization-to-the-platform/)\].

**Source**: AI Brews - you can subscribe the [newsletter here](https://aibrews.com/). it's free to join, sent only once a week with bite-sized news, learning resources and selected tools. Thanks.

Open Reddit thread
View more discussions →

More models from Reka

Continue browsing adjacent models from the same provider.

← All AI Models