Free
$0. Free plan available.
Convai offers conversational AI APIs for speech recognition, language understanding and generation, and text-to-speech, enabling the design of speech-enabled applications, conversation-based characters, and speech-based games. It serves games, the metaverse, XR, and more, bringing characters to life with real-time perception and action abilities.
Use Convai's no-code creation tools or integrate with game engines like Unreal Engine and Unity using plugins and open APIs to craft character minds, embody them within avatars, and deploy them in custom environments.
You can explore our available plans and their benefits at convai.com/pricing. We are developing a more seamless payment system to simplify upgrades, which will be available soon. In the meantime, please contact us at support@convai.com for any inquiries, and we will be happy to assist you.
To upgrade your plan, go to the Billing page and click on Manage Accounts. This will open a separate portal where you can select and upgrade to your desired subscription tier. Note: The upgrade does not require immediate payment; the new pricing will be reflected in your next billing cycle.
To update your card details, visit the Billing page and click on Manage Accounts. This will open a portal where you can modify your existing payment information. Note: Some updates may trigger additional verification based on your bank’s or country’s security policies.
Invoice links expire after 30 days. We are working on a long-term solution, but you can currently access past invoices by navigating to My Profile > Billings > Manage Accounts to open your customer portal.
An interaction is counted every time a user sends input and the AI character responds, whether in text or voice. This response is based on elements like Narrative Design, Actions, Backstory, and Personality. Each generated response counts as one interaction. Example: A user says "Hello" and receives a response; that is one interaction. If the user follows up with "How are you?" and receives another response, that is a second interaction.
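As a rough illustration of the counting rule above, the sketch below tallies one interaction per character response that follows user input. The `count_interactions` helper and the transcript format are hypothetical, chosen only for illustration; they are not part of Convai's API.

```python
def count_interactions(transcript):
    """Count interactions in a list of (speaker, message) tuples.

    Assumption: one interaction = one character response that follows
    user input, whether the exchange is text or voice.
    """
    interactions = 0
    awaiting_response = False
    for speaker, _message in transcript:
        if speaker == "user":
            awaiting_response = True
        elif speaker == "character" and awaiting_response:
            interactions += 1
            awaiting_response = False
    return interactions


transcript = [
    ("user", "Hello"),
    ("character", "Hi there!"),
    ("user", "How are you?"),
    ("character", "Doing well, thanks."),
]
print(count_interactions(transcript))  # 2
```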
Character Concurrency refers to the maximum number of users who can interact with an NPC simultaneously. For example, if 5 people are using separate instances of the application, your account needs a concurrency limit of at least 5 to allow simultaneous interaction. This limit applies to the API key being deployed rather than individual NPCs, representing a limit on actively connected sessions.
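The concurrency rule can be modeled as a cap on actively connected sessions per API key. The following is a minimal sketch under that reading; the `ConcurrencyLimiter` class and the session IDs are hypothetical, and how Convai actually enforces the limit is not described here.

```python
class ConcurrencyLimiter:
    """Toy model of a per-API-key cap on actively connected sessions."""

    def __init__(self, limit):
        self.limit = limit
        self.active = set()

    def connect(self, session_id):
        # A session that is already connected does not consume a new slot.
        if session_id in self.active:
            return True
        if len(self.active) >= self.limit:
            return False
        self.active.add(session_id)
        return True

    def disconnect(self, session_id):
        self.active.discard(session_id)


limiter = ConcurrencyLimiter(limit=5)
results = [limiter.connect(f"session-{i}") for i in range(6)]
print(results)  # [True, True, True, True, True, False]

# Once a session ends, its slot is free for another user.
limiter.disconnect("session-0")
print(limiter.connect("session-5"))  # True
```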
The Flagship LLM Interaction Cap is the limit on the number of interactions you can perform using Flagship LLMs (e.g., GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro). For example, the Indie Dev plan has a total monthly quota of 3000 interactions but a Flagship LLM Interaction Cap of 1500. If you use GPT-4o, your Flagship LLM quota is exhausted after 1500 interactions, and you must switch to a non-Flagship LLM for the remaining interactions.
HQ Third Party Voice Interactions refers to the number of interactions you can perform using high-quality third-party voices, such as ElevenLabs. For example, the Indie Dev plan has a monthly limit of 500 HQ Third Party Voice Interactions. Once this monthly limit is reached, HQ voices will no longer be accessible. Note: If you have connected your own custom ElevenLabs API key, these limits do not apply, and your ElevenLabs plan limits will be in effect instead.
Monthly Active End Users refers to the number of unique users who can interact with your AI character each month. For example, the Scale plan has a cap of 200 Monthly Active End Users, so once 200 unique users have interacted with your AI character in a month, the 201st unique user will not be able to access it. If your application targets a larger audience, please contact our sales team to discuss a tailored plan.
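To make the interplay between these limits concrete, here is a small sketch of a usage tracker. Everything in it is hypothetical (`PlanLimits`, `UsageTracker`, and the field names are not Convai APIs), and it rests on two assumptions that the text above implies but does not state outright: Flagship LLM interactions also count against the total monthly quota, and users who were already active this month are still admitted after the Monthly Active End Users cap is reached. The numbers combine the examples above and do not describe a single real plan.

```python
from dataclasses import dataclass, field


@dataclass
class PlanLimits:
    monthly_interactions: int
    flagship_llm_cap: int
    hq_voice_cap: int
    monthly_active_users: int


@dataclass
class UsageTracker:
    limits: PlanLimits
    interactions: int = 0
    flagship_used: int = 0
    hq_voice_used: int = 0
    users: set = field(default_factory=set)

    def check(self, user_id, flagship_llm, hq_voice):
        """Return (allowed, reason) for one requested interaction."""
        is_new_user = user_id not in self.users
        if is_new_user and len(self.users) >= self.limits.monthly_active_users:
            return False, "monthly active end user cap reached"
        if self.interactions >= self.limits.monthly_interactions:
            return False, "monthly interaction quota exhausted"
        if flagship_llm and self.flagship_used >= self.limits.flagship_llm_cap:
            return False, "flagship LLM cap reached; switch to a non-flagship LLM"
        if hq_voice and self.hq_voice_used >= self.limits.hq_voice_cap:
            return False, "HQ voice cap reached"
        return True, "ok"

    def record(self, user_id, flagship_llm, hq_voice):
        """Check the limits and, if allowed, count the interaction."""
        allowed, reason = self.check(user_id, flagship_llm, hq_voice)
        if allowed:
            self.users.add(user_id)
            self.interactions += 1
            self.flagship_used += int(flagship_llm)
            self.hq_voice_used += int(hq_voice)
        return allowed, reason


tracker = UsageTracker(PlanLimits(3000, 1500, 500, 200))
for _ in range(1500):
    tracker.record("user-1", flagship_llm=True, hq_voice=False)

print(tracker.record("user-1", flagship_llm=True, hq_voice=False))
# (False, 'flagship LLM cap reached; switch to a non-flagship LLM')
print(tracker.record("user-1", flagship_llm=False, hq_voice=False))
# (True, 'ok')
```

After 1500 Flagship interactions the tracker rejects further Flagship requests but still accepts non-Flagship ones, matching the "switch to a non-Flagship LLM for the remaining interactions" behavior described above.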
Use these comparison pages to understand the trade-offs between the models most relevant to Convai.
Compare Gemini 1.5 Pro (deprecated) and Gemini 1.0 Pro (deprecated) across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for general-purpose AI workloads versus long-context workloads.
Compare Gemini 1.5 Pro (deprecated) and Gemini 1.0 Pro Vision (deprecated) across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for general-purpose AI workloads.
Compare Gemini 1.5 Pro (deprecated) and Gemini 1.5 Flash (deprecated) across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for general-purpose AI workloads.
Compare Gemini 1.5 Pro (deprecated) and Gemini 1.5 Flash Vision (deprecated) across pricing, context window, capabilities, benchmarks, and API access to choose the better fit for general-purpose AI workloads.
Yapping is a Chrome extension that enables voice-based interaction with ChatGPT, removing the need for manual typing. It provides access to premium features at a lower cost by allowing users to integrate their own OpenAI API key.
The IsItAI.com Chrome extension leverages artificial intelligence to detect and classify images. By analyzing uploaded files, it determines whether an image is a real photograph or AI-generated, providing a confidence score and content classification for each result.
The AI Chat Assistant Browser Extension is a tool designed to improve the convenience and efficiency of your AI chat interactions. It offers support for content creators, researchers, customer service representatives, and other professionals who rely on AI chat tools, helping to streamline their daily workflows.
ViiTor is a Chrome extension that provides real-time audio transcription and translation for live broadcasts, online videos, and virtual meetings, allowing users to display bilingual subtitles.
This AI-powered Chrome extension acts as a real-time call assistant for Google Meet, Zoom, and MS Teams. It offers live transcription, meeting summarization, and action item extraction to help you stay productive. Additionally, it provides real-time feedback on your speaking style and clarity to improve communication effectiveness.
The VSX Extension is a Chrome extension designed to improve your learning process through integrated quiz generation and chatbot capabilities. This tool offers interactive quizzes and conversational support to assist with your educational needs.
The Zalo Automation Tool is a Chrome extension designed to streamline tasks on Zalo, the popular messaging and social networking platform. It automates sending messages, adding friends, building communities, and identifying potential customers, helping businesses and individuals efficiently manage their Zalo presence and expand their reach.
ChartIQ AI is a Chrome extension that enhances the trading experience using AI-driven insights. It utilizes artificial intelligence to deliver advanced analytics, predictive modeling, and personalized recommendations, helping traders refine their decision-making and improve overall trading results.
Simplified AI is an all-in-one marketing platform designed to streamline content creation, design, and social media management. It features a suite of tools including an AI writer, image generator, video editor, and social media scheduler. The platform helps businesses and individuals create high-quality content efficiently, saving time and resources. It supports various formats, such as blog posts, articles, social media content, and marketing copy, while offering team collaboration and brand management features.
This AI-powered Chrome extension serves as a management and intelligence platform for SaaS software. It enables businesses to effectively monitor, analyze, and optimize their SaaS application usage and expenditures.
The Groq Cloud API Chrome extension provides developers with access to the Groq LPU™ Inference Engine, facilitating high-speed, efficient execution of large language models (LLMs). This API enables low-latency inference, making it suitable for real-time applications like chatbots, search engines, and content generation tools. By utilizing the Groq LPU™ architecture, developers can achieve faster inference times than traditional CPU or GPU setups, enhancing user experience and lowering operational costs.
Pebble is an AI-powered browser extension designed to assist you in actively reading and watching online content.