Transform your applications with our comprehensive suite of Speech AI solutions.
Convert spoken words into precise text with our state-of-the-art Speech-to-Text API. Built for developers who need reliable, scalable, and accurate speech recognition.
Whisper is an open-source multilingual speech recognition model with multiple size variants for different processing needs.
The Nova 2 series offers industry-specific speech-to-text models optimized for use cases such as meetings, medical, and finance.
Real-time removal of PII and other sensitive information from audio transcriptions.
For research and experimental applications. Access Model Card.
An advanced model suite for every use case. Access Model Card.
Security & compliance: automated PII and sensitive information removal.
Transform text into lifelike speech with our Text-to-Speech API. Aura voice models deliver natural-sounding synthesis across a range of personas. Access Model Card.
Zeus, Hera, and Athena voices designed for corporate and formal content delivery.
Luna, Stella, and Asteria voices optimized for natural, friendly dialogue and casual interactions.
Orion, Perseus, Orpheus, Helios, Arcas, and Angus voices crafted for specific use cases like narration and entertainment.
Get 25% OFF text-to-speech and speech-to-text models in AI/ML API. Use code "SPEECHAPI" on the "Start-up" plan.
Begin by signing up on our AI/ML API platform. Create your account to gain access to 200+ AI models including TTS and STT.
In the Playground, navigate to the Billing section and activate 25% OFF on the "Start-up" plan with the promo code "SPEECHAPI".
In the Playground, navigate to the Key Management section and click on Create API Key. Then integrate the API into your application using our detailed documentation.
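Once you have a key, the integration step might look roughly like the sketch below. It only builds a text-to-speech request and does not send it. The endpoint URL, the model identifier, the `model` and `text` payload fields, the `AIML_API_KEY` environment variable, and the Bearer-token header are all assumptions made for illustration; take the real values from the documentation.

```python
import json
import os
import urllib.request

# Placeholder values: replace with the endpoint and model ID from the docs.
API_URL = "https://api.example.com/v1/tts"  # hypothetical endpoint
MODEL_ID = "example-tts-model"  # hypothetical model identifier


def build_tts_request(text: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request carrying the API key."""
    # Payload field names are assumed, not taken from the real API.
    payload = json.dumps({"model": MODEL_ID, "text": text}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            # Bearer-token authentication is an assumption here.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Read the key from the environment rather than hard-coding it.
    key = os.environ.get("AIML_API_KEY", "")
    req = build_tts_request("Hello from the Speech API.", key)
    print(req.get_method(), req.full_url)
```

Keeping the key in an environment variable avoids committing it to source control. Sending the request (for example with `urllib.request.urlopen`) is left out because the response format depends on the actual API.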