Name: Whisper API
Brand: Deepgram

How to use Whisper API

Install any OpenAI-compatible SDK, point it at api.aimlapi.com/v1, and set the model to deepgram/whisper-large.

import requests, time

headers = {"Authorization": "Bearer " + AIMLAPI_KEY}
job = requests.post(
    "https://api.aimlapi.com/v1/stt/create",
    headers=headers,
    json={
      "model": "deepgram/whisper-large",
      "url": "https://example.com/audio.mp3"
    },
).json()
gid = job["generation_id"]

while True:
    res = requests.get(f"https://api.aimlapi.com/v1/stt/{gid}", headers=headers).json()
    if res.get("status") in ("completed", "error", "failed"):
        break
    time.sleep(3)
print(res)

const headers = {
  Authorization: `Bearer ${process.env.AIMLAPI_KEY}`,
  "Content-Type": "application/json",
};
const job = await (await fetch("https://api.aimlapi.com/v1/stt/create", {
  method: "POST",
  headers,
  body: JSON.stringify({
    "model": "deepgram/whisper-large",
    "url": "https://example.com/audio.mp3"
  }),
})).json();

let res;
do {
  await new Promise((r) => setTimeout(r, 3000));
  res = await (await fetch(`https://api.aimlapi.com/v1/stt/${job.generation_id}`, { headers })).json();
} while (!["completed", "error", "failed"].includes(res.status));
console.log(res);

# submit the job — the response contains "generation_id"
curl -X POST https://api.aimlapi.com/v1/stt/create \
  -H "Authorization: Bearer $AIMLAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"deepgram/whisper-large","url":"https://example.com/audio.mp3"}'

# then poll for the result until it is ready
curl "https://api.aimlapi.com/v1/stt/{generation_id}" -H "Authorization: Bearer $AIMLAPI_KEY"

OpenAI-compatible — swap the base URL and it works with your existing SDK.

Whisper API Pricing

Type	Price
Input
Output	$0.000104 / sec tokens

Whisper vs other models

Model	Input	Output	Best for
Whisper This page		$0.000104 / sec	Transcription
Speech 2.8 HD	$130 / 1M	$130 / 1M tokens	Speech synthesis
MiniMax Speech 2.6 HD	$130 / 1M	$0.13 / 1M tokens	Speech synthesis
Octave 2	$78 / 1M	$0.078 / 1M tokens	Speech synthesis
Qwen3 TTS Flash	$13 / 1M	$0.013 / 1M tokens	Speech synthesis

Whisper API

How to use Whisper API

Whisper API Pricing

Whisper vs other models

Related chat models

Related blog posts

Start building with Whisper

Whisper API

How to use Whisper API

Whisper API Pricing

Whisper vs other models

Related chat models

Related blog posts

The most accurate speech models available — pick the right one

MiniMax Audio: Voices from China

MAI-Voice-2, MAI-Transcribe 1.5 & MAI-Image 2.5 Complete Developer Guide

Start building with Whisper