

Nova-3 Medical — specialized speech-to-text model by Deepgram fine-tuned for clinical terminology, healthcare audio, and medical transcription workflows.
Deepgram Nova-3 Medical is a domain-specialized variant of Nova-3, fine-tuned on clinical and healthcare audio datasets. It delivers significantly higher accuracy for medical terminology — prescription names, diagnoses, procedures, anatomical terms — making it the optimal choice for clinical documentation, telehealth platforms, EHR integrations, and medical dictation tools.
Technical Specifications
Performance Benchmarks
- Significantly lower word error rate on medical terminology vs. general-purpose models.
- Accurately recognizes drug names, ICD-10 diagnoses, and procedural terms.
- Real-time streaming with sub-300ms latency for live clinical workflows.
- Handles noisy clinical environments: background equipment sounds, phone audio.
- Designed to reduce manual review burden in medical documentation.
Architecture Breakdown
Nova-3 Medical is built on the same end-to-end deep learning engine as Nova-3, with an additional fine-tuning stage on curated medical audio corpora. The model's vocabulary and language model are biased toward clinical terminology, enabling reliable transcription of complex medical speech without custom vocabulary configuration.
Pricing
- $0.00559 / min
Core Features & Capabilities
- Medical Terminology Recognition: Accurate transcription of drug names, diagnoses, symptoms, and procedures.
- Streaming Transcription: Real-time clinical dictation via WebSocket.
- Speaker Diarization: Labels clinician and patient voices separately.
- Smart Formatting: Handles medical number formats, dosages, and dates.
- Entity Detection: Extracts medical entities from audio for downstream processing.
- Custom Vocabulary (keyterm): Add institution-specific or rare drug names to further boost accuracy.
- Dictation Mode: Optimized for voice-driven clinical note-taking workflows.
- Filler Word Detection: Removes hesitation markers common in dictation speech.
Comparison with Other Models
VS Deepgram Nova-3: Nova-3 Medical is fine-tuned for clinical audio and priced lower per minute; Nova-3 is the general-purpose English model for non-medical use cases.
VS Deepgram Nova-3 General: Nova-3 Medical is domain-specialized for healthcare; Nova-3 General covers 30+ languages for general multilingual workflows.
VS AssemblyAI Slam-1: Nova-3 Medical focuses on clinical ASR accuracy; Slam-1 provides semantic understanding and prompt-based customization for enterprise transcription workflows.
Deepgram Nova-3 Medical is a domain-specialized variant of Nova-3, fine-tuned on clinical and healthcare audio datasets. It delivers significantly higher accuracy for medical terminology — prescription names, diagnoses, procedures, anatomical terms — making it the optimal choice for clinical documentation, telehealth platforms, EHR integrations, and medical dictation tools.
Technical Specifications
Performance Benchmarks
- Significantly lower word error rate on medical terminology vs. general-purpose models.
- Accurately recognizes drug names, ICD-10 diagnoses, and procedural terms.
- Real-time streaming with sub-300ms latency for live clinical workflows.
- Handles noisy clinical environments: background equipment sounds, phone audio.
- Designed to reduce manual review burden in medical documentation.
Architecture Breakdown
Nova-3 Medical is built on the same end-to-end deep learning engine as Nova-3, with an additional fine-tuning stage on curated medical audio corpora. The model's vocabulary and language model are biased toward clinical terminology, enabling reliable transcription of complex medical speech without custom vocabulary configuration.
Pricing
- $0.00559 / min
Core Features & Capabilities
- Medical Terminology Recognition: Accurate transcription of drug names, diagnoses, symptoms, and procedures.
- Streaming Transcription: Real-time clinical dictation via WebSocket.
- Speaker Diarization: Labels clinician and patient voices separately.
- Smart Formatting: Handles medical number formats, dosages, and dates.
- Entity Detection: Extracts medical entities from audio for downstream processing.
- Custom Vocabulary (keyterm): Add institution-specific or rare drug names to further boost accuracy.
- Dictation Mode: Optimized for voice-driven clinical note-taking workflows.
- Filler Word Detection: Removes hesitation markers common in dictation speech.
Comparison with Other Models
VS Deepgram Nova-3: Nova-3 Medical is fine-tuned for clinical audio and priced lower per minute; Nova-3 is the general-purpose English model for non-medical use cases.
VS Deepgram Nova-3 General: Nova-3 Medical is domain-specialized for healthcare; Nova-3 General covers 30+ languages for general multilingual workflows.
VS AssemblyAI Slam-1: Nova-3 Medical focuses on clinical ASR accuracy; Slam-1 provides semantic understanding and prompt-based customization for enterprise transcription workflows.