Nova-3 now handles Arabic, Hebrew, Farsi, and Urdu natively. If you're building for MENA or South Asia markets, this removes a major integration barrier.

One API, four additional RTL languages, zero RTL preprocessing code required - faster regional expansion with lower infrastructure overhead.
Signal analysis
Deepgram's Nova-3 model now includes native support for four right-to-left languages: Arabic, Hebrew, Farsi, and Urdu. This isn't a wrapper or post-processing fix - it's baked into the model itself. The update addresses a real gap: RTL languages require different tokenization, word ordering, and acoustic handling than left-to-right models.
This matters because RTL languages account for roughly 300+ million native speakers globally. Until now, builders working with these languages either had to chain multiple services, accept degraded accuracy, or build custom workarounds. Nova-3 consolidates that into a single API call.
If you're shipping products in Middle East, North Africa, or South Asia, language support directly impacts your TAM and go-to-market velocity. Before this, you'd evaluate speech-to-text vendors and hit a wall: either accept English-only accuracy or piece together multiple APIs. That adds latency, cost, and operational complexity.
For builders specifically: this reduces your decision tree. You can now choose Deepgram for unified multilingual coverage without sacrificing RTL language quality. That's a 1-API solution instead of 2-3. More importantly, it means you can expand regional products without rearchitecting your speech pipeline.
The technical implication is cleaner. RTL languages have fundamentally different phonetic and grammatical structures - Hebrew fricatives don't map to Arabic emphatics, Farsi has distinct vowel patterns. Native model support handles these differences at inference time, not as an afterthought.
Adoption is straightforward if you're already on Nova-3. The model selection logic doesn't change - you pass the same API call, and Deepgram's language detection handles RTL identification. If you're currently using a different Deepgram model or competitor, migration is a parameter swap in your inference code.
Key consideration: test your specific use case. RTL support in Nova-3 doesn't guarantee equal accuracy across all four languages or all domains. Arabic dialect variation (Egyptian vs. Gulf vs. Levantine) still matters. Farsi and Urdu have smaller training datasets than English or Mandarin. Run benchmarks on your actual audio before rolling out to production - especially if you're processing user-generated or noisy audio.
This update signals Deepgram's commitment to non-English markets, which is increasingly table-stakes for speech providers. The timing matters: as AI adoption spreads globally, English-first tools become a liability, not a feature. Deepgram is explicitly removing that constraint.
The focus on RTL specifically is pragmatic. These languages weren't afterthoughts - they require distinct model architecture. By shipping RTL support natively, Deepgram is positioning against competitors who treat non-English as bolt-on functionality. That's a different technical and go-to-market strategy.
For builders, this lowers the friction for international expansion. You can now build and scale speech features across multiple regions without fragmenting your infrastructure. That enables product strategies previously blocked by tooling limitations.
Best use cases
Open the scenarios below to see where this shift creates the clearest practical advantage.
One concise email with the releases, workflow changes, and AI dev moves worth paying attention to.
More updates in the same lane.
Discover how to enable Basic and Enhanced Branded Calling through Twilio Console to enhance your brand's visibility.
Cohere has unveiled 'Cohere Transcribe', an open-source transcription model that enhances AI speech recognition accuracy.
Mistral AI has released Voxtral TTS, an open-source text-to-speech model, providing developers with free access to its capabilities for various applications.