Krisp Voice Translation API
Summary
Krisp launches a real-time speech-to-speech translation API designed for high accuracy.
Similar Articles
Build a Realtime Speech Translation (28 minute read)
OpenAI releases gpt-realtime-translate, a low-latency speech-to-speech model optimized for live interpretation, accompanied by a developer cookbook for building multilingual browser, phone, and video applications.
@kwindla: https://x.com/kwindla/status/2062544580105359686
NVIDIA released Nemotron 3.5 ASR, an open-source multilingual speech-to-text model with the lowest latency tested, available in multilingual and English-only variants, ideal for voice agents and self-hosted deployments.
Parrot Speech-to-text API
Parrot Speech-to-text API offers fast and accurate transcription for production-grade voice agents.
Gemini 3.5 Live Translate
Gemini 3.5 Live Translate is a new audio model for real-time speech-to-speech translation.
@tom_doerr: Transcribes audio at 70x real-time speed https://github.com/m-bain/whisperX
WhisperX is a tool for fast automatic speech recognition with word-level timestamps and speaker diarization, offering 70x realtime transcription using Whisper large-v2.