Tag
Introduces Vividh-ASR, a complexity-tiered benchmark for Hindi and Malayalam ASR, identifies studio-bias in fine-tuning, and proposes R-MFT to improve spontaneous speech performance efficiently.
OpenAI released ChatGPT (GPT-3.5 Turbo) and Whisper APIs for developers, featuring 90% cost reduction since December and enabling integration into third-party applications. The announcement includes early adopter examples from Snap, Quizlet, Instacart, Shop, and Speak.
A highly optimized version of OpenAI's Whisper Large v3 using Transformers, Optimum, and Flash Attention 2, capable of transcribing 150 minutes of audio in under 2 minutes on Replicate.