Tag
WhisperX is a tool for fast automatic speech recognition with word-level timestamps and speaker diarization, offering 70x realtime transcription using Whisper large-v2.
Audien.to is a tool that turns recordings into source-linked work, likely for transcription and productivity.
OpenBrief is an open-source desktop app that lets users download videos, transcribe audio, generate grounded summaries, and chat with media content, all running locally on their machine.
An open-source CLI tool for Obsidian that provides sandboxed AI agents for audio transcription, deep research, and mind-mapping, designed to accelerate note-taking without modifying the user's vault.
A practical guide for audio transcription on macOS using Gemma 4 E2B model with MLX and mlx-vlm, including a uv run recipe and demonstration of the workflow.