@AlphaSignalAI: A 66M parameter model just beat ElevenLabs on a Raspberry Pi. Text-to-speech has lived in the cloud for years. Every sp…
Summary
Supertonic 3 is a 99M parameter open-source TTS model that runs entirely on-device, beating ElevenLabs on a Raspberry Pi with 167x faster than real-time performance on a laptop CPU.
View Cached Full Text
Cached at: 05/22/26, 05:56 PM
A 66M parameter model just beat ElevenLabs on a Raspberry Pi.
Text-to-speech has lived in the cloud for years.
Every spoken character cost an API call and a fraction of a cent.
Supertonic 3 is an open-source TTS model that runs entirely on the device.
No network, no key, no per-character billing.
The model is 99M parameters and ships as an ONNX file.
It hits 167x faster than real-time on a laptop CPU.
That works out to about 1,263 characters of speech per second.
Larger open systems sit closer to 55 to 287.
What the on-device design unlocks:
Runs offline on a Raspberry Pi Works inside a browser tab Handles phone numbers and currency Reads dates without any preprocessing Inline tags for laugh and breath
Language coverage jumped from 5 to 31 in this release.
The public interface stayed identical to the previous version.
Similar Articles
@akshay_pachaar: this TTS model generates speech 167x faster than you can hear it. Supertonic is an on-device TTS engine that runs via O…
Supertonic is a new open-source TTS engine that runs on-device via ONNX, supporting 31 languages and outperforming ElevenLabs in speed, even on a Raspberry Pi without a GPU.
@FeitengLi: A 99M parameter TTS runs on CPU, faster than a 2B model on A100. Supertone's newly open-sourced supertonic-3 with ONNX Runtime, fully local, can run in browser, on phone, and even on Raspberry Pi.
Supertone released Supertonic 3, an open-source TTS model with 99M parameters that runs faster on CPU than a 2B model on A100, supporting 31 languages and ONNX Runtime for fully local inference.
@GoJun315: Open-source TTS that runs locally and beats ElevenLabs. Supertonic, a speech synthesis model that runs entirely on-device, no internet required, zero API costs. - Only 99M parameters, 167x faster than real-time on M4 Pro, runs on Raspberry Pi - Supports 31 languages, covering…
Supertonic is a lightning-fast, on-device TTS model with 99M parameters, supporting 31 languages. It runs locally with no API costs, outperforms cloud TTS on accuracy for numbers, phone numbers, and technical terms, and can be installed via Python, Node.js, Rust, Go, and more.
@JafarNajafov: Supertonic just killed ElevenLabs. A text-to-speech model that runs entirely on your device. No cloud. No API key. No p…
The article highlights Supertonic, an open-source text-to-speech model that runs entirely on-device, claiming superior speed and formatting accuracy compared to cloud-based services like ElevenLabs and OpenAI.
@rohanpaul_ai: Can a smaller model purpose-built for one domain beat a frontier general model that's 100× its size? A recent paper sho…
PolyAI's Raven 3.5, a smaller specialist model, outperforms GPT-5 and Claude Sonnet 4.6 on all customer service benchmarks with under 300ms latency. The company also launches ADK and PolyPhone to accelerate enterprise voice AI deployment.