Tag
NVIDIA released an optimized ONNX version of the Kokoro TTS model on Hugging Face. The 82M parameter model is lightweight, fast, and ready for commercial use.
A deep dive video explaining how AI models run from JavaScript using Transformers.js, covering tensors, ONNX, quantization, WebGPU/WASM, and more.
Trained a prompt injection classifier using ml-intern and DeepSeek V4 Flash, achieving 99% F1 with DistilBERT, optimized to ONNX int8 (~65MB) and deployable in the browser via Transformers.js v3.
Supertonic is a new open-source TTS engine that runs on-device via ONNX, supporting 31 languages and outperforming ElevenLabs in speed, even on a Raspberry Pi without a GPU.