onnx-runtime

AMD contributes ONNX Runtime backend to FFmpeg DNN filter

Reddit r/artificial · 2026-06-25

An AMD engineer contributed an ONNX Runtime backend to FFmpeg's DNN filter, enabling AI model inference on GPU and NPU platforms for tasks like upscaling and object detection, and notably making the Ryzen AI NPU usable from FFmpeg.

Benchmark: ONNX Runtime vs HF Transformers vs GGUF for Parakeet TDT 0.6B on CPU-only hardware [D]

Reddit r/MachineLearning · 2026-06-05

A benchmark comparing ONNX Runtime, HF Transformers, and GGUF for the Parakeet TDT 0.6B ASR model on CPU-only hardware shows ONNX Runtime achieving 37% faster inference than HF Transformers in bfloat16, while GGUF prioritizes memory efficiency.

@FeitengLi: A 99M parameter TTS runs on CPU, faster than a 2B model on A100. Supertone's newly open-sourced supertonic-3 with ONNX Runtime, fully local, can run in browser, on phone, and even on Raspberry Pi.

X AI KOLs Timeline · 2026-05-15

Supertone released Supertonic 3, an open-source, 99M-parameter TTS model that runs faster on CPU than a 2B model on an A100. It supports 31 languages and runs on ONNX Runtime for fully local inference.

How we catch silent NPU fallback on Snapdragon in CI [D]

Reddit r/MachineLearning · 2026-05-15

A blog post detailing how to detect silent NPU fallback on Snapdragon in CI, including methods like running on real hardware, gating on coefficient of variation, and parsing ORT profiling JSON to identify fallen-back ops.
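
Two of the checks named in that summary, gating on coefficient of variation and scanning the ORT profiling JSON for fallen-back ops, can be sketched roughly as follows. This is not the blog post's code. It assumes the profile is a list of Chrome-trace-style events in which node kernel events carry `args.op_name` and `args.provider`, that the intended provider is `QNNExecutionProvider`, and that a CV threshold of 0.15 is acceptable; the function names and the threshold are hypothetical.

```python
import statistics

def coefficient_of_variation(latencies_ms):
    """Sample standard deviation divided by the mean of the latencies."""
    return statistics.stdev(latencies_ms) / statistics.mean(latencies_ms)

def fallen_back_ops(profile_events, expected_provider="QNNExecutionProvider"):
    """Return sorted (op_name, provider) pairs for nodes that ran elsewhere.

    Assumes each node kernel event has cat == "Node" and an args dict with
    "op_name" and "provider"; events without a provider field are skipped.
    """
    fallen = set()
    for event in profile_events:
        if event.get("cat") != "Node":
            continue
        args = event.get("args", {})
        provider = args.get("provider")
        if provider and provider != expected_provider:
            fallen.add((args.get("op_name", "?"), provider))
    return sorted(fallen)

def gate(latencies_ms, profile_events, max_cv=0.15,
         expected_provider="QNNExecutionProvider"):
    """Collect human-readable failures; an empty list means the gate passes."""
    failures = []
    cv = coefficient_of_variation(latencies_ms)
    if cv > max_cv:  # hypothetical threshold, tune per device
        failures.append(f"latency CV {cv:.3f} exceeds {max_cv}")
    ops = fallen_back_ops(profile_events, expected_provider)
    if ops:
        failures.append(f"ops ran off {expected_provider}: {ops}")
    return failures
```

In a CI job, the events would come from `json.load` on the profile file that ONNX Runtime writes when profiling is enabled, and a non-empty result from `gate` would fail the build.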

supertone-inc/supertonic

GitHub Trending (daily) · 2026-05-13

Supertonic is an open-source, on-device text-to-speech system designed for local inference with minimal overhead, now at version 3 with support for 31 languages and improved accuracy.
