native-audio

#native-audio

Gemma 4 12B native encoder-free voice input: utilization suggestions?

Reddit r/LocalLLaMA · 2026-06-14

Discusses leveraging Gemma 4 12B's encoder-free architecture for native voice input, seeking out-of-the-box solutions for low-latency streaming audio ingestion.

#native-audio

Improved Gemini audio models for powerful voice experiences

Google DeepMind Blog · 2025-12-12

Google has updated Gemini 2.5 Flash Native Audio to improve live voice agent capabilities, including sharper function calling, better instruction following, and smoother conversation context retrieval. The update also introduces live speech translation in the Google Translate app beta, preserving intonation across 70+ languages.
