real-time-interaction

#real-time-interaction

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Hugging Face Daily Papers ↗ · 2026-04-30 Cached

MiniCPM-o 4.5 is a 9B parameter multimodal model featuring Omni-Flow, a framework enabling real-time full-duplex interaction where the model can simultaneously perceive and respond proactively. It achieves state-of-the-art open-source performance comparable to Gemini 2.5 Flash and runs on edge devices with less than 12GB RAM.

0 favorites 0 likes

#real-time-interaction

Hello GPT-4o

OpenAI Blog ↗ · 2024-05-13 Cached

OpenAI announces GPT-4o, a flagship multimodal model that processes audio, vision, text, and video in real-time with 232ms average audio response latency. The model matches GPT-4 Turbo on text/code while significantly improving multilingual, audio, and vision capabilities at 50% cheaper API costs.

0 favorites 0 likes

real-time-interaction

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Hello GPT-4o

Submit Feedback