@rohanpaul_ai: Just a few days back, Thinking Machines Lab (TML), showcased a way of making AI interaction continuous instead of turn-…

X AI KOLs Following 05/17/26, 06:28 PM Models

full-duplex time-aligned open-source omnimodal real-time edge-deployment continuous-interaction

Summary

Thinking Machines Lab and OpenBMB released MiniCPM-o 4.5, a 9B full-duplex omnimodal model with the Omni-Flow framework that enables continuous, time-aligned real-time video and voice interaction, surpassing previous models and available as open source.

Just a few days back, Thinking Machines Lab (TML), showcased a way of making AI interaction continuous instead of turn-based, a Full-Duplex Time-aligned micro-turn. It's a preview of the future of a near-realtime AI voice and video conversation with new 'interaction models' And MiniCPM-o 4.5 already shipped the same core idea through OpenBMB’s Omni-Flow framework: time-aligned perception and response instead of old turn-based chat. A 9B Full-Duplex omnimodal model that can see, hear, and speak at the same time. Omni-Flow also treats interaction as a continuous stream on a shared temporal axis, aligning visual input, audio input, and output speech/text into time chunks so the model can perceive while responding. That breaks the old walkie-talkie UX of AI: user talks, model waits, model replies. And this is not just a demo concept. It is a 9B open model with code, weights, a report, and edge deployment under 12GB RAM. It also surpasses Qwen3-Omni-30B-A3B in omni-modal capabilities and speech generation quality. This feels like the interaction layer AI was missing. OpenBMB already shipped this as a real Full-Duplex omni-modal architecture, with video tokens, audio tokens, LLM hidden states, speech tokens, and waveform generation all synced to one shared timeline.

Original Article

View Cached Full Text

Cached at: 05/18/26, 10:30 AM

Just a few days back, Thinking Machines Lab (TML), showcased a way of making AI interaction continuous instead of turn-based, a Full-Duplex Time-aligned micro-turn.

It’s a preview of the future of a near-realtime AI voice and video conversation with new ‘interaction models’

And MiniCPM-o 4.5 already shipped the same core idea through OpenBMB’s Omni-Flow framework: time-aligned perception and response instead of old turn-based chat.

A 9B Full-Duplex omnimodal model that can see, hear, and speak at the same time.

Omni-Flow also treats interaction as a continuous stream on a shared temporal axis, aligning visual input, audio input, and output speech/text into time chunks so the model can perceive while responding.

That breaks the old walkie-talkie UX of AI: user talks, model waits, model replies.

And this is not just a demo concept. It is a 9B open model with code, weights, a report, and edge deployment under 12GB RAM.

It also surpasses Qwen3-Omni-30B-A3B in omni-modal capabilities and speech generation quality.

This feels like the interaction layer AI was missing.

OpenBMB already shipped this as a real Full-Duplex omni-modal architecture, with video tokens, audio tokens, LLM hidden states, speech tokens, and waveform generation all synced to one shared timeline.

Thinking Machines (@thinkymachines): People talk, listen, watch, think, and collaborate at the same time, in real time. We’ve designed an AI that works with people the same way.

We share our approach, early results, and a quick look at our model in action.

@rohanpaul_ai: Just a few days back, Thinking Machines Lab (TML), showcased a way of making AI interaction continuous instead of turn-…

Similar Articles

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

@rohanpaul_ai: Thinking Machines is replacing turn-taking AI with always-present AI. They just announced TML-Interaction-Small, a 276B…

@rohanpaul_ai: AI video is moving into its real-time reaction era, with MaineCoon now leading in low-latency AI video. @catnips_ai jus…

@Saboo_Shubham_: This is not an Agent, just a single AI model. Thinking Machine just launched an interaction model that can simultaneous…

openbmb/MiniCPM-RobotManip

Submit Feedback

Similar Articles

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

@rohanpaul_ai: Thinking Machines is replacing turn-taking AI with always-present AI. They just announced TML-Interaction-Small, a 276B…

@rohanpaul_ai: AI video is moving into its real-time reaction era, with MaineCoon now leading in low-latency AI video. @catnips_ai jus…

@Saboo_Shubham_: This is not an Agent, just a single AI model. Thinking Machine just launched an interaction model that can simultaneous…