Fully Realtime Interaction Models

Reddit r/LocalLLaMA Models

Summary

Discussion of an upcoming fully realtime interaction model that will be released via API, with plans to create distillation data from it.

I know this model isn't open weights, and when it does drop it'll be over API, but I'm just posting to say that the very MICROsecond this drops, you already know me and probably a bunch of other people are going to create an insane amount of distill data from the API. At least to me, the very idea of a model that has the complete ability to act of its own accord is fascinating. I'm referencing this: [https://thinkingmachines.ai/blog/interaction-models/](https://thinkingmachines.ai/blog/interaction-models/)

Similar Articles

Interaction Models

Hacker News Top

Thinking Machines Lab announces a research preview of interaction models, a new architecture designed for native, real-time human-AI collaboration across audio, video, and text. By replacing turn-based interfaces with a multi-stream, micro-turn design, the model aims to keep humans actively in the loop while delivering state-of-the-art intelligence and responsiveness.

Introducing the Realtime API

OpenAI Blog

OpenAI introduces the Realtime API, enabling developers to build low-latency multimodal speech-to-speech conversational experiences with natural voice interactions powered by GPT-4o. The API supports six preset voices and simplifies development by eliminating the need to integrate multiple models.