Mel AI just shared a demo of video-native AI characters that can talk, react, and respond to camera context in real time [N]

Reddit r/MachineLearning Products

Summary

Mel AI demonstrated video-based AI characters that talk, react, and respond to visual context in real time, moving beyond text-based character chat.

Character AI, founded by former Google/LaMDA developers Noam Shazeer and Daniel De Freitas, proved that text-based character chat can work as a real entertainment category. But the next chapter might not be better text chat. It might be real-time video interaction.

Mel AI recently shared a demo of AI character video chat, and the interesting part is the interaction stack: voice, lip sync, facial reactions, and camera-aware responses instead of just a static avatar or chat box. The character can respond to visual context too. If the user is visibly on a plane or in a different environment, the character can notice and react to that context during the conversation.

I don’t know how much of the video layer is truly generated in real time versus powered by a clever animation/rendering system, but it feels meaningfully different from the usual text-based character AI experience.

Character AI proved the demand for entertainment AI. Now it feels like the race is about who can make AI characters feel alive in real time.

Demo: [https://x.com/Building\_Mel/status/2064848256115626481](https://x.com/Building_Mel/status/2064848256115626481?s=20)