@mervenoyann: hello I'm Merve I want every single developer to build computer vision applications with agents using open models, but …
Summary
Merve announces her talk at aiDotEngineer WF, where she will present a new project to help developers build computer vision applications with agents using open models, and shares a sneak peek.
Cached at: 07/01/26, 10:04 AM
hello I’m Merve 👋🏻
I want every single developer to build computer vision applications with agents using open models, but possibilities are endless they get lost
here to change it, come to my talk at @aiDotEngineer WF tomorrow to see my new project, here’s a sneak peek https://t.co/ERX0YUGyGB
Similar Articles
@aiDotEngineer: Your Agent Can Now Train Models The argument from @mervenoyann: open source models have caught up. GLM 5.1 is leading t…
The talk by @mervenoyann demonstrates that open source models like GLM 5.1 have caught up to closed models, and shows how Hugging Face's ecosystem enables agents to train models, run inference, and build workflows.
@mervenoyann: I will be speaking at local AI summit tomorrow 2.25 about Compression at Edge, it will be so much fun!
Merve announces she will be speaking at a local AI summit on Compression at Edge. In a reply, Ahmad Osman claims to have teamed up with NVIDIA to make Local AI the default, calling it massive news.
@mercor_ai: Agents are only as good as the environments behind them. At Mercor, we've built deep expertise in the realistic, econom…
Mercor announces joining the OpenEnv committee alongside Meta, PyTorch, NVIDIA, PrimeIntellect, and Hugging Face to guide the open foundation for agentic environments.
@mervenoyann: everyone's building simple agents meanwhile IBM is building robust enterprise agents in production, and it's open-sourc…
IBM released an open-source blog on Hugging Face detailing how to build robust enterprise agents with structured reasoning and tool use, going beyond basic LLMs and agents.
@miramurati: Today we're sharing our work on interaction models. A new class of model trained from scratch to handle real-time inter…
Mira Murati's team showcased a preview of the new interaction model. Trained from scratch, it natively supports full-duplex real-time audio and video conversations, instant interruptions, multi-language translation, and dynamic multi-tasking. The demonstration verified its core capabilities in low-latency streaming interaction, multimodal perception, and concurrent task execution.