@awnihannun: The video from @angeloskath on local agentic AI with MLX is excellent. I also hear it's one of the most viewed videos i…
Summary
A tweet highlights an excellent WWDC video by @angeloskath on running agentic AI locally with MLX, noting that recent gains in open-weight model quality and Apple hardware (M5) have made this practical.
Cached at: 06/12/26, 04:51 AM
The video from @angeloskath on local agentic AI with MLX is excellent. I also hear it’s one of the most viewed videos in WWDC history.
Goes through the basics of agentic AI and how to set it all up to run locally in a very approachable and simple way.
The demos are excellent and it’s kind of wild that they just work now. None of this was possible or practical < 1 year ago before M5 and the recent quality bump in open weights models. And it’s not done improving.
Similar Articles
@awnihannun: Three MLX videos dropped at WWDC: Running agents locally by @angeloskath https://youtube.com/watch?v=wykPErJ8M-8… Distr…
Three MLX videos from WWDC demonstrate running AI agents entirely locally on Apple Silicon using the MLX stack, including local inference, tool calling, and distributed inference across Macs, enabling no-cloud, offline AI workflows.
@Prince_Canuma: My @aiDotEngineer talk is live: "On-device Intelligence using MLX" Huge thanks to @swyx and the team for having me — ha…
The author announces that their talk, 'On-device Intelligence using MLX', given for @aiDotEngineer, is now live, and thanks the organizers and community contributors.
@swyx: full writeup and links here
A Latent Space podcast episode discusses the thesis that video models derive intelligence from LLMs, and that the next frontier is video agents. Guest Ethan He, who built Grok Imagine at xAI, shares insights on building frontier image and video systems.
I've created the fastest local AI engine for Apple Silicon. Optimised for agentic use.
The author announces the release of 'lightning-mlx', a local AI engine optimized for Apple Silicon that achieves high token speeds for coding agents and tool-calling workflows.
@nicodotdev: Everything you always wanted to know about Transformers.js, in one video. I made a deep dive into how AI models run fro…
A deep dive video explaining how AI models run from JavaScript using Transformers.js, covering tensors, ONNX, quantization, WebGPU/WASM, and more.