我为Apple Silicon打造了最快的本地AI引擎。专为代理式使用优化。

Reddit r/LocalLLaMA 2026/05/08 02:27 工具

apple-silicon local-ai coding-agents open-source mlx performance

摘要

作者宣布发布'lightning-mlx'，这是一个针对Apple Silicon优化的本地AI引擎，可为编码代理和工具调用工作流实现高令牌速度。

https://preview.redd.it/p0rqofxvrtzg1.png?width=1460&format=png&auto=webp&s=8ce5b18b4ddaad9b71f71fd8eb623839fc9c6c8b 几周来我一直在为Apple Silicon打造最快的本地AI引擎……我终于做到了！它专为代理式使用优化，特别聚焦于编码代理、工具调用和短轮次工作流。仓库：[https://github.com/samuelfaj/lightning-mlx](https://github.com/samuelfaj/lightning-mlx) 来自我的Macbook Max M5（128GB）的一些结果：* Qwen3.6-27B **40.67 tok/s** * Qwen3.6-35B-A3B **220.86 tok/s** 欢迎就以下方面提供反馈：1. 针对本地编码代理的更好基准设计 2. MTPLX预设默认值是否合理 3. 应测试的其他Apple Silicon配置

查看原文

相似文章

@awnihannun: Three MLX videos dropped at WWDC: Running agents locally by @angeloskath https://youtube.com/watch?v=wykPErJ8M-8… Distr…

X AI KOLs Following

Three MLX videos from WWDC demonstrate running AI agents entirely locally on Apple Silicon using the MLX stack, including local inference, tool calling, and distributed inference across Macs, enabling no-cloud, offline AI workflows.

我为Apple Silicon打造了最快的本地AI引擎。专为代理式使用优化。

相似文章

@awnihannun: Three MLX videos dropped at WWDC: Running agents locally by @angeloskath https://youtube.com/watch?v=wykPErJ8M-8… Distr…

mlx-code — 用于Apple Silicon的本地LLM编码代理

@julien_c：Apple Silicon 是本地AI之王吗？

New MLX LM Server From Apple

我构建了mlx-Chronos——一个面向Apple Silicon上本地LLM引擎的社区基准测试排行榜（oMLX、Rapid-MLX、mlx-lm、Ollama）

提交意见反馈