Command A+ (218B MoE) running on Apple Silicon — MLX port, PR open
Summary
A PR for mlx-lm adds support for Cohere's Command A+ (218B MoE) model on Apple Silicon, with architecture details for the implementation.
Similar Articles
mlx-code — local LLM coding agent for Apple Silicon
mlx-code is a Python package that provides a local-first LLM coding agent for Apple Silicon, bundling an MLX inference server, multi-protocol API support, git worktree isolation, and composable multi-agent primitives.
I fitted the new δ-mem research for apple silicon using mlx and openclaw integration! My findings
The author implements the δ-mem research paper on Apple Silicon using MLX and OpenClaw, showing memory and attention improvements in local AI agent tests, though with mixed results compared to CUDA benchmarks.
@cohere: Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as po…
Cohere has released Command A+, its most powerful open-source LLM, optimized to run on minimal hardware.
I built mlx-Chronos — a community benchmark leaderboard for local LLM engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama)
A CS student built mlx-Chronos, an open-source CLI tool that standardizes benchmarking of MLX inference engines on Apple Silicon by measuring TTFT, throughput, memory usage, and thermal state, with a community leaderboard for sharing results.
@neural_avb: I am working on porting SAM models and harness into Apple silicon. Already seeing 1.25x inference speed increase on mlx…
Porting SAM 2.1 models to Apple silicon with MLX, achieving 1.25x inference speed increase on the small model, with quantized versions planned.