Choosing a Mac Mini for local LLMs — what would YOU actually buy?

Reddit r/LocalLLaMA News

Summary

A community discussion post seeking advice on which Mac Mini configuration (M4, M2 Pro, or M1 Max) to purchase for running local LLMs with Ollama and coding assistants, with the decision complicated by rumored M5 releases and current supply shortages.

Got three options on my radar and genuinely can't decide. Not looking for spec sheets — want to hear from people actually running this stuff daily: M4 (32GB) — newest but apparently the slowest of the three for inference? M2 Pro (32GB) — heard it actually beats the base M4 on tok/s M1 Max (64GB) — oldest chip but highest memory bandwidth Running Ollama, coding assistants (Qwen/Kimi), maybe some RAG pipelines. Budget is $2–3k so I'm not totally screwed on options. And yeah obv openclaw to stop spending on closed models. The big thing holding me back: there are strong rumours that Apple is dropping an M5 Mac Mini and M5 Mac Studio around WWDC 2026. Apparently stock on current models is already drying up (4–5 month wait times in some configs). So do I pull the trigger now or sit tight a few more months? What's you are using ? And if you were buying today, would you wait for M5 or just grab the M4 Pro 48GB and get to work?
Original Article

Similar Articles

Which computer should I buy: Mac or custom-built 5090? [D]

Reddit r/MachineLearning

A user seeks advice on whether to purchase a Mac (M5) or custom-built RTX 5090 for machine learning projects involving fine-tuning, custom pipelines, and image/video-heavy workflows, with curiosity about Apple's MLX framework as an alternative to NVIDIA CUDA.

2x 512gb ram M3 Ultra mac studios

Reddit r/LocalLLaMA

A user shares their $25k hardware setup of two 512GB RAM M3 Ultra Mac Studios for running large language models locally, having tested DeepSeek V3 Q8 and GLM 5.1 Q4 via the exo distributed inference backend, while awaiting Kimi 2.6 MLX optimization.