LLM planner - pick a rig for your use-case/model/budget, or pick models for your rig. 60+ builds, 50+ models, 130+ cited t/s sources, 150+ reviewer YouTube videos, idle+active watts, multi-region prices, regular updates.
Summary
A comprehensive web tool and public dataset that helps users choose the right hardware for running LLMs, featuring 60+ builds, 50+ models, performance benchmarks, and reviewer videos, with two-way matching between models and hardware.
Similar Articles
Show HN: Find the best local LLM for your hardware, ranked by benchmarks
whichllm is an open-source Python tool that auto-detects your GPU/CPU/RAM and ranks the best local LLMs from HuggingFace that fit your system, using real benchmarks rather than size heuristics.
@oliviscusAI: Someone just built a tool that tells you exactly which LLMs will run on your hardware. it scans your ram, cpu, and gpu,…
A new tool has been released that scans a user's hardware specifications (RAM, CPU, GPU) to determine which Large Language Models can run locally, ranking them by performance metrics.
Inference Engines for LLMs & Local AI Hardware (2026 Edition)
This article provides a comprehensive guide to LLM inference engines for local AI hardware in 2026, explaining how to choose based on hardware strategy, workload, and serving model, and covering engines like llama.cpp, MLX, ExLlamaV2/3, vLLM, SGLang, TensorRT-LLM, and NVIDIA Dynamo.
About to build a 6× Arc B70 LLM rig, want to talk to someone experienced first
A user seeks experienced guidance on building a 6× Intel Arc B70 LLM inference rig, particularly for Llama models and vLLM deployment, offering compensation for consultation.
We have built the first-of-its-kind interactive blog for matching open-source LLMs to GPUs.
AgentSwarms launched an interactive, gamified blog that helps users match open-source LLMs to the right GPU by calculating VRAM requirements based on model size and quantization, turning infrastructure planning into an engaging experience.