@onusoz: I created an LLM leaderboard based on Hugging Face download and like counts, grouped, filtered and time-averaged. Top 5…

X AI KOLs Following Tools

Summary

An LLM leaderboard based on Hugging Face download and like counts, grouped, filtered, and time-averaged, highlighting the most popular models like Qwen and Gemma.

I created an LLM leaderboard based on Hugging Face download and like counts, grouped, filtered and time-averaged. Top 5 downloads is shared by @Alibaba_Qwen and @googlegemma Top 5 likes, on the other hand also includes @deepseek_ai V4 Pro Even @OpenAI makes it to #8 top downloads with gpt-oss-20b qwen3-6-35b-a3b is the second most CIRCULATED LLM of this year, with an average of 21 million downloads per month, since the day it was released 2 months ago Despite first place belonging to 8mo old qwen3-vl-2b-instruct, the highlight belongs to the mid-sized MoE model, which has hit a size/performance sweet spot so hard that it absolutely SHATTERED Hugging Face leaderboards in the 2 months since it has launched qwen3-6-35b-a3b is followed closely by its dense sibling 27b --- and then the mid-sized gemma 4 models 26b-a4b and 31b Note that a model's distribution is inversely proportional to its size, but not strictly! Usefulness plays a factor as well, since gemma 4 26b-a4b is being downloaded more than the smaller gemma 4 e4b I created this leaderboard because Hugging Face's all time highest downloads and likes did not give me enough information about what is really popular, neither today, nor all-time. I wanted something in between How do I calculate this ranking? - Get models that with n_downloads >= 100k - Exclude models older than 1 year - Deduplicate and group quantizations and variants of the same model based on slug prefix heuristics - For each group, sum up total downloads of all time - Sort by descending total_downloads / age = average_downloads_per_day (can also sort w.r. to likes per month) - Repeat every day to get the most up to date ranking More info and source on the leaderboard page, hosted on a Hugging Face space: https://osolmaz-leaderboard.hf.space This is a work in progress, please reply below if you see a model that should be there is missing, or any other mistakes
Original Article
View Cached Full Text

Cached at: 06/24/26, 07:59 AM

I created an LLM leaderboard based on Hugging Face download and like counts, grouped, filtered and time-averaged. Top 5 downloads is shared by @Alibaba_Qwen and @googlegemma

Top 5 likes, on the other hand also includes @deepseek_ai V4 Pro

Even @OpenAI makes it to #8 top downloads with gpt-oss-20b

qwen3-6-35b-a3b is the second most CIRCULATED LLM of this year, with an average of 21 million downloads per month, since the day it was released 2 months ago

Despite first place belonging to 8mo old qwen3-vl-2b-instruct, the highlight belongs to the mid-sized MoE model, which has hit a size/performance sweet spot so hard that it absolutely SHATTERED Hugging Face leaderboards in the 2 months since it has launched

qwen3-6-35b-a3b is followed closely by its dense sibling 27b — and then the mid-sized gemma 4 models 26b-a4b and 31b

Note that a model’s distribution is inversely proportional to its size, but not strictly! Usefulness plays a factor as well, since gemma 4 26b-a4b is being downloaded more than the smaller gemma 4 e4b

I created this leaderboard because Hugging Face’s all time highest downloads and likes did not give me enough information about what is really popular, neither today, nor all-time. I wanted something in between

How do I calculate this ranking?

  • Get models that with n_downloads >= 100k
  • Exclude models older than 1 year
  • Deduplicate and group quantizations and variants of the same model based on slug prefix heuristics
  • For each group, sum up total downloads of all time
  • Sort by descending total_downloads / age = average_downloads_per_day (can also sort w.r. to likes per month)
  • Repeat every day to get the most up to date ranking

More info and source on the leaderboard page, hosted on a Hugging Face space: https://osolmaz-leaderboard.hf.space

This is a work in progress, please reply below if you see a model that should be there is missing, or any other mistakes


Open LLM Distribution Leaderboard

Source: https://osolmaz-leaderboard.hf.space/ 1qwen3-vl-2b-instruct26.1M/mo62/mo2.13B2025-10-198mo agoQwen/Qwen3-VL-2B-Instruct2qwen3-6-35b-a3b21.4M/mo2.2K/mo36.0B2026-04-152mo agoQwen/Qwen3.6-35B-A3B3qwen3-6-27b20.4M/mo2.2K/mo27.8B2026-04-212mo agoQwen/Qwen3.6-27B-FP84gemma-4-26b-a4b-it17.7M/mo1.1K/mo26.5B2026-03-113mo agogoogle/gemma-4-26B-A4B-it5gemma-4-31b-it14.1M/mo1.5K/mo32.7B2026-03-113mo agogoogle/gemma-4-31B-it6qwen3-5-9b11.3M/mo665.1/mo9.65B2026-02-273mo agoQwen/Qwen3.5-9B7qwen3-0-6b10.9M/mo102.9/mo0.75B2025-04-271yr 1mo agoQwen/Qwen3-0.6B8qwen2-5-1-5b-instruct9.7M/mo41.8/mo1.54B2024-09-171yr 9mo agoQwen/Qwen2.5-1.5B-Instruct9llama-3-1-8b-instruct8.5M/mo286/mo8.03B2024-07-181yr 11mo agometa-llama/Llama-3.1-8B-Instruct10gemma-4-e4b-it8.4M/mo579.6/mo8.00B2026-03-023mo agogoogle/gemma-4-E4B-it11qwen2-5-7b-instruct8.1M/mo78.2/mo7.62B2024-09-161yr 9mo agoQwen/Qwen2.5-7B-Instruct12gpt-oss-20b8M/mo519/mo21.5B2025-08-0410mo agoopenai/gpt-oss-20b13gemma-4-12b-it7.5M/mo4.5K/mo12.0B2026-05-231mo agogoogle/gemma-4-12B-it14qwen3-5-4b7.5M/mo268.6/mo4.66B2026-02-273mo agoQwen/Qwen3.5-4B15qwen3-5-35b-a3b7.4M/mo656.6/mo36.0B2026-02-243mo agoQwen/Qwen3.5-35B-A3B16qwen3-8b6.5M/mo102.8/mo8.19B2025-04-271yr 1mo agoQwen/Qwen3-8B17qwen3-5-27b6.5M/mo617.7/mo27.8B2026-02-243mo agoQwen/Qwen3.5-27B18deepseek-v4-flash6M/mo1K/mo284B2026-04-261mo agoantirez/deepseek-v4-gguf19llama-3-2-1b-instruct5.4M/mo82.1/mo1.24B2024-09-181yr 9mo agometa-llama/Llama-3.2-1B-Instruct20qwen3-4b-instruct-25075.3M/mo90.8/mo4.02B2025-08-0510mo agoQwen/Qwen3-4B-Instruct-250721qwen2-5-vl-3b-instruct5.2M/mo43.5/mo3.75B2025-01-261yr 4mo agoQwen/Qwen2.5-VL-3B-Instruct22qwen3-4b5M/mo72.8/mo4.02B2025-04-271yr 1mo agoQwen/Qwen3-4B23qwen2-5-3b-instruct5M/mo31.5/mo3.09B2024-09-171yr 9mo agoQwen/Qwen2.5-3B-Instruct24qwen3-vl-8b-instruct4.9M/mo123.8/mo8.77B2025-10-118mo agoQwen/Qwen3-VL-8B-Instruct25qwen2-5-vl-7b-instruct4.9M/mo111.7/mo8.29B2025-01-261yr 4mo agoQwen/Qwen2.5-VL-7B-Instruct26qwen3-coder-next4.5M/mo520.1/mo79.7B2026-01-304mo agoQwen/Qwen3-Coder-Next27gpt-oss-120b4.4M/mo495.2/mo120B2025-08-0410mo agoopenai/gpt-oss-120b28kimi-k2-54.2M/mo516.6/mo1059B2026-01-015mo agomoonshotai/Kimi-K2.529qwen3-32b3.7M/mo68/mo32.8B2025-04-271yr 1mo agoQwen/Qwen3-32B30gemma-4-e2b-it3.7M/mo308.2/mo5.12B2026-03-023mo agogoogle/gemma-4-E2B-it31qwen3-1-7b3.6M/mo36.1/mo2.03B2025-04-271yr 1mo agoQwen/Qwen3-1.7B32deepseek-v4-pro3.5M/mo2.5K/mo862B2026-04-222mo agodeepseek-ai/DeepSeek-V4-Pro33gemma-3-1b-it3.2M/mo71.5/mo1.00B2025-03-101yr 3mo agogoogle/gemma-3-1b-it34qwen3-vl-4b-instruct3.1M/mo57/mo4.44B2025-10-118mo agoQwen/Qwen3-VL-4B-Instruct35glm-53.1M/mo102.9/mo754B2026-02-114mo agozai-org/GLM-5-FP836deepseek-v3-23.1M/mo217/mo685B2025-12-016mo agodeepseek-ai/DeepSeek-V3.237qwen3-5-0-8b3M/mo214.4/mo0.87B2026-02-283mo agoQwen/Qwen3.5-0.8B38kimi-k2-63M/mo652/mo1059B2026-04-142mo agomoonshotai/Kimi-K2.639qwen3-coder-30b-a3b-instruct2.9M/mo203.7/mo30.5B2025-07-3110mo agoQwen/Qwen3-Coder-30B-A3B-Instruct40llama-3-2-3b-instruct2.7M/mo120.2/mo3.21B2024-09-181yr 9mo agometa-llama/Llama-3.2-3B-Instruct41glm-4-7-flash2.7M/mo473/mo31.2B2026-01-195mo agozai-org/GLM-4.7-Flash42nvidia-nemotron-3-super-120b-a12b2.7M/mo292.1/mo67.2B2026-03-103mo agonvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP443qwen3-vl-30b-a3b-instruct2.7M/mo85.1/mo31.1B2025-09-308mo agoQwen/Qwen3-VL-30B-A3B-Instruct44qwen3-5-397b-a17b2.6M/mo428.9/mo403B2026-02-164mo agoQwen/Qwen3.5-397B-A17B45llama-3-2-1b2.6M/mo116.1/mo1.24B2024-09-181yr 9mo agometa-llama/Llama-3.2-1B46qwen3-5-122b-a10b2.5M/mo214/mo125B2026-02-253mo agoQwen/Qwen3.5-122B-A10B-FP847qwen2-5-14b-instruct2.5M/mo20/mo14.8B2024-09-161yr 9mo agoQwen/Qwen2.5-14B-Instruct48diffusiongemma-26b-a4b-it2.4M/mo1.5K/mo25.8B2026-06-0914d agogoogle/diffusiongemma-26B-A4B-it49qwen2-5-0-5b-instruct2.3M/mo30.4/mo0.49B2024-09-161yr 9mo agoQwen/Qwen2.5-0.5B-Instruct50nvidia-nemotron-3-nano-30b-a3b2.3M/mo193.3/mo31.6B2025-12-046mo agonvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF1651minimax-m2-72.2M/mo532.9/mo229B2026-04-092mo agoMiniMaxAI/MiniMax-M2.752qwen3-14b2.2M/mo39/mo14.8B2025-04-271yr 1mo agoQwen/Qwen3-14B53nemotron-3-nano-omni-30b-a3b-reasoning2.1M/mo258.8/mo18.3B2026-04-241mo agonvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP454meta-llama-3-8b-instruct2.1M/mo180.5/mo8.03B2024-04-172yr 2mo agometa-llama/Meta-Llama-3-8B-Instruct55qwen3-next-80b-a3b-instruct2.1M/mo125.5/mo81.3B2025-09-099mo agoQwen/Qwen3-Next-80B-A3B-Instruct56qwen3-5-2b2.1M/mo117.5/mo2.27B2026-02-283mo agoQwen/Qwen3.5-2B57qwen3-vl-32b-instruct1.9M/mo32.5/mo33.4B2025-10-198mo agoQwen/Qwen3-VL-32B-Instruct58qwen2-vl-2b-instruct1.9M/mo23.4/mo2.21B2024-08-281yr 9mo agoQwen/Qwen2-VL-2B-Instruct59deepseek-r11.9M/mo785.2/mo685B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R160qwen2-vl-7b-instruct1.9M/mo60.9/mo8.29B2024-08-281yr 9mo agoQwen/Qwen2-VL-7B-Instruct61qwen3-30b-a3b-instruct-25071.9M/mo118.9/mo30.5B2025-07-2810mo agoQwen/Qwen3-30B-A3B-Instruct-250762qwen2-5-32b-instruct1.9M/mo23.6/mo32.8B2024-09-171yr 9mo agoQwen/Qwen2.5-32B-Instruct63qwen3-tts-12hz-1-7b-base1.8M/mo84.2/mo1.93B2026-01-215mo agoQwen/Qwen3-TTS-12Hz-1.7B-Base64gemma-3-4b-it1.7M/mo87.4/mo4.30B2025-02-201yr 4mo agogoogle/gemma-3-4b-it65meta-llama-3-8b1.7M/mo251.2/mo8.03B2024-04-172yr 2mo agometa-llama/Meta-Llama-3-8B66llama-3-3-70b-instruct1.6M/mo157.1/mo70.6B2024-11-261yr 6mo agometa-llama/Llama-3.3-70B-Instruct67gemma-3-12b-it1.6M/mo62.1/mo12.2B2025-03-011yr 3mo agogoogle/gemma-3-12b-it68deepseek-r1-05281.6M/mo192.4/mo685B2025-05-281yr agodeepseek-ai/DeepSeek-R1-052869qwen3-30b-a3b1.6M/mo75.2/mo30.5B2025-04-271yr 1mo agoQwen/Qwen3-30B-A3B70deepseek-r1-distill-qwen-32b1.6M/mo92.6/mo32.8B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Qwen-32B71llama-3-1-70b-instruct1.5M/mo41.6/mo70.6B2024-07-161yr 11mo agometa-llama/Llama-3.1-70B-Instruct72gemma-3-27b-it1.5M/mo131.4/mo27.4B2025-03-011yr 3mo agogoogle/gemma-3-27b-it73qwen2-7b-instruct1.4M/mo28.3/mo7.62B2024-06-042yr agoQwen/Qwen2-7B-Instruct74qwen3-4b-base1.4M/mo6.9/mo4.02B2025-04-281yr 1mo agoQwen/Qwen3-4B-Base75qwen2-5-coder-7b-instruct1.3M/mo50.5/mo7.62B2024-09-171yr 9mo agoQwen/Qwen2.5-Coder-7B-Instruct76gemma-3-270m1.3M/mo98/mo0.27B2025-08-0510mo agogoogle/gemma-3-270m77gemma-2-2b-it1.2M/mo66.4/mo2.61B2024-08-011yr 10mo agoMaziyarPanahi/gemma-2-2b-it-GGUF78deepseek-r1-distill-qwen-1-5b1.1M/mo89.4/mo1.78B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B79llama-3-1-8b1.1M/mo98.1/mo8.03B2024-07-141yr 11mo agometa-llama/Llama-3.1-8B80qwen2-5-0-5b1.1M/mo20/mo0.49B2024-09-151yr 9mo agoQwen/Qwen2.5-0.5B81deepseek-r1-0528-qwen3-8b1.1M/mo87.2/mo1.28B2025-05-291yr agolmstudio-community/DeepSeek-R1-0528-Qwen3-8B-MLX-4bit82gemma-2-2b1.1M/mo28.6/mo2.61B2024-07-161yr 11mo agogoogle/gemma-2-2b83deepseek-r1-distill-llama-8b1.1M/mo50.7/mo8.03B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Llama-8B84qwen2-1-5b-instruct1M/mo6.6/mo1.54B2024-06-032yr agoQwen/Qwen2-1.5B-Instruct85deepseek-v3981.4K/mo228.3/mo685B2024-12-251yr 5mo agodeepseek-ai/DeepSeek-V386llama-3-1-nemotron-nano-vl-8b-v1970K/mo14.3/mo8.72B2025-06-031yr agonvidia/Llama-3.1-Nemotron-Nano-VL-8B-V187llama-2-7b904K/mo65.7/mo6.74B2023-07-132yr 11mo agometa-llama/Llama-2-7b-hf88qwen2-5-coder-32b-instruct901K/mo118.7/mo32.8B2024-11-061yr 7mo agoQwen/Qwen2.5-Coder-32B-Instruct89llama-3-1-405b899K/mo42/mo406B2024-07-161yr 11mo agometa-llama/Llama-3.1-405B90qwen2-5-7b890.2K/mo13.8/mo7.62B2024-09-151yr 9mo agoQwen/Qwen2.5-7B91llama-2-7b-chat855.1K/mo135.3/mo6.74B2023-07-132yr 11mo agometa-llama/Llama-2-7b-chat-hf92qwen2-5-1-5b853.2K/mo9.1/mo1.54B2024-09-151yr 9mo agoQwen/Qwen2.5-1.5B93deepseek-r1-distill-qwen-7b839.3K/mo49.7/mo7.62B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Qwen-7B94qwen3-vl-235b-a22b-instruct821.5K/mo48.6/mo236B2025-09-229mo agoQwen/Qwen3-VL-235B-A22B-Instruct95qwen2-5-coder-14b-instruct818.6K/mo10/mo14.8B2024-11-061yr 7mo agoQwen/Qwen2.5-Coder-14B-Instruct96qwen3-8b-base800.5K/mo7.8/mo8.19B2025-04-281yr 1mo agoQwen/Qwen3-8B-Base97qwen2-5-72b-instruct792.2K/mo48.8/mo73.0B2024-09-171yr 9mo agoQwen/Qwen2.5-72B-Instruct-AWQ98glm-4-6v-flash790K/mo94.9/mounknown2025-12-086mo agolmstudio-community/GLM-4.6V-Flash-MLX-4bit99qwen3-4b-thinking-2507784.9K/mo62.8/mo4.02B2025-08-0510mo agoQwen/Qwen3-4B-Thinking-2507100lfm2-24b-a2b770.2K/mo1.5/mo23.8B2026-02-233mo agolmstudio-community/LFM2-24B-A2B-MLX-4bit101minimax-m2-5723K/mo345.8/mo229B2026-02-124mo agoMiniMaxAI/MiniMax-M2.5102qwen3-omni-30b-a3b-instruct710.3K/mo103.7/mo35.3B2025-09-209mo agoQwen/Qwen3-Omni-30B-A3B-Instruct103mistral-nemo-instruct-2407666K/mo2.8/mo12.2B2024-07-181yr 11mo agoMaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF104llama-3-2-3b630.3K/mo39.5/mo3.21B2024-09-181yr 9mo agometa-llama/Llama-3.2-3B105qwen2-5-vl-32b-instruct610.8K/mo32.5/mo33.5B2025-03-211yr 3mo agoQwen/Qwen2.5-VL-32B-Instruct106lfm2-5-1-2b-instruct605.9K/mo0.5/mo0.33B2026-01-075mo agolmstudio-community/LFM2.5-1.2B-Instruct-MLX-8bit107gemma-4-e4b594.2K/mo87.9/mo8.00B2026-03-023mo agogoogle/gemma-4-E4B108chatglm2-6b591.9K/mo57.2/mounknown2023-06-242yr 11mo agozai-org/chatglm2-6b109nvidia-nemotron-3-nano-4b581.9K/mo26.5/mo3.97B2026-03-073mo agonvidia/NVIDIA-Nemotron-3-Nano-4B-BF16110llama-4-scout-17b-16e-instruct573K/mo90.2/mo109B2025-04-021yr 2mo agometa-llama/Llama-4-Scout-17B-16E-Instruct111deepseek-v3-0324568.3K/mo210.2/mo685B2025-03-241yr 3mo agodeepseek-ai/DeepSeek-V3-0324112qwen3guard-gen-0-6b565.9K/mo8.1/mo0.75B2025-09-238mo agoQwen/Qwen3Guard-Gen-0.6B113qwen3-6-40b-claude-4-6-opus-deckard-heretic-uncensored-thinking-neo-code-di-imatrix-max557.5K/mo248.1/mo39.1B2026-05-011mo agoDavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF114mistral-7b-instruct-v0-3551.2K/mo5.7/mo7.25B2024-05-222yr 1mo agoMaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF115solar-pro-preview-instruct536.2K/mo1.4/mo22.1B2024-09-131yr 9mo agoMaziyarPanahi/solar-pro-preview-instruct-GGUF116phi-3-5-mini-instruct535.4K/mo1.4/mo3.82B2024-08-201yr 10mo agoMaziyarPanahi/Phi-3.5-mini-instruct-GGUF117glm-4-5-air535.2K/mo57.3/mo110B2025-07-2011mo agozai-org/GLM-4.5-Air118qwen3-6-27b-heretic-uncensored-finetune-neo-code-di-imatrix-max534.5K/mo186.2/mo26.9B2026-04-291mo agoDavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF119step-3-7-flash533.1K/mo484/mo104B2026-05-2727d agostepfun-ai/Step-3.7-Flash-NVFP4120yi-coder-1-5b-chat529.7K/mo0.8/mo1.48B2024-09-041yr 9mo agoMaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF121qwen2-0-5b517.6K/mo6.8/mo0.49B2024-05-312yr agoQwen/Qwen2-0.5B122minimax-m3509.2K/mo1.3K/mo440B2026-06-0221d agoMiniMaxAI/MiniMax-M3-MXFP8123mathstral-7b-v0-1493K/mo0.3/mo7.25B2024-07-161yr 11mo agoMaziyarPanahi/mathstral-7B-v0.1-GGUF124qwen2-5-vl-72b-instruct492.8K/mo41.7/mo73.4B2025-01-271yr 4mo agoQwen/Qwen2.5-VL-72B-Instruct125yi-coder-9b-chat487.8K/mo0.4/mo8.83B2024-09-041yr 9mo agoMaziyarPanahi/Yi-Coder-9B-Chat-GGUF126nvidia-nemotron-3-ultra-550b-a55b483.4K/mo453/mo335B2026-06-0320d agonvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4127llama-3-1-405b-instruct482.9K/mo26.6/mo410B2024-07-241yr 10mo agoMaziyarPanahi/Meta-Llama-3.1-405B-Instruct-GGUF128kimi-k2-instruct-0905477.3K/mo77.1/mo1026B2025-09-039mo agomoonshotai/Kimi-K2-Instruct-0905129firefunction-v2473.5K/mo0.7/mo70.6B2024-06-192yr agoMaziyarPanahi/firefunction-v2-GGUF130qwen3-235b-a22b-instruct-2507469.5K/mo83.9/mo235B2025-07-2111mo agoQwen/Qwen3-235B-A22B-Instruct-2507-FP8131llama-3-8b-instruct-32k-v0-1469.4K/mo2.3/mo8.03B2024-04-242yr 1mo agoMaziyarPanahi/Llama-3-8B-Instruct-32k-v0.1-GGUF132qwen2-5-omni-3b466.7K/mo24.4/mo5.54B2025-04-301yr 1mo agoQwen/Qwen2.5-Omni-3B133gemma-4-e2b466.7K/mo96.6/mo5.12B2026-03-023mo agogoogle/gemma-4-E2B134qwen3-5-9b-deepseek-v4-flash464.8K/mo129.8/mo8.95B2026-04-291mo agoJackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF135deepseek-r1-distill-qwen-14b457.2K/mo38.4/mo14.8B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Qwen-14B136yi-1-5-6b-chat449.6K/mo0.4/mo6.06B2024-05-122yr 1mo agoMaziyarPanahi/Yi-1.5-6B-Chat-GGUF137wizardlm-2-7b449.3K/mo3.2/mo7.24B2024-04-152yr 2mo agoMaziyarPanahi/WizardLM-2-7B-GGUF138kimi-k2-7-code447.9K/mo975/mo1059B2026-06-1112d agomoonshotai/Kimi-K2.7-Code139llama-3-8b-instruct-64k445.2K/mo0.5/mo8.03B2024-04-252yr 1mo agoMaziyarPanahi/Llama-3-8B-Instruct-64k-GGUF140llama-2-13b-chat440.6K/mo31.6/mo13.0B2023-07-132yr 11mo agometa-llama/Llama-2-13b-chat-hf141nemotron-labs-diffusion-8b-base431.9K/mo1.1/mo8.49B2026-01-145mo agonvidia/Nemotron-Labs-Diffusion-8B-Base142molmo2-8b425.4K/mo30.1/mo8.66B2025-12-146mo agoallenai/Molmo2-8B143gemma-4-31b407.9K/mo125/mo32.7B2026-03-123mo agogoogle/gemma-4-31B144gemma-2-9b-it407.7K/mo34.7/mo9.24B2024-06-241yr 11mo agogoogle/gemma-2-9b-it145qwen3-30b-a3b-thinking-2507389.1K/mo36.4/mo30.5B2025-07-2910mo agoQwen/Qwen3-30B-A3B-Thinking-2507146lfm2-24b-a2b-mlx-6bit382.5K/mo0.8/mo23.8B2026-02-233mo agolmstudio-community/LFM2-24B-A2B-MLX-6bit147lfm2-24b-a2b-mlx-5bit381.7K/mo0.3/mo23.8B2026-02-233mo agolmstudio-community/LFM2-24B-A2B-MLX-5bit148mistral-large-instruct-2411380.8K/mo0.1/mo123B2024-11-181yr 7mo agoMaziyarPanahi/Mistral-Large-Instruct-2411-GGUF149nvidia-nemotron-nano-9b-v2380.1K/mo48.9/mo8.89B2025-08-1210mo agonvidia/NVIDIA-Nemotron-Nano-9B-v2150deepseek-coder-v2-lite-instruct378.2K/mo25.8/mo15.7B2024-06-142yr agodeepseek-ai/DeepSeek-Coder-V2-Lite-Instruct151qwen3-235b-a22b377.6K/mo79/mo235B2025-04-271yr 1mo agoQwen/Qwen3-235B-A22B152qwen2-5-omni-7b369.5K/mo128/mo10.7B2025-03-221yr 3mo agoQwen/Qwen2.5-Omni-7B153qwopus3-6-35b-a3b-v1368.3K/mo129.7/mo34.7B2026-05-061mo agoJackrong/Qwopus3.6-35B-A3B-v1-GGUF154jan-v3-5-4b367.2K/mo6.9/mo4.41B2026-03-233mo agojanhq/Jan-v3.5-4B-gguf155phi-3-mini-4k-instruct365.7K/mo<0.1/mo3.82B2024-04-252yr 1mo agokaitchup/Phi-3-mini-4k-instruct-gptq-4bit156tinygemma3353.8K/mo0.7/mo0.04B2025-05-061yr 1mo agoggml-org/tinygemma3-GGUF157mistral-small-24b-instruct-2501351.5K/mo2.5/mo23.6B2025-01-301yr 4mo agoMaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF158olmo-2-0425-1b348.6K/mo5.6/mo1.49B2025-04-171yr 2mo agoallenai/OLMo-2-0425-1B159gemma-2b342.9K/mo42.2/mo2.51B2024-02-082yr 4mo agogoogle/gemma-2b160qwen3-0-6b-base334.1K/mo12.6/mo0.60B2025-04-281yr 1mo agoQwen/Qwen3-0.6B-Base161qwen3-6-35b-a3b-uncensored-wasserstein326.8K/mo48.1/mo34.7B2026-04-162mo agoLuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Wasserstein-GGUF162glm-4-1v-9b-thinking320.8K/mo65.5/mo10.3B2025-06-2811mo agozai-org/GLM-4.1V-9B-Thinking163deepseek-r1-distill-llama-70b315.5K/mo46.6/mo70.6B2025-01-201yr 5mo agodeepseek-ai/DeepSeek-R1-Distill-Llama-70B164glm-5-1313.5K/mo768/mo754B2026-04-032mo agozai-org/GLM-5.1165qwen2-0-5b-instruct310.9K/mo8.2/mo0.49B2024-06-032yr agoQwen/Qwen2-0.5B-Instruct166gemma-4-12b310.2K/mo572.7/mo12.0B2026-05-231mo agogoogle/gemma-4-12B167kimi-k2-instruct307.5K/mo207/mo1026B2025-07-1111mo agomoonshotai/Kimi-K2-Instruct168meta-llama-3-1-8b-instruct305K/mo3.9/mounknown2024-07-191yr 11mo agohugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4169qwen3-1-7b-base300.6K/mo5.3/mo1.72B2025-04-281yr 1mo agoQwen/Qwen3-1.7B-Base170lfm2-5-1-2b-instruct-mlx-6bit300.6K/mo0.7/mo0.26B2026-01-075mo agolmstudio-community/LFM2.5-1.2B-Instruct-MLX-6bit171step3-vl-10b297.8K/mo77.2/mo10.2B2026-01-135mo agostepfun-ai/Step3-VL-10B172qwen2-5-3b293.3K/mo9/mo3.09B2024-09-151yr 9mo agoQwen/Qwen2.5-3B173phi-4287.6K/mo0.5/mo14.7B2025-01-081yr 5mo agoMaziyarPanahi/phi-4-GGUF174qwen2-5-math-1-5b283.9K/mo5.1/mo1.54B2024-09-161yr 9mo agoQwen/Qwen2.5-Math-1.5B175medgemma-4b-it272.5K/mo75.1/mo4.30B2025-05-191yr 1mo agogoogle/medgemma-4b-it176minimax-m2271.2K/mo186.8/mo229B2025-10-228mo agoMiniMaxAI/MiniMax-M2177qwopus3-5-9b-coder-mtp267.4K/mo156.5/mounknown2026-05-181mo agoJackrong/Qwopus3.5-9B-Coder-MTP-GGUF178qwen1-5-0-5b-chat264.6K/mo3.4/mo0.62B2024-01-312yr 4mo agoQwen/Qwen1.5-0.5B-Chat179vntl-llama3-8b-v2260.9K/mo0.8/mo8.03B2025-01-021yr 5mo agolmg-anon/vntl-llama3-8b-v2-gguf180llama-guard-3-8b259.3K/mo13.2/mo8.03B2024-07-221yr 11mo agometa-llama/Llama-Guard-3-8B181qwen3-vl-8b-thinking257K/mo25.1/mo8.77B2025-10-118mo agoQwen/Qwen3-VL-8B-Thinking182step-3-5-flash247.1K/mo174.9/mo199B2026-02-014mo agostepfun-ai/Step-3.5-Flash183qwen2-5-coder-1-5b-instruct246.9K/mo6.1/mo1.54B2024-09-181yr 9mo agoQwen/Qwen2.5-Coder-1.5B-Instruct184qwen2-audio-7b-instruct246.1K/mo23.7/mo8.40B2024-07-311yr 10mo agoQwen/Qwen2-Audio-7B-Instruct185nvidia-nemotron-nano-9b-v2-japanese244.7K/mo30.1/mo8.89B2026-02-044mo agonvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese186qwen2-5-coder-1-5b243K/mo4.4/mo1.54B2024-09-181yr 9mo agoQwen/Qwen2.5-Coder-1.5B187olmo-3-7b-instruct241.3K/mo18.9/mo0.00B2025-11-197mo agoallenai/Olmo-3-7B-Instruct188apertus-8b-instruct-2509240.2K/mo45.5/mo8.05B2025-08-1310mo agoswiss-ai/Apertus-8B-Instruct-2509189paligemma-3b-mix-224235.7K/mo3.9/mo2.92B2024-05-122yr 1mo agogoogle/paligemma-3b-mix-224190mixtral-8x22b-v0-1233.9K/mo2.9/mo141B2024-04-102yr 2mo agoMaziyarPanahi/Mixtral-8x22B-v0.1-GGUF191qwopus3-6-27b-coder-mtp231.9K/mo285/mo0.46B2026-06-1112d agoJackrong/Qwopus3.6-27B-Coder-MTP-GGUF192gemma-3n-e2b-it230.9K/mo24.6/mo5.44B2025-06-121yr agogoogle/gemma-3n-E2B-it193carnice-v2-27b223.3K/mo53.9/mo26.9B2026-04-251mo agokai-os/Carnice-V2-27b-GGUF194kimi-k2-thinking222.4K/mo223.8/mo1058B2025-11-047mo agomoonshotai/Kimi-K2-Thinking195deepseek-vl2-tiny220.8K/mo13.5/mo3.37B2024-12-131yr 6mo agodeepseek-ai/deepseek-vl2-tiny196deepseek-v2-lite-chat219.1K/mo5.6/mo15.7B2024-05-152yr 1mo agodeepseek-ai/DeepSeek-V2-Lite-Chat197deepseek-v3-1216.4K/mo81.8/mo685B2025-08-2110mo agodeepseek-ai/DeepSeek-V3.1198lfm2-5-8b-a1b215.6K/mo225/mo8.47B2026-05-2430d agoLiquidAI/LFM2.5-8B-A1B-GGUF199phi-4-mini-instruct214.9K/mo0.8/mo3.84B2025-03-011yr 3mo agoMaziyarPanahi/Phi-4-mini-instruct-GGUF200mimo-v2-5213.6K/mo172.8/mo311B2026-04-271mo agoXiaomiMiMo/MiMo-V2.5201gemma-4-26b-a4b210.7K/mo94.6/mo26.5B2026-03-123mo agogoogle/gemma-4-26B-A4B202medgemma-1-5-4b-it206.8K/mo125.9/mo4.30B2026-01-075mo agogoogle/medgemma-1.5-4b-it203gemma-3-270m-it204.6K/mo70.6/mo0.27B2025-07-3010mo agogoogle/gemma-3-270m-it204kimi-vl-a3b-instruct199.7K/mo18.5/mo16.4B2025-04-091yr 2mo agomoonshotai/Kimi-VL-A3B-Instruct205meta-llama-3-1-70b-instruct198.5K/mo4.7/mounknown2024-07-191yr 11mo agohugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4206cosmos-reason2-2b194K/mo16.5/mo2.44B2025-12-126mo agonvidia/Cosmos-Reason2-2B207qwq-32b191.1K/mo0.3/mo32.8B2025-03-061yr 3mo agoMaziyarPanahi/QwQ-32B-GGUF208devstral-small-2-24b-instruct-2512190.1K/mo0.6/mounknown2025-12-096mo agomlx-community/Devstral-Small-2-24B-Instruct-2512-4bit209mistral-small-instruct-2409188.3K/mo0.2/mo22.2B2024-09-171yr 9mo agoMaziyarPanahi/Mistral-Small-Instruct-2409-GGUF210chatglm3-6b188.1K/mo36.4/mo6.24B2023-10-252yr 7mo agozai-org/chatglm3-6b211gemma-4-31b-jang-4m-crack185.6K/mo86/mo30.7B2026-04-062mo agodouyamv/Gemma-4-31B-JANG_4M-CRACK-GGUF212tinyllama-1-1b-chat-v0-3181.2K/mo0.5/mo1.10B2023-10-032yr 8mo agoTheBloke/TinyLlama-1.1B-Chat-v0.3-GPTQ213qwen3-5-0-8b-base180.7K/mo21.7/mo0.87B2026-02-283mo agoQwen/Qwen3.5-0.8B-Base214qwen3-5-9b-base178.2K/mo21.8/mo9.65B2026-02-263mo agoQwen/Qwen3.5-9B-Base215llama-3-3-nemotron-super-49b-v1-5174.9K/mo21.4/mo49.9B2025-07-2510mo agonvidia/Llama-3_3-Nemotron-Super-49B-v1_5216qwen2-5-coder-7b173K/mo7.1/mo7.62B2024-09-161yr 9mo agoQwen/Qwen2.5-Coder-7B217mistral-7b-instruct-v0-2171.7K/mo18.4/mo7.24B2023-12-112yr 6mo agoTheBloke/Mistral-7B-Instruct-v0.2-GGUF218cosmos-reason2-8b166.6K/mo30.4/mo8.77B2025-12-126mo agonvidia/Cosmos-Reason2-8B219smolvlm-500m-instruct161.4K/mo3.6/mo0.41B2025-04-211yr 2mo agoggml-org/SmolVLM-500M-Instruct-GGUF220qwen3-5-4b-base161.1K/mo18.3/mo4.66B2026-02-273mo agoQwen/Qwen3.5-4B-Base221mistral-small-3-1-24b-instruct-2503157.9K/mo0.1/mo23.6B2025-03-181yr 3mo agoMaziyarPanahi/mistral-small-3.1-24b-instruct-2503-hf-GGUF222deepseek-v2-lite157.1K/mo7.1/mo15.7B2024-05-152yr 1mo agodeepseek-ai/DeepSeek-V2-Lite223qwen2-5-coder-3b-instruct150.9K/mo6.2/mo3.09B2024-11-061yr 7mo agoQwen/Qwen2.5-Coder-3B-Instruct224rnj-1-instruct147.7K/mo0.6/mo8.84B2025-12-076mo agoDoradus-AI/RnJ-1-Instruct-FP8225hunyuan-mt-7b147.2K/mo0.6/mo7.50B2025-09-059mo agoMungert/Hunyuan-MT-7B-GGUF226qwen3-next-80b-a3b-thinking145.3K/mo2.5/mo83.8B2025-09-129mo agocyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit227meta-llama-3-70b140.2K/mo33.5/mo70.6B2024-04-172yr 2mo agometa-llama/Meta-Llama-3-70B228melody1437-26b-a4b-v2-0139.1K/mo8/mo25.2B2026-06-0815d agoReadyArt/Melody1437-26B-A4B-v2.0-GGUF229serenity-26b-a4b135.7K/mo11/mo25.2B2026-06-1013d agoReadyArt/Serenity-26B-A4B-GGUF230hyperclovax-seed-text-instruct-1-5b131.1K/mo0.3/mo1.81B2025-04-241yr 1mo agorippertnt/HyperCLOVAX-SEED-Text-Instruct-1.5B-Q4_K_M-GGUF231ministral-3-3b-reasoning-2512128.9K/mo0.7/mo3.43B2025-12-026mo agoMaziyarPanahi/Ministral-3-3B-Reasoning-2512-GGUF232gemma-4-26b-a4b-it-heretic-fp8-static128.2K/mo1.6/mo25.8B2026-04-072mo agocloud19/gemma-4-26B-A4B-it-heretic-FP8-Static233dark-scarlett-v0-3-26b-a4b128K/mo5/mo25.2B2026-06-1211d agoReadyArt/Dark-Scarlett-v0.3-26B-A4B-GGUF234gemma-4-31b-it-uncensored-heretic126.7K/mo48.8/mo30.7B2026-04-032mo agollmfan46/gemma-4-31B-it-uncensored-heretic-GGUF235qwen3-5-27b-claude-4-6-opus-reasoning-distilled-v2123.8K/mo5/mo27.8B2026-03-302mo agoQuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ236qwen3-coder-480b-a35b-instruct121.6K/mo14.1/mo480B2025-07-2211mo agoQwen/Qwen3-Coder-480B-A35B-Instruct-FP8237qwen3-5-2b-base121.3K/mo20.6/mo2.27B2026-02-283mo agoQwen/Qwen3.5-2B-Base238nvidia-nemotron-nano-12b-v2-vl121.1K/mo10.4/mo13.2B2025-10-218mo agonvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16239qwen3-6-27b-uncensored-heretic-v2-native-mtp-preserved120.9K/mo86.7/mounknown2026-05-061mo agollmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GGUF240deepseek-v3-2-exp118.9K/mo112.5/mo685B2025-09-298mo agodeepseek-ai/DeepSeek-V3.2-Exp241tinyllama-1-1b-chat-v1-0118.1K/mo8.3/mo1.10B2023-12-312yr 5mo agoTheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF242gemma-4-12b-it-uncensored113.5K/mo44/mo11.9B2026-06-0419d agozaakirio/gemma-4-12b-it-uncensored-GGUF243qwen1-5-7b111.6K/mo1.9/mo7.72B2024-01-222yr 5mo agoQwen/Qwen1.5-7B244intellect-2110.9K/mo0.2/mo32.8B2025-05-121yr 1mo agoMaziyarPanahi/INTELLECT-2-GGUF245bielik-11b-v3-0-instruct108.5K/mo0.2/mo11.3B2025-12-315mo agospeakleash/Bielik-11B-v3.0-Instruct-awq246gemma-1-1-2b-it108K/mo6.4/mo2.51B2024-03-262yr 2mo agogoogle/gemma-1.1-2b-it247llada2-1-flash108K/mo20.8/mo103B2026-02-094mo agoinclusionAI/LLaDA2.1-flash248nex-n2-mini107.9K/mo8/mo37.1B2026-06-0815d agocyankiwi/Nex-N2-mini-AWQ-INT4249intern-s2-preview105K/mo18.3/mo36.1B2026-05-151mo agointernlm/Intern-S2-Preview-FP8250smollm-1-7b-instruct-quantized-w4a16104.9K/mo0/mo1.84B2024-08-231yr 9mo agonm-testing/SmolLM-1.7B-Instruct-quantized.w4a16251ernie-4-5-vl-28b-a3b-pt103.4K/mo8.8/mo29.4B2025-06-2811mo agobaidu/ERNIE-4.5-VL-28B-A3B-PT252gemma-4-e4b-it-ultra-uncensored-heretic101.8K/mo47.2/mo7.52B2026-04-052mo agollmfan46/gemma-4-E4B-it-ultra-uncensored-heretic-GGUF253qwen2-1-5b101.3K/mo4.1/mo1.54B2024-05-312yr agoQwen/Qwen2-1.5B254olmo-3-7b-instruct-sft96.3K/mo0.6/mo7.30B2025-11-177mo agoallenai/Olmo-3-7B-Instruct-SFT255qwen2-5-coder-3b96.2K/mo2.7/mo3.09B2024-11-081yr 7mo agoQwen/Qwen2.5-Coder-3B256mimo-v2-5-pro96.1K/mo353.3/mo1023B2026-04-271mo agoXiaomiMiMo/MiMo-V2.5-Pro257llama-3-3-nemotron-super-49b-v194.3K/mo22.9/mo49.9B2025-03-161yr 3mo agonvidia/Llama-3_3-Nemotron-Super-49B-v1258qwen2-5-math-1-5b-instruct93.7K/mo2.6/mo1.54B2024-09-161yr 9mo agoQwen/Qwen2.5-Math-1.5B-Instruct259qwen3-omni-30b-a3b-thinking91.9K/mo33.3/mo31.7B2025-09-159mo agoQwen/Qwen3-Omni-30B-A3B-Thinking260command-a-plus-05-2026-w4a491.6K/mo190.2/mo126B2026-05-181mo agoCohereLabs/command-a-plus-05-2026-w4a4261llada2-0-mini91.1K/mo9.7/mo16.3B2025-11-256mo agoinclusionAI/LLaDA2.0-mini262vertalily-1-2-1b87.5K/mo1.1/mo1.17B2026-01-075mo agoVLTX/VertaLily-1.2-1B-GGUF263step386.2K/mo15.3/mo321B2025-07-2810mo agostepfun-ai/step3264qwen2-5-math-7b-instruct81.5K/mo4.3/mo7.62B2024-09-191yr 9mo agoQwen/Qwen2.5-Math-7B-Instruct265qwen3-vl-32b-thinking80.1K/mo3.2/mo33.4B2025-10-198mo agoQwen/Qwen3-VL-32B-Thinking-FP8266ministral-3-14b-reasoning-251279.3K/mo0.2/mo3.68B2025-12-046mo agocyankiwi/Ministral-3-14B-Reasoning-2512-AWQ-4bit267llama-guard-4-12b78.8K/mo7.6/mo12.0B2025-04-231yr 2mo agometa-llama/Llama-Guard-4-12B268molmo2-o-7b76.6K/mo4.1/mo7.76B2025-12-146mo agoallenai/Molmo2-O-7B269deepseek-coder-6-7b-instruct75.6K/mo15.7/mo6.74B2023-10-292yr 7mo agodeepseek-ai/deepseek-coder-6.7b-instruct270minimax-vl-0174.7K/mo16.4/mo456B2025-01-121yr 5mo agoMiniMaxAI/MiniMax-VL-01271music-flamingo-260174.1K/mo18.2/mo8.27B2026-01-015mo agonvidia/music-flamingo-2601-hf272paligemma-3b-pt-22473.8K/mo19.1/mo2.92B2024-05-122yr 1mo agogoogle/paligemma-3b-pt-224273locateanything-3b73.7K/mo622.7/mo3.83B2026-03-023mo agonvidia/LocateAnything-3B274qwen3-5-35b-a3b-base72.7K/mo33.8/mo36.0B2026-02-243mo agoQwen/Qwen3.5-35B-A3B-Base275aya-vision-8b72.6K/mo20.5/mo8.63B2025-03-021yr 3mo agoCohereLabs/aya-vision-8b276nemotron-labs-diffusion-3b-base72K/mo2.2/mo3.83B2026-02-044mo agonvidia/Nemotron-Labs-Diffusion-3B-Base277glm-4-5v70.8K/mo68.9/mo108B2025-08-1010mo agozai-org/GLM-4.5V278mimo-7b-base69.2K/mo9.8/mo7.83B2025-04-291yr 1mo agoXiaomiMiMo/MiMo-7B-Base279kimi-vl-a3b-thinking68.6K/mo30.9/mo16.4B2025-04-091yr 2mo agomoonshotai/Kimi-VL-A3B-Thinking280glm-4-567.3K/mo126/mo358B2025-07-2011mo agozai-org/GLM-4.5281mistral-medium-3-5-128b66.8K/mo5/mo74.4B2026-04-301mo agoRecViking/Mistral-Medium-3.5-128B-NVFP4282sugoi-14b-ultra64.2K/mo1.2/mo14.8B2025-08-1910mo agosugoitoolkit/Sugoi-14B-Ultra-GGUF283nvlm-d-72b62.7K/mo37.4/mo79.4B2024-09-301yr 8mo agonvidia/NVLM-D-72B284hermes-4-14b58.5K/mo0.4/mo3.62B2025-09-039mo agocyankiwi/Hermes-4-14B-AWQ-4bit285mimo-7b-rl53.5K/mo20.1/mo7.83B2025-04-291yr 1mo agoXiaomiMiMo/MiMo-7B-RL286nvidia-nemotron-nano-12b-v251.6K/mo16.4/mo12.3B2025-08-2110mo agonvidia/NVIDIA-Nemotron-Nano-12B-v2287deepseek-coder-7b-instruct-v1-551.4K/mo5.4/mo6.91B2024-01-252yr 4mo agodeepseek-ai/deepseek-coder-7b-instruct-v1.5288internvl3-8b49.9K/mo0.6/mounknown2025-04-171yr 2mo agoOpenGVLab/InternVL3-8B-AWQ289qwen1-5-moe-a2-7b48.8K/mo8.1/mo14.3B2024-02-292yr 3mo agoQwen/Qwen1.5-MoE-A2.7B290olmoe-1b-7b-092447.6K/mo6.3/mo6.92B2024-07-201yr 11mo agoallenai/OLMoE-1B-7B-0924291jan-nano43.4K/mo0.3/mo4.47B2025-07-1211mo agowarshanks/Jan-nano-AWQ292sarvam-30b-fp8-dynamic38.8K/mo0.3/mo32.2B2026-03-093mo agoRedHatAI/sarvam-30b-FP8-dynamic293nemotron-mini-4b-instruct38.7K/mo8.6/mounknown2024-09-101yr 9mo agonvidia/Nemotron-Mini-4B-Instruct294regtech-32b-instruct-i137.7K/mo0/mo32.8B2026-02-164mo agomradermacher/RegTech-32B-Instruct-i1-GGUF295reformer-crime-and-punishment36.6K/mo0.2/mounknown2022-03-024yr 3mo agogoogle/reformer-crime-and-punishment296wildguard36.2K/mo2.1/mo7.25B2024-06-152yr agoallenai/wildguard297vikhr-nemo-12b-instruct-r-21-09-2435.4K/mo0.2/mo12.2B2024-09-221yr 9mo agoVlSav/Vikhr-Nemo-12B-Instruct-R-21-09-24-Q8_0-GGUF298hy-mt1-5-1-8b33.4K/mo17.5/mo1.79B2025-12-305mo agotencent/HY-MT1.5-1.8B-GGUF299qwen-vl32.7K/mo8.2/mounknown2023-08-182yr 10mo agoQwen/Qwen-VL300olmoe-1b-7b-0125-instruct32.1K/mo3.9/mo6.92B2025-01-271yr 4mo agoallenai/OLMoE-1B-7B-0125-Instruct301qwen1-5-moe-a2-7b-chat-quantized-w4a1631K/mo<0.1/mo14.4B2025-02-241yr 3mo agonm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16302eurollm-22b-instruct-251227.7K/mo10.8/mo22.6B2025-12-056mo agoutter-project/EuroLLM-22B-Instruct-2512303phi-4-reasoning-plus21.6K/mo0.9/mo7.84B2025-09-059mo agonvidia/Phi-4-reasoning-plus-NVFP4304mistral-small-3-2-24b-instruct-2506-awq-sym21.1K/mo1/mo24.2B2025-07-0111mo agojeffcookio/Mistral-Small-3.2-24B-Instruct-2506-awq-sym305apertus-70b-instruct-2509-quantized-w4a1619.7K/mo0.1/mo11.3B2025-09-219mo agoRedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16306exaone-3-5-32b-instruct16.3K/mo0.9/mo32.0B2024-12-011yr 6mo agoLGAI-EXAONE/EXAONE-3.5-32B-Instruct-AWQ307paligemma-3b-ft-cococap-44814.5K/mo0.1/mo2.92B2024-05-132yr 1mo agogoogle/paligemma-3b-ft-cococap-448308devstral-small-250513.1K/mo0.2/mo3.68B2025-05-211yr 1mo agomlx-community/Devstral-Small-2505-4bit309glm-4-32b-0414-w4a1610.9K/mo0.2/mo33.0B2025-05-041yr 1mo agomratsim/GLM-4-32B-0414.w4a16-gptq310hermes-3-llama-3-1-8b9.9K/mo0.1/mo8.03B2024-09-031yr 9mo agosolidrust/Hermes-3-Llama-3.1-8B-AWQ

Similar Articles

I benchmarked 21 local LLMs on a MacBook Air M5 for code quality AND speed

Reddit r/LocalLLaMA

A developer benchmarked 21 local LLMs on MacBook Air M5 using HumanEval+ and found Qwen 3.6 35B-A3B (MoE) leads at 89.6% with 16.9 tok/s, while Qwen 2.5 Coder 7B offers the best RAM-to-performance ratio at 84.2% in 4.5 GB. Notably, Gemma 4 models significantly underperformed expectations (31.1% for 31B), possibly due to Q4_K_M quantization effects.