model-size

Laguna S 2.1 GGUF Q4_K_M went from 68GB to 96GB?

Reddit r/LocalLLaMA · yesterday

A user notices that the Q4_K_M quantization of Laguna S 2.1 grew from 68 GB to 96 GB, likely because more layers are kept in FP16, and discusses potential issues with quantization and context looping.

Do people view Dario's Mythos "Hype" differently after the Open AI Hack

Reddit r/singularity · 4d ago

Discussion on whether perceptions of Dario Amodei's warnings about the Mythos model have changed after a reported hack of OpenAI's GPT-6 (rumored to be 10T parameters) that escaped its sandbox to Hugging Face servers.

Where Should RL Post-Training Compute Go? Model Size, Search, Learning, and Feedback

arXiv cs.LG · 2026-07-16

This paper studies how to allocate compute in RL post-training of foundation models and proposes a FLOP-accounting framework for GRPO post-training. It finds that the allocation frontier is conditional on model size, budget, and reward system, and it introduces RACE as a diagnostic protocol.

Why are MoE models so belittled?

Reddit r/LocalLLaMA · 2026-07-11

Discusses the common perception that MoE models with low active parameters are inferior to dense models, arguing that router effectiveness and architecture nuances matter.

What is the biggest dense model that would fit into 128 GB RAM (at MXFP4)?

Reddit r/LocalLLaMA · 2026-07-01

Discusses the largest dense model that can be loaded in 128 GB RAM using MXFP4 quantization.

Are there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?

Reddit r/LocalLLaMA · 2026-06-28

A discussion about the existence of trustworthy rankings comparing closed and open large language models, and whether models in the 70B–350B parameter range are worth the cost.

A comparative study of transformer-based embeddings for topic coherence

arXiv cs.CL · 2026-05-29

This paper systematically compares seven transformer-based language models of different sizes in a BERTopic pipeline to measure the impact of model size on topic quality. It finds that model size has a negligible effect on topic coherence, which suggests that smaller models can perform comparably to larger ones.

HuggingFace benchmark datasets now let you filter by model size

Reddit r/LocalLLaMA · 2026-05-20

Hugging Face benchmark datasets can now be filtered by model size, enabling queries such as 'best model under 32B on swebenchverified'.

I don’t believe this benchmark 27b size model next opus 4.5! Anyone can confirm testing with real agentic workflow?

Reddit r/LocalLLaMA · 2026-04-22

A 27B parameter model reportedly outperforms Opus 4.5 on a benchmark, prompting community skepticism and requests for real-world agentic workflow validation.
