Are super tiny LLMs any good?

Reddit r/singularity Models

Summary

Explores whether very small language models can handle casual conversations adequately, and what training factors differentiate the better ones.

If you’re not coding, not asking complex logical questions, but still want a model that isn’t completely stupid for casual conversations, are there any super tiny models out there that do an ok job? Which ones, and what makes them good, how were they trained and weighted that made them better than other tiny models?
Original Article

Similar Articles

Has anyone here used SLMs inside agent workflows?

Reddit r/AI_Agents

A user asks the community about using small/local language models within agent workflows for specific tasks like routing, classification, and extraction, and shares thoughts on whether larger models are always necessary.

How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework

arXiv cs.CL

This paper introduces a register-aware linguistic evaluation framework to assess how human-like large language models (LLMs) are by comparing the distribution of 67 lexico-grammatical features between human and LLM-generated texts using Maximum Mean Discrepancy. Experiments across seven instruction-tuned open-source models and five registers show that no model perfectly matches human baselines, and closeness to human language varies by register rather than model size.