The author demonstrates that small, domain-specific ("vertical") language models (6B–15B parameters) can outperform top LLMs on niche benchmarks through cost-effective fine-tuning of open-source models with Codex orchestration, using a dataset built for $300.
This paper compares a domain-trained small language model (Olava Extract) against frontier LLMs for structured contract extraction, showing that the specialized model achieves higher F1 scores and dramatically lower cost.
A developer trained a 350M-parameter model that navigates spreadsheets better than Anthropic's Opus 4.6.
SCURank introduces Summary Content Units to rank candidate summaries, enabling small models distilled from multiple LLMs to outperform both traditional metrics and models distilled from a single LLM.
SmolDocling is a compact 256M-parameter vision-language model for end-to-end multi-modal document conversion. It introduces DocTags, a universal markup format that captures page elements together with their locations, and competes with models 27 times larger.