@cjzafir: VLMs (Vertical Language Models) are beating top LLMs. These small 7B to 15B niche-focused models are beating SoTA model…

X AI KOLs Timeline 05/08/26, 02:00 PM News

Summary

The author demonstrates that small vertical language models (6B-15B) can outperform top LLMs on niche benchmarks through cost-effective fine-tuning using open-source models and Codex orchestration, achieving results with a $300 dataset.

VLMs (Vertical Language Models) are beating top LLMs. These small 7B to 15B niche-focused models are beating SoTA models in their niche benchmarks. I post-trained a 6B dense model in 15 days and beat Sonnet 4.6 and Gemini 3 Flash. I use Codex 5.5 (Extra High) to plan the SFT dataset scope, then I use DeepSeek v4 Pro & Kimi 2.6 API to generate handwritten examples. (No synthetic, templated datasets.) Codex runs each batch through quality gates and filters out all the weak data. I was able to build a 350M-parameter dataset for just $300 using Codex as the orchestrator & DS + Kimi as the executors. I can compete with giant data labs on my own, beat their VLMs, and not break the bank. This happened only because of open-source models, as they're fighting neck-and-neck with SoTA models. If I had to start a career right now, I'd start an agency that fine-tunes SLMs (small language models) for enterprises. I'd charge them a $10k to $20k one-time fee. Use Qwen 3.5 or Gemma 4 as base models, use Codex as the brain and DeepSeek v4 + Kimi as the muscle, and post-train a strong SLM under $1000. This might feel far-fetched, but in 6 months you'll see agencies like these. Not everything requires an LLM. SLMs can achieve vertical intelligence if properly trained with 10x lower cost, no privacy issues, and full control over the model. I'll be sharing technical findings here on X on the go. If you enjoy nerdy fine-tuning stuff, stay tuned.

Original Article

@cjzafir: VLMs (Vertical Language Models) are beating top LLMs. These small 7B to 15B niche-focused models are beating SoTA model…

Similar Articles

Are super tiny LLMs any good?

Large Language Models Are Overkill For Some Marketing Tasks. Enter The Small Language Model

Small LLMs for Biomedical Claim Verification: Cost-Effective Fine-Tuning, Structural Dataset Shortcuts, and Cross-Domain Generalization

@paulabartabajo_: Advice for AI engineers A small Visual Language Model fine-tuned on your custom dataset is as accurate as GPT-5... ... …

A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction

Submit Feedback

Similar Articles

Large Language Models Are Overkill For Some Marketing Tasks. Enter The Small Language Model

Small LLMs for Biomedical Claim Verification: Cost-Effective Fine-Tuning, Structural Dataset Shortcuts, and Cross-Domain Generalization

@paulabartabajo_: Advice for AI engineers A small Visual Language Model fine-tuned on your custom dataset is as accurate as GPT-5... ... …

A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction