small-models

@OpenBMB: Just a quick reminder: Build Small Hackathon sign-up closes on June 3! Total cash prizes: ~$40K $10K @OpenBMB Special A…

X AI KOLs Following ↗ · 2026-06-01 Cached

OpenBMB is sponsoring the Build Small Hackathon, which offers $40k+ in prizes for apps built on small models (≤32B parameters) with Gradio on Hugging Face Spaces. Registration closes June 3, 2026.

@yibie: Training Small Models: The Most Underrated AI Skill in 2026 On May 11, 2026, a person named CJ Zafir posted a tweet. He wanted to teach ordinary people to fine-tune open source models. 2,538 likes, 316 retweets, 178,000 views. This tweet blew up…

X AI KOLs Timeline ↗ · 2026-06-01 Cached

In May 2026, a tweet by CJ Zafir about teaching ordinary people to fine-tune open source models drew widespread attention (2,538 likes, 316 retweets, 178,000 views). The author uses it to argue that training small models is the most underrated AI skill of 2026.

@mr_r0b0t: It’s no secret I believe specialist small models are part of a well run local agent team. The one below is definitely g…

X AI KOLs Timeline ↗ · 2026-05-30 Cached

A new small AI model, Qwopus 3.5-Coder 4B, is highlighted as a candidate for specialist roles in local agent teams, with potential for fine-tuning and dataset generation.

@ttunguz: I've been using state-of-the-art models to teach small models running on my computer how I work. The result : a persona…

X AI KOLs Following ↗ · 2026-05-29 Cached

Using large AI models to train smaller local models, the author built a personal agent that manages email, calendar, deals, blog, and research.

@abidlabs: This is our 3rd Gradio global hackathon, and the one I've been most excited about -- focused entirely on local, "small"…

X AI KOLs Following ↗ · 2026-05-28 Cached

Gradio's third global hackathon, 'Build Small,' focuses entirely on local AI models of up to 32 billion parameters, with prizes from OpenAI, NVIDIA, OpenBMB, and Cohere worth over $40k in cash plus hardware and credits.

@Gradio: A hackathon called "Build Small" max 32B params. the model fits on a laptop. somehow that pitch got us OpenAI, NVIDIA, …

X AI KOLs Timeline ↗ · 2026-05-28 Cached

'Build Small,' a hackathon for models of at most 32B parameters that fit on a laptop, has attracted sponsors including OpenAI, NVIDIA, OpenBMB, and Cohere, offering over $40k in cash, RTX 5080s, and Codex credits.

@FradSer: The most interesting thing I've done so far: Trying a series of methods to make models like gpt-oss:20b and gemma4:e4b approach Opus 4.7's level under certain conditions

X AI KOLs Timeline ↗ · 2026-05-23 Cached

The author tries a series of methods to make models such as gpt-oss:20b and gemma4:e4b approach Opus 4.7's performance level under certain conditions.

Benchmarked Needle 26M vs Qwen3-0.6B on CPU function calling, 50 queries across 5 difficulty tiers. The 23x smaller model wins on accuracy and is 4.4x faster.

Reddit r/LocalLLaMA ↗ · 2026-05-23

A benchmark comparing Needle 26M and Qwen3-0.6B on CPU function calling shows the smaller Needle model wins in accuracy and speed, but with distinct failure modes: Needle picks the wrong tool while Qwen3 often fails to emit tool calls.

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

Reddit r/LocalLLaMA ↗ · 2026-05-21

A new paper shows that small open-source AI models can shift from honest to dishonest behavior when the tone of the prompt changes: under pressure, honesty drops from 35% to 0%. The research also finds that interpretability tools may fail to detect the most dishonest states.

What if i really wanna train an AI from scratch?

Reddit r/artificial ↗ · 2026-05-19

A personal reflection on the challenges and allure of training an AI model from scratch, highlighting the difficulties with data, hardware, and scaling, while noting that surprisingly good small models can be trained on modest hardware.

Floor for local meeting summarization on a 6GB GPU: qwen3.5:0.8b works at 57s, Granite 4 350M hallucinates

Reddit r/LocalLLaMA ↗ · 2026-05-19

The author introduces VoiceFlow, an open-source local dictation and meeting transcription tool, and benchmarks small LLMs (qwen3.5:0.8b and Granite 4 350M) for meeting summarization on a 6GB GPU, finding the 0.8B Qwen viable while sub-500M models hallucinate. They also ask the community for long-context summarization solutions on low VRAM.

Are super tiny LLMs any good?

Reddit r/singularity ↗ · 2026-05-19

The post explores whether very small language models can handle casual conversation adequately, and which training factors set the better ones apart.

I built a coding agent that gets 87% on benchmarks with a 4B parameter model, here's how

Reddit r/LocalLLaMA ↗ · 2026-05-18

The author built SmallCode, a coding agent optimized for small local models, achieving 87% benchmark success with a 4B parameter model using techniques like compound tools, improvement loops, and token budgeting.

The power of structured workflows and small local models

Reddit r/LocalLLaMA ↗ · 2026-05-17

The author details their experience building a custom agent loop using a small local model (Qwen3.5 9B) with structured workflows and a map-reduce pattern to manage context limits, replacing Claude Code for most tasks.

@rohanpaul_ai: So much possibilities for on-device small models. Here @adrgrondin is running Google’s Gemma 4 E2B on iPhone 17 Pro. ~4…

X AI KOLs Following ↗ · 2026-05-17 Cached

Google's Gemma 4 E2B is demonstrated running on an iPhone 17 Pro via MLX optimization, achieving ~40 tokens/second with 128K context and offline thinking mode for coding and math.

[FOUNDING] SupraLabs - real open-source AI models for you!

Reddit r/LocalLLaMA ↗ · 2026-05-15

SupraLabs announces its founding, with a focus on training and releasing open-source small language models (SLMs) for edge devices. It has already published models such as Supra-Mini-v4-2M on Hugging Face.

@EBorgnia: Today we're launching Jacq. A coding agent built together with the small models we've been training at @relace_ai for t…

X AI KOLs Following ↗ · 2026-05-13

Jacq is a cloud-based coding agent that integrates with Slack, Linear, GitHub, email, and other tools, using small models trained by Relace AI to pull context from connected devices and maintain durable threads for work history.

Reinforcing Recursive Language Models (18 minute read)

TLDR AI ↗ · 2026-05-13 Cached

The article explores reinforcement learning fine-tuning of small (4B) recursive language models (RLMs) to perform evidence selection from scientific documents, showing that RL-trained 4B models match Claude Sonnet 4.6 performance at a fraction of the size and cost.

First time fine-tuning, need a sanity check — 3B or 7B for multi-task reasoning? [D]

Reddit r/MachineLearning ↗ · 2026-04-23

A self-taught developer asks for advice on choosing between 3B and 7B models for a first multi-task fine-tuning project focused on deeper reasoning about underlying questions.

Meta-Tool: Efficient Few-Shot Tool Adaptation for Small Language Models

arXiv cs.CL ↗ · 2026-04-23 Cached

An independent study finds that a 227M-parameter hypernetwork adds zero gain over well-crafted few-shot prompts for tool use in a 3B Llama model, which achieves 79.7% of GPT-5 performance at 10× lower latency.
