@liquidai: Introducing LFM2.5-230M: our smallest model yet, built to run fast anywhere (CPUs, NPUs, and GPUs) to enable agentic ta…


Summary

Liquid AI releases LFM2.5-230M, a 230M-parameter model optimized for fast inference on CPUs, NPUs, and GPUs, targeting agentic tasks on devices such as phones and robots.

Introducing LFM2.5-230M: our smallest model yet, built to run fast anywhere (CPUs, NPUs, and GPUs) to enable agentic tasks on phones, robots, home and network automation devices.

> 230M parameters, built on the LFM2 architecture
> Pre-trained on 19T tokens, with a 32K context extension
> Post-trained with distillation from LFM2.5-350M
> 213 tok/s decode speed on Galaxy S25 Ultra (CPU)
> 42 tok/s on a Raspberry Pi 5 (CPU)
> Competes with and often beats models more than twice its size on instruction following, data extraction, and tool use.
> Use it for large-scale data extraction pipelines or lightweight on-device agentic workloads.
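To put the quoted figures in practical terms, here is a rough back-of-the-envelope sketch in Python. It uses only the parameter count and decode speeds from the announcement. The bytes-per-parameter widths and the 256-token response length are assumptions for illustration, not figures from Liquid AI, and the estimate ignores prefill time, KV cache, and runtime overhead.

```python
# Figures quoted in the announcement.
PARAMS = 230e6
DECODE_TOK_PER_S = {"Galaxy S25 Ultra (CPU)": 213, "Raspberry Pi 5 (CPU)": 42}

# ASSUMPTION: common storage widths in bytes per parameter.
# The announcement does not say which precision the benchmarks used.
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_megabytes(params: float, bytes_per_param: float) -> float:
    """Approximate weight storage in MB (10^6 bytes), weights only."""
    return params * bytes_per_param / 1e6


def decode_seconds(num_tokens: int, tok_per_s: float) -> float:
    """Time to generate num_tokens at a steady decode rate, excluding prefill."""
    return num_tokens / tok_per_s


if __name__ == "__main__":
    for name, width in BYTES_PER_PARAM.items():
        print(f"{name}: ~{weight_megabytes(PARAMS, width):.0f} MB of weights")
    # ASSUMPTION: 256 tokens as a stand-in for one short agent response.
    for device, rate in DECODE_TOK_PER_S.items():
        print(f"{device}: ~{decode_seconds(256, rate):.1f} s for 256 tokens")
```

Under these assumptions the weights alone come to roughly 460 MB at 16-bit precision, and a 256-token reply takes a little over a second on the Galaxy S25 Ultra and about six seconds on the Raspberry Pi 5.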

Cached at: 06/25/26, 03:25 PM


Similar Articles

Liquid AI releases LFM2.5-8B-A1B

Reddit r/LocalLLaMA

Liquid AI released LFM2.5-8B-A1B, an edge model with a 128K context window, 38T tokens of pre-training, and large-scale reinforcement learning, capable of tool calling and complex tasks while fitting on an entry-level laptop.

LiquidAI/LFM2.5-8B-A1B-GGUF

Hugging Face Models Trending

Liquid AI releases a GGUF-quantized version of its LFM2.5-8B-A1B model, with instructions for use across multiple inference engines.

New LFM2.5 8b A1b model!!

Reddit r/LocalLLaMA

Introducing LFM2.5 8b A1b, a new AI model with performance on par with Nemotron 3 Nano but at higher speed. Support is being added to SmallCode for non-standard tool calls.