@dphnAI: Dolphin X1 Trinity Nano is now live on @huggingface Our smallest decensored model yet - 6B MoE with 1B active parameter…
Summary
Dolphin X1 Trinity Nano, a 6B Mixture-of-Experts model with 1B active parameters, has been released on Hugging Face. It is the smallest decensored model yet, trained using only online reinforcement learning.
View Cached Full Text
Cached at: 05/29/26, 09:55 PM
Dolphin X1 Trinity Nano is now live on @huggingface
Our smallest decensored model yet - 6B MoE with 1B active parameters trained using only online RL
Huge thanks to @TargonCompute for providing an 8xB200 node, @PrimeIntellect for hosted RL, and @arcee_ai for the Trinity series https://t.co/2hwnhrc7t2
Similar Articles
@Montreal_AI: A 0.6B model learned to manage giants. That is the idea behind TRINITY, a new ICLR 2026 paper by Jinglue Xu, Qi Sun, Pe…
TRINITY is a lightweight 0.6B parameter coordinator that learns to orchestrate multiple LLMs by assigning them roles (Thinker, Worker, Verifier) using an evolutionary strategy. It outperforms individual models and existing coordination methods across coding, math, reasoning, and domain knowledge tasks.
Dolphin-CN-Dialect: Where Chinese Dialects Matter
Dolphin-CN-Dialect is a streaming-capable ASR model that improves dialect recognition through temperature-based sampling and redesigned tokenization, achieving competitive performance with a smaller model size.
@abidlabs: Remarkable for an 8B model! Check out the @Gradio app here: https://huggingface.co/spaces/LiquidAI/LFM2.5-8B-A1B…
Liquid AI releases LFM2.5-8B-A1B, an 8B MoE model with 1.5B active parameters and 128K context, optimized for edge devices.
Introducing Nano Banana Pro
Google DeepMind introduces Nano Banana Pro, a new state-of-the-art image generation and editing model built on Gemini 3 Pro. The model offers improved text rendering, enhanced world knowledge integration, and high-fidelity visual capabilities available across Google products.
Nemotron 3 Ultra. 550 billion parameters, 55B active. 1 million context
NVIDIA releases Nemotron 3 Ultra, a massive 550 billion parameter mixture-of-experts model with 55B active parameters and a 1 million token context window.