SenseNova U1 dropped an infographic-specific finetune

Reddit r/LocalLLaMA Models

Summary

SenseNova U1 releases an infographic-specific finetune of its U1-8B-MoT base model, achieving significant benchmark improvements in infographic accuracy, chart understanding, and text rendering.

it's the same U1-8B-MoT base with an extended MT (multi-task) training phase focused on structured visual output. the benchmark jumps are significant:

IGenBench I-ACC (infographic accuracy): 4.2 👉 17.0 (4x)
Chart Understanding: 51.3 👉 69.5
Text Rendering: 39.8 👉 46.6
Overall Aesthetic: 53.8 👉 53.3

Repo: https://github.com/OpenSenseNova/SenseNova-U1
GitHub (infographic model docs): https://github.com/OpenSenseNova/SenseNova-U1/blob/main/docs/u1_infographic_model.md
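To put the quoted numbers side by side, here is a minimal Python sketch that computes the point change and the after/before ratio for each benchmark. The scores are copied from the post; the `scores` dictionary and the `summarize` helper are illustrative names, not part of the SenseNova repo.

```python
# Scores as quoted in the post: (before, after) for each benchmark.
scores = {
    "IGenBench I-ACC": (4.2, 17.0),
    "Chart Understanding": (51.3, 69.5),
    "Text Rendering": (39.8, 46.6),
    "Overall Aesthetic": (53.8, 53.3),
}


def summarize(before, after):
    """Return (absolute change in points, ratio after/before), rounded."""
    return round(after - before, 1), round(after / before, 2)


# Hypothetical helper output: one (delta, ratio) pair per benchmark.
summary = {name: summarize(b, a) for name, (b, a) in scores.items()}

for name, (delta, ratio) in summary.items():
    print(f"{name}: {delta:+.1f} points ({ratio:.2f}x)")
```

Run as written, this shows I-ACC at roughly 4.05x its previous score, which matches the "4x" in the post, while Overall Aesthetic moves by -0.5 points, so that one is essentially flat rather than a jump.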
Similar Articles

sensenova/SenseNova-U1-8B-MoT

Hugging Face Models Trending

SenseNova U1 is a new series of native multimodal models that unify understanding and generation within a single architecture using the NEO-Unify framework, eliminating the need for separate visual encoders or VAEs.

Introducing Nano Banana Pro

Google DeepMind Blog

Google DeepMind introduces Nano Banana Pro, a new state-of-the-art image generation and editing model built on Gemini 3 Pro. The model offers improved text rendering, enhanced world knowledge integration, and high-fidelity visual capabilities available across Google products.

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Google DeepMind Blog

Google DeepMind launches Nano Banana 2, an image generation model that combines the advanced capabilities of Nano Banana Pro with the speed of Gemini Flash. The model features improved subject consistency, precise text rendering, and is integrated into Google products like Gemini and Search.