crowd-counting

ABACUS: Adapting Unified Foundation Model for Bridging Image Count Understanding and Generation

Hugging Face Daily Papers · 2026-06-22

ABACUS is a unified vision-language model that handles multiple counting tasks and count-faithful image generation without benchmark-specific training, achieving state-of-the-art results across seven benchmarks.
