visual-quantization

Tag

Cards List
#visual-quantization

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Hugging Face Daily Papers · 2026-06-25 Cached

ViQ presents a visual quantization framework that balances semantic richness and detail preservation in discrete representations, enabling efficient multimodal training with native-resolution inputs by using text-aligned pre-training and proximal representation learning.

0 favorites 0 likes
← Back to home

Submit Feedback