dct

Tag

Cards List
#dct

Chiaroscuro Attention: Spending Compute in the Dark

Hugging Face Daily Papers · 6d ago Cached

CHIAR-Former uses spectral entropy-based routing to dynamically select between DCT, RBF, and self-attention operators, achieving improved efficiency on large text datasets while maintaining performance through hybrid attention mechanisms.

0 favorites 0 likes
← Back to home

Submit Feedback