embedding-compression

Tag

Cards List
#embedding-compression

DIVE: Embedding Compression via Self-Limiting Gradient Updates

arXiv cs.CL · 2026-05-21 Cached

Proposes DIVE, a compression adapter for embedding dimensionality reduction that uses self-limiting gradient updates and head-wise NT-Xent contrastive loss to prevent overfitting on small datasets, outperforming existing methods on BEIR benchmarks.

0 favorites 0 likes
#embedding-compression

A polynomial autoencoder beats PCA on transformer embeddings

Hacker News Top · 2026-05-05 Cached

This article introduces a polynomial autoencoder that improves upon PCA for compressing transformer embeddings by using a quadratic decoder to capture nonlinear variance. Benchmarks on BEIR show it significantly outperforms standard PCA and Matryoshka embeddings in retrieval quality while maintaining high compression ratios.

0 favorites 0 likes
#embedding-compression

Spectral Tempering for Embedding Compression in Dense Passage Retrieval

arXiv cs.CL · 2026-04-20 Cached

Spectral Tempering (SpecTemp) proposes a learning-free method for embedding compression in dense passage retrieval that adaptively determines optimal spectral scaling based on signal-to-noise ratio analysis, outperforming fixed hyperparameter approaches like PCA and whitening.

0 favorites 0 likes
← Back to home

Submit Feedback