asymmetric-compression

#asymmetric-compression

@no_stp_on_snek: Always start with uncompressed k and compressed V and go more aggressively from there. Model families have different se…

X AI KOLs Following · 2026-05-23

A tip on KV-cache compression for transformer models: start with uncompressed keys and compressed values, then adjust based on model family sensitivity; try asymmetric before symmetric compression.
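As a rough illustration of the starting point described above, the sketch below keeps the cached keys at full precision, pushes only the values through a quantize-dequantize round trip, and then compresses the keys too as the more aggressive follow-up. The post does not name a compression scheme, so the per-row uniform quantizer, the 4-bit width, the random test data, and every helper name here (`quantize_rows`, `attention`, `max_abs_err`) are assumptions made for the example, not the author's method.

```python
import math
import random

def quantize_rows(mat, bits):
    """Per-row uniform quantize-dequantize round trip (hypothetical helper)."""
    levels = (1 << bits) - 1
    out = []
    for row in mat:
        lo, hi = min(row), max(row)
        scale = (hi - lo) / levels or 1.0  # fall back to 1.0 on constant rows
        out.append([lo + round((x - lo) / scale) * scale for x in row])
    return out

def attention(q, K, V):
    """Single-query softmax attention over cached keys K and values V."""
    d = len(q)
    scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in K]
    m = max(scores)  # subtract the max for numerical stability
    w = [math.exp(s - m) for s in scores]
    z = sum(w)
    return [sum(wi / z * v[j] for wi, v in zip(w, V)) for j in range(len(V[0]))]

def max_abs_err(a, b):
    """Largest element-wise absolute difference between two vectors."""
    return max(abs(x - y) for x, y in zip(a, b))

# Synthetic cache: random data stands in for a real model's K and V tensors.
rng = random.Random(0)
tokens, dim = 32, 16
q = [rng.gauss(0, 1) for _ in range(dim)]
K = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(tokens)]
V = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(tokens)]

reference = attention(q, K, V)

# Starting point from the tip: keys untouched, values compressed.
asym_err = max_abs_err(reference, attention(q, K, quantize_rows(V, 4)))

# More aggressive follow-up: compress the keys as well.
sym_err = max_abs_err(
    reference, attention(q, quantize_rows(K, 4), quantize_rows(V, 4))
)

print(f"K full precision, V 4-bit: max abs error {asym_err:.4f}")
print(f"K 4-bit, V 4-bit:          max abs error {sym_err:.4f}")
```

Because the attention output is a weighted average of the value rows, the error in the values-only case cannot exceed the largest per-element rounding error in V. Key error instead passes through the softmax before it reaches the output, so its effect depends on the score distribution, which is one plausible reading of why sensitivity would vary by model family. The numbers printed on random data say nothing about a real model; the comparison is meant to be rerun on actual cached tensors.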

