latent-compression

#latent-compression

Generic Triple-Latent Compression with Gated Associative Retrieval

arXiv cs.CL ↗ · 4d ago Cached

This paper introduces generic triple-latent recurrent models that compress token pair interactions into a latent state, and a gated associative retrieval variant that improves exact recall. The hybrid model outperforms Transformers on byte-level WikiText-2 and a tokenized language benchmark, achieving up to 41.9% associative recall versus 25%.

0 favorites 0 likes

latent-compression

Generic Triple-Latent Compression with Gated Associative Retrieval

Submit Feedback