semantic-aware

#semantic-aware

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

arXiv cs.LG ↗ · 2026-05-20

A new semantic-adaptive eviction policy for LLM prefix caches that learns token reuse patterns across different token types, achieving 1.4x-2.7x TTFT improvement over existing policies.

0 favorites 0 likes

#semantic-aware

Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

arXiv cs.AI ↗ · 2026-05-14 Cached

This paper proposes CTO, a method that improves code translation by combining syntax-guided and semantic-aware preference optimization through contrastive learning and direct preference optimization, achieving significant improvements over existing baselines in C++, Java, and Python translations.

0 favorites 0 likes

semantic-aware

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

Submit Feedback