adaptive

Tag

#adaptive

Adaptive Multi-Resolution Procedural Knowledge Compression for Large Language Models

Hugging Face Daily Papers · 2d ago

SKIM is an adaptive multi-resolution soft token compression framework that compresses procedural skills for LLMs, maintaining task performance while reducing prefill cost and latency.


AdaPLD: Adaptive Retrieval and Reuse for Efficient Model-Free Speculative Decoding

arXiv cs.CL · 2026-06-05

AdaPLD is a training-free method that improves model-free speculative decoding by using adaptive retrieval combining lexical and semantic similarity, and constructing branched reuse hypotheses to handle continuation uncertainty, achieving up to 3.10x decoding speedup.


CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models

arXiv cs.LG · 2026-05-29

This paper presents CosmicFish-HRM, a compact 82.77M-parameter language model with a hierarchical reasoning module that dynamically allocates reasoning compute during inference and learns when to halt based on input complexity.


Consistently Informative Soft-Label Temperature for Knowledge Distillation

arXiv cs.LG · 2026-05-21

Proposes CIST, a method that assigns separate sample-wise adaptive temperatures to teacher and student in knowledge distillation, producing consistently informative soft labels and relaxing rigid logit-scale matching. Experiments on vision and language tasks show consistent improvements over standard KD.


Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

arXiv cs.LG · 2026-05-20

A new semantic-adaptive eviction policy for LLM prefix caches that learns token reuse patterns across different token types, achieving a 1.4x-2.7x improvement in time to first token (TTFT) over existing policies.
