semantic-cache

Tag

Cards List
#semantic-cache

Building an Open Source Edge Semantic Cache for LLMs in Rust/WASM – Sanity check on the architecture? [D]

Reddit r/MachineLearning · 10h ago

Proposes building an open-source, lightweight semantic cache for LLMs using Rust/WASM at the CDN edge to reduce latency and API costs, seeking community feedback on architecture and use-case validity.

0 favorites 0 likes
#semantic-cache

Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching

arXiv cs.LG · 3d ago Cached

This paper proposes Semantic Cache Distillation (SCD), a loss-constrained framework that replaces raw KV cache transmission with compact semantic codes, achieving up to 2.65x TTFT speedup while keeping generation quality within 5% F1 of the oracle.

0 favorites 0 likes
← Back to home

Submit Feedback