streaming-llm

Tag

Cards List
#streaming-llm

@ickma2311: Efficient AI Lecture 15: Long-Context LLM Long context is not just a bigger prompt window. The key question is: which p…

X AI KOLs Timeline · 2026-05-25 Cached

This post summarizes Efficient AI Lecture 15 on long-context LLMs, covering RoPE position interpolation for context extension, the needle-in-haystack evaluation, and StreamingLLM's attention sink phenomenon and KV cache eviction strategy.

0 favorites 0 likes
← Back to home

Submit Feedback