segmented-execution

Tag

Cards List
#segmented-execution

Training-Inference Consistent Segmented Execution for Long-Context LLMs

arXiv cs.CL · 12h ago Cached

This paper proposes a training-inference consistent segmented execution framework for long-context LLMs to address the mismatch between full-context training and restricted inference regimes, achieving comparable performance with significantly reduced memory usage.

0 favorites 0 likes
← Back to home

Submit Feedback