segmented-execution

Training-Inference Consistent Segmented Execution for Long-Context LLMs

arXiv cs.CL · 12h ago

This paper proposes a segmented execution framework for long-context LLMs that keeps training and inference consistent. It targets the mismatch between full-context training and restricted inference regimes, and reports comparable performance with significantly reduced memory usage.
