preemption

Tag

Cards List
#preemption

Linguistic Productivity in Large Language Models: Models Coerce, but do not Preempt

arXiv cs.CL · 2026-06-03 Cached

This paper investigates whether Large Language Models exhibit the same usage-based linguistic productivity constraints (entrenchment and preemption) as humans, finding that models can reproduce coercion but fail to apply statistical preemption to avoid overgeneralization.

0 favorites 0 likes
#preemption

Towards Multi-Model LLM Schedulers: Empirical Insights into Offloading and Preemption

arXiv cs.AI · 2026-05-20

This paper presents an empirical study on scheduling multiple LLMs on shared heterogeneous hardware, focusing on performance implications of CPU-GPU offloading and preemption. It finds that offloading causes non-linear decode degradation, especially for smaller models, and preemption overhead is dominated by model state reload, providing design guidance for future multi-model schedulers.

0 favorites 0 likes
← Back to home

Submit Feedback