Output-Space Allocation Costs for Calibration-Guided LLM Compression: An Empirical Study

arXiv cs.CL

This paper empirically investigates whether aligning the allocation cost with the output-space objective improves compressed model fidelity in ROCKET, a training-free LLM compression method. Results show a trade-off between accuracy and perplexity, with effects more pronounced at higher compression ratios.
