Squeeze-Release: Iterative Pruning with Exact Structural Minimization

Hugging Face Daily Papers 06/12/26, 12:00 AM Papers

Summary

This paper introduces Squeeze-Release, an iterative pruning method that achieves exact structural minimization.

Unstructured pruning produces sparse weight tensors, but the standard implementation keeps tensor shapes unchanged, so the deployed model is no smaller than before pruning. We present an exact structural rewrite, which we call minimization, that converts a masked network into a smaller dense network with the same forward function up to floating-point rounding. The Squeeze-Release cycle iterates pruning and minimization with an intermediate release step that re-enables the exact-zero positions inside the compacted tensors as small calibrated noise, turning otherwise wasted capacity back into trainable parameters. Successive cycles use that capacity to find structural redundancy that a single pass cannot reach. We additionally introduce CompensatedLayerNorm, a function-preserving replacement for LayerNorm that extends minimization to channel reduction across LayerNorm-equipped residual streams. Squeeze-Release compresses the deployable network to 39x smaller than the unpruned model on a fully-connected network and 14.8x smaller on a modern CNN (ConvNeXt-Tiny), at comparable accuracy. In addition, we prove that the rewrite can be extended to transformer architectures.
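To make the idea of minimization concrete, here is a minimal NumPy sketch for a two-layer ReLU MLP. It is an illustration under stated assumptions, not the paper's algorithm: the function names `minimize_mlp` and `forward` are hypothetical, and the rule used (drop hidden units with all-zero incoming or outgoing weights, folding any constant output into the next bias) is only one simple case of a function-preserving structural rewrite.

```python
import numpy as np

def minimize_mlp(W1, b1, W2, b2):
    """Drop hidden units that a masked 2-layer ReLU MLP cannot use.

    Hypothetical helper for illustration only.
    Shapes: W1 (hidden, in), b1 (hidden,), W2 (out, hidden), b2 (out,).
    """
    no_input = ~W1.any(axis=1)   # unit ignores x: its output is the constant relu(b1)
    no_output = ~W2.any(axis=0)  # unit feeds nothing downstream
    # Fold the constant contribution of input-less units into the next bias.
    const = np.where(no_input, np.maximum(b1, 0.0), 0.0)
    b2_new = b2 + W2 @ const
    keep = ~(no_input | no_output)
    return W1[keep], b1[keep], W2[:, keep], b2_new

def forward(x, W1, b1, W2, b2):
    """Plain dense forward pass: linear, ReLU, linear."""
    return np.maximum(x @ W1.T + b1, 0.0) @ W2.T + b2

rng = np.random.default_rng(0)
W1 = rng.normal(size=(6, 4)); b1 = rng.normal(size=6)
W2 = rng.normal(size=(3, 6)); b2 = rng.normal(size=3)
W1[1] = 0.0      # simulate pruning: unit 1 has no incoming weights
W2[:, 4] = 0.0   # simulate pruning: unit 4 has no outgoing weights

small = minimize_mlp(W1, b1, W2, b2)
x = rng.normal(size=(5, 4))
# Largest output difference between the masked and the compacted network.
max_diff = np.abs(forward(x, W1, b1, W2, b2) - forward(x, *small)).max()
```

In this toy case the hidden layer shrinks from 6 to 4 units, and `max_diff` should differ from zero only by floating-point rounding, which mirrors the "same forward function" property described above.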

Original Article

Cached at: 06/15/26, 04:59 PM

Source: https://huggingface.co/papers/2606.14346

Similar Articles

Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression

Joint Structural Pruning and Mixed-Precision Quantization for LLM Compression

SHAPE: Coalition-Aware Expert Pruning for Sparse Mixture-of-Experts LLMs

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling
