dynamic-inference

#dynamic-inference

Sigma-Branch: Hierarchical Single-Path Network Reconstruction for Dynamic Inference with Reduced Active Parameters

arXiv cs.LG ↗ · yesterday Cached

Sigma-Branch restructures pretrained dense networks into a hierarchical binary tree with a shared backbone, routers, and specialized leaves, reducing per-inference active parameters by 58–60% while staying within 1.72 percentage points of baseline accuracy on CIFAR-100, ImageNet-1K, and ModelNet40.

#dynamic-inference

Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

arXiv cs.LG ↗ · 3d ago Cached

This paper introduces Program-of-Layers (PoLar), a method that allows LLMs to dynamically skip or loop pretrained layers per input, improving accuracy and efficiency over fixed-depth inference.
