BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Hugging Face Daily Papers 06/08/26, 04:26 PM Papers

model-editing upcycling weight-manipulation tensor-surgery yaml-plans open-source tool

Summary

BrainSurgery is a tool for reproducible and declarative weight manipulations on neural network checkpoints, enabling model editing and upcycling through YAML plans with built-in validation.

As deep learning models scale, managing, inspecting, and modifying large checkpoints has become increasingly challenging. Researchers often need to alter model weights for layer restructuring, precision casting, low-rank factorization, and architectural debugging, yet these workflows often rely on fragile ad-hoc Python scripts. Here, we introduce BrainSurgery, a tool for robust and reproducible "tensor surgery" on neural network checkpoints, and provide a system demonstration covering four examples and three case studies from model upcycling to LoRA extraction. By abstracting storage formats and memory management, BrainSurgery executes complex transformations through declarative YAML plans. It supports structural modifications, mathematical transformations, and tensor reshaping through expressive regex and structural targeting, while built-in assertions validate tensor shapes, data types, and values to prevent silent errors. We envision that BrainSurgery will provide a strong foundation for future research through its reproducible and validated operations.

Original Article

Cached at: 06/10/26, 09:44 AM

Source: https://huggingface.co/papers/2606.09707

Abstract

BrainSurgery is a tool for robust and reproducible tensor manipulation of neural network checkpoints through declarative YAML plans with built-in validation.

As deep learning models scale, managing, inspecting, and modifying large checkpoints has become increasingly challenging. Researchers often need to alter model weights for layer restructuring, precision casting, low-rank factorization, and architectural debugging, yet these workflows often rely on fragile ad-hoc Python scripts. Here, we introduce BrainSurgery, a tool for robust and reproducible “tensor surgery” on neural network checkpoints, and provide a system demonstration covering four examples and three case studies from model upcycling to LoRA extraction. By abstracting storage formats and memory management, BrainSurgery executes complex transformations through declarative YAML plans. It supports structural modifications, mathematical transformations, and tensor reshaping through expressive regex and structural targeting, while built-in assertions validate tensor shapes, data types, and values to prevent silent errors. We envision that BrainSurgery will provide a strong foundation for future research through its reproducible and validated operations.
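
To make the idea of a declarative plan with regex targeting and built-in assertions concrete, here is a minimal sketch in plain Python. It is not BrainSurgery's API or plan schema, which this page does not show: the step keys (`op`, `match`, `to`, `factor`, `shape`), the helper names, and the toy checkpoint are all hypothetical, and nested lists stand in for real tensors. The plan is written as the list of dictionaries that a YAML file might parse into.

```python
import re

# Hypothetical plan: roughly what a parsed YAML plan could look like.
# The keys and op names are illustrative assumptions, not BrainSurgery's schema.
PLAN = [
    {"op": "rename", "match": r"^encoder\.layer\.(\d+)\.", "to": r"blocks.\1."},
    {"op": "scale", "match": r"\.weight$", "factor": 0.5},
    {"op": "assert_shape", "match": r"^blocks\.\d+\.weight$", "shape": [2, 2]},
]


def shape_of(tensor):
    """Return the shape of a rectangular nested-list 'tensor'."""
    shape = []
    while isinstance(tensor, list):
        shape.append(len(tensor))
        if not tensor:
            break
        tensor = tensor[0]
    return shape


def scale(tensor, factor):
    """Multiply every element by factor, returning new lists."""
    if isinstance(tensor, list):
        return [scale(item, factor) for item in tensor]
    return tensor * factor


def apply_plan(checkpoint, plan):
    """Apply each step to the tensors whose names match its regex."""
    state = dict(checkpoint)  # shallow copy, so the input mapping is not modified
    for step in plan:
        pattern = re.compile(step["match"])
        matched = [name for name in state if pattern.search(name)]
        if not matched:
            # Fail loudly instead of silently skipping a step that targets nothing.
            raise ValueError(f"{step['op']}: no tensor matches {step['match']!r}")
        for name in matched:
            if step["op"] == "rename":
                new_name = pattern.sub(step["to"], name)
                if new_name != name and new_name in state:
                    raise ValueError(f"rename would overwrite {new_name!r}")
                state[new_name] = state.pop(name)
            elif step["op"] == "scale":
                state[name] = scale(state[name], step["factor"])
            elif step["op"] == "assert_shape":
                actual = shape_of(state[name])
                if actual != step["shape"]:
                    raise AssertionError(
                        f"{name}: expected shape {step['shape']}, got {actual}"
                    )
            else:
                raise ValueError(f"unknown op {step['op']!r}")
    return state


# Toy checkpoint: tensor name -> nested lists of floats.
checkpoint = {
    "encoder.layer.0.weight": [[1.0, 2.0], [3.0, 4.0]],
    "encoder.layer.0.bias": [0.1, 0.2],
    "encoder.layer.1.weight": [[2.0, 0.0], [0.0, 2.0]],
}

edited = apply_plan(checkpoint, PLAN)
```

In this sketch the plan is ordinary data, so it can be stored, diffed, and rerun, and a step that matches no tensor or fails a shape check stops execution rather than producing a silently wrong checkpoint. A real tool would additionally handle storage formats, data types, and memory management, which are omitted here.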

Similar Articles

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Data-centric debugging for teams training neural nets [P]

Weight Decay Regimes in Grokking Transformers: Cheap Online Diagnostics

Task-Restricted Symmetries in Recurrent Weight Space

@AnneliesGamble: https://x.com/AnneliesGamble/status/2066949973749755919
