norm-preserving


Norm-preserving abliteration on Qwen3.6-35B-A3B: 0% refusal, benchmarks intact, open source dataset

Reddit r/LocalLLaMA · 9h ago

A norm-preserving abliteration technique applied to Qwen3.6-35B-A3B reportedly achieves a 0% refusal rate while maintaining benchmark performance; the accompanying dataset is released as open source.
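
The summary does not say how the post implements the technique. As a rough illustration only, the sketch below assumes one common reading of "norm-preserving abliteration": project a hypothetical refusal direction `r` out of the output side of a weight matrix, then rescale each column back to its original norm. The function name, the column-wise rescaling, and the toy matrix are all assumptions made for this example, not details taken from the post.

```python
import math

def ablate_preserving_norms(W, r, eps=1e-12):
    """Remove direction r from the outputs of W, keeping column norms.

    W is a list of rows (d_out x d_in); r has length d_out.
    Illustrative sketch only, not the post's implementation.
    """
    r_norm = math.sqrt(sum(x * x for x in r))
    u = [x / r_norm for x in r]              # unit refusal direction (assumed given)
    d_out, d_in = len(W), len(W[0])
    out = [row[:] for row in W]
    for j in range(d_in):
        col = [W[i][j] for i in range(d_out)]
        old = math.sqrt(sum(x * x for x in col))
        dot = sum(u[i] * col[i] for i in range(d_out))
        # Project the column onto the subspace orthogonal to u.
        proj = [col[i] - dot * u[i] for i in range(d_out)]
        new = math.sqrt(sum(x * x for x in proj))
        # Rescale to the original norm; a column parallel to u stays zero.
        scale = old / new if new > eps else 0.0
        for i in range(d_out):
            out[i][j] = proj[i] * scale
    return out

W = [[3.0, 1.0],
     [4.0, 0.0]]
r = [0.0, 2.0]
W_ablated = ablate_preserving_norms(W, r)
```

Because rescaling a vector does not change its direction, every column of `W_ablated` stays orthogonal to `r` while keeping the length it had in `W`.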


An Integrable Token Mixing Layer from the Generalized Yang Baxter Equation

arXiv cs.LG · 2026-06-16

The paper introduces YB-Mixer, a token-mixing layer derived from the generalized Yang-Baxter equation. The layer is exactly norm-preserving and depth-stable, and it allows order-free and variable-budget inference. It achieves competitive performance on long-range memory tasks with fewer parameters than attention and state-space baselines.
