eigendecomposition

#eigendecomposition

@rasbt: Always back to the basics: LatentMoE was probably inspired by MLA, which was inspired by LoRA, which was inspired by SV…

X AI KOLs Timeline ↗ · 2026-06-09 Cached

Sebastian Raschka points out the chain of inspiration from LatentMoE back to eigendecomposition through MLA, LoRA, and SVD.
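
As a brief illustration of the last link in that chain (SVD back to eigendecomposition), the standard relationship can be written out as follows. The symbols $M$, $U$, $\Sigma$, $V$, and $\sigma_i$ are introduced here for illustration and do not come from the post itself.

```latex
% SVD of a real matrix M (m x n), with orthogonal U, V and diagonal Sigma >= 0
M = U \Sigma V^{\top}

% Forming the Gram matrix gives a symmetric eigendecomposition:
M^{\top} M = V \Sigma^{\top} U^{\top} U \Sigma V^{\top} = V \left( \Sigma^{\top} \Sigma \right) V^{\top}

% Hence the right singular vectors of M are eigenvectors of M^T M,
% and each singular value is the square root of an eigenvalue:
\sigma_i(M) = \sqrt{\lambda_i\!\left( M^{\top} M \right)}
```

Low-rank methods build on this by keeping only the largest few $\sigma_i$, or by parameterizing an update directly as a product of two thin matrices.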

#eigendecomposition

Dynamics of the Transformer Residual Stream: Coupling Spectral Geometry to Network Topology

arXiv cs.LG ↗ · 2026-05-15 Cached

This paper performs full Jacobian eigendecomposition across production-scale LLMs, revealing a learned spectral gradient from rotation-dominated early layers to symmetric late layers, along with a low-rank bottleneck that compresses perturbations. The results link perturbation propagation and compression to network functional topology.

#eigendecomposition

After 8 years, I rewrote my open-source PyTorch curvature library

Hacker News Top ↗ · 2026-05-14 Cached

After 8 years, the author rewrote the open-source pytorch-hessian-eigenthings library, providing efficient eigendecomposition of Hessian and other curvature matrices for PyTorch models using iterative methods like Lanczos.
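
To give a feel for what "iterative" means here, the following is a minimal sketch of estimating the largest Hessian eigenvalue using only Hessian-vector products. It is not the library's API, and it uses plain power iteration rather than Lanczos to stay short. The toy loss, the finite-difference Hessian-vector product, and the names `grad`, `hvp`, and `top_eigenpair` are all assumptions made for illustration.

```python
import math

def grad(w):
    # Gradient of a toy quadratic loss (an assumption for this sketch):
    # f(w) = 0.5 * (3*w0^2 + w1^2) + w0*w1, whose Hessian is [[3, 1], [1, 1]].
    return [3.0 * w[0] + w[1], w[0] + w[1]]

def hvp(w, v, eps=1e-5):
    # Hessian-vector product by central finite differences of the gradient:
    # H v ~= (g(w + eps*v) - g(w - eps*v)) / (2*eps). The Hessian is never formed.
    plus = grad([wi + eps * vi for wi, vi in zip(w, v)])
    minus = grad([wi - eps * vi for wi, vi in zip(w, v)])
    return [(p - m) / (2.0 * eps) for p, m in zip(plus, minus)]

def top_eigenpair(w, dim, iters=200):
    # Power iteration: repeatedly apply H and renormalize.
    v = [1.0 / math.sqrt(dim)] * dim
    eigval = 0.0
    for _ in range(iters):
        hv = hvp(w, v)
        # Rayleigh quotient v^T H v (v has unit norm) estimates the eigenvalue.
        eigval = sum(a * b for a, b in zip(v, hv))
        norm = math.sqrt(sum(x * x for x in hv))
        v = [x / norm for x in hv]
    return eigval, v

eigval, eigvec = top_eigenpair([0.5, -0.2], dim=2)
print(eigval)  # close to 2 + sqrt(2), the largest eigenvalue of [[3, 1], [1, 1]]
```

In a real model the finite-difference step would typically be replaced by an autograd-based Hessian-vector product, and Lanczos would recover several eigenvalues at once instead of only the largest.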
