representation-geometry

Tag

Cards List
#representation-geometry

A Geometric Account of Activation Steering through Angle-Norm Decomposition

arXiv cs.AI · 4d ago Cached

This paper analyzes linear activation steering in language models by decomposing interventions into angular and radial components. It finds that concepts are primarily encoded in angular structure, but norm adjustments are crucial for stability, supporting spherical steering methods while showing that additive coefficients conflate geometry.

0 favorites 0 likes
#representation-geometry

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

Hugging Face Daily Papers · 2026-05-27 Cached

This paper systematically tests linear probes for deception detection in large language models, finding they fail under distributional shifts but style-augmented probes recover performance, and revealing that deception is encoded through distributed sub-threshold features.

0 favorites 0 likes
#representation-geometry

@snowboat84: Today, let's discuss something hardcore. One question: what level of mathematics does AI use? From the perspective of tools and models themselves, the mathematics used by AI has an average age of 150 years, with most being from before the mid-19th century: matrix multiplication, gradient descent, chain rule, Fourier transform, inner product, probability — mostly content from the first two years of undergraduate studies. But some phenomena emerging from AI...

X AI KOLs Timeline · 2026-05-23 Cached

Discusses that the mathematics used by AI is mainly linear algebra, calculus, etc., from before the 19th century, but emerging phenomena such as Scaling Law, emergent abilities, double descent, in-context learning, and representation geometry lack mathematical explanation. Analogizes to the clouds in physics in 1900, suggesting it may drive the development of 21st-century mathematics.

0 favorites 0 likes
#representation-geometry

Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces

arXiv cs.AI · 2026-05-15 Cached

This paper introduces the concept of 'minimal cores' in overcomplete reasoning traces, showing that on average 46% of steps can be removed while preserving the final answer, and that minimal cores improve trace separation and reduce intrinsic dimensionality.

0 favorites 0 likes
#representation-geometry

Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter

arXiv cs.CL · 2026-05-13 Cached

This paper introduces Minor Component Unlearning (MCU), a novel approach to LLM unlearning that targets minor components in representations to resist relearning attacks. It addresses the vulnerability of existing methods by focusing on robust directions within the model's spectral structure.

0 favorites 0 likes
← Back to home

Submit Feedback