geometric-analysis

#geometric-analysis

@dair_ai: Why do RL runs on LLMs blow up even when the recipe looks right? GEOALIGN, from the Alibaba team behind Qwen, points at…

X AI KOLs Following · 6d ago

GEOALIGN, from the Alibaba team behind Qwen, finds that instability in RL for LLMs often stems from a few bad rollouts that push updates in conflicting directions. It proposes a lightweight method that curates rollouts by directional consistency, improving training stability and performance.

Observable Patterns Are Not Explanations: A Causal-Geometric Analysis of Latent Reasoning Models

arXiv cs.CL · 2026-06-12

This paper analyzes latent reasoning models (LRMs) and demonstrates that observable patterns in latent states are not causal explanations of reasoning; it advocates for matched controls and causal tests in interpretability research.

Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation

arXiv cs.LG · 2026-06-10

This paper reveals that low-bit KV cache quantization can silently destroy safety alignment in instruction-tuned LLMs, and proposes a diagnostic method (PCR) to classify failure modes along with a training-free mitigation protocol that recovers up to 97% of lost alignment.

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

arXiv cs.LG · 2026-06-09

Introduces Contribution Weights, a projection-based metric that accounts for attention weight, value magnitude, and directional alignment to more faithfully measure token importance in transformer LLMs, revealing active functional roles of attention sinks.
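To make the idea of a projection-based importance score concrete, here is a minimal sketch. It is not the paper's definition: the function name `projection_contributions` and the formula (each token's weighted value vector projected onto the attention output, normalized by the output's squared norm) are assumptions chosen only because they combine the three ingredients the summary names: attention weight, value magnitude, and directional alignment.

```python
def projection_contributions(attn, values):
    """Hypothetical projection-based token scores (illustrative, not the paper's metric).

    attn:   attention weights for one query, one float per token
    values: value vectors, one list of floats per token
    Returns one score per token; the scores sum to 1 when the output is nonzero.
    """
    dim = len(values[0])
    # Attention output: the attention-weighted sum of the value vectors.
    output = [sum(a * v[d] for a, v in zip(attn, values)) for d in range(dim)]
    out_sq = sum(x * x for x in output)
    if out_sq == 0.0:
        raise ValueError("attention output is the zero vector")
    # Project each token's weighted value (a_j * v_j) onto the output direction.
    return [
        a * sum(v[d] * output[d] for d in range(dim)) / out_sq
        for a, v in zip(attn, values)
    ]


# A token with a high attention weight but a tiny value vector scores low here.
scores = projection_contributions([0.9, 0.1], [[0.1, 0.0], [5.0, 0.0]])
```

Under this assumed formula, raw attention weight and the resulting score can rank tokens differently, which is the gap a projection-based metric is meant to expose.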

A Geometric Account of Activation Steering through Angle-Norm Decomposition

arXiv cs.AI · 2026-06-08

This paper analyzes linear activation steering in language models by decomposing each intervention into an angular (direction) component and a radial (norm) component. It finds that concepts are primarily encoded in angular structure, but that norm adjustments are crucial for stability. The results support spherical steering methods and show that a single additive coefficient conflates the two components.

A Geometric View of Counterfactual Behavior: Interaction of Boundary Proximity and Local Support

arXiv cs.LG · 2026-06-04

This paper examines counterfactual behavior in ML models through a geometric lens, showing that models with similar predictive performance can differ substantially in counterfactual outcomes due to the interaction between decision-boundary proximity and local data support. The findings identify counterfactual behavior as a distinct dimension from predictive performance, with implications for model selection and reliability of counterfactual explanation methods.

The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment

arXiv cs.CL · 2026-06-03

This paper geometrically analyzes why LLMs acting as judges agree strongly with each other but weakly with humans, finding that inter-LLM consensus reflects a collapsed subspace rather than true human alignment on subjective rubrics. Post-hoc calibration on human data improves alignment, but even calibrated LLMs fall short of human reliability.

Geometric Asymmetry in MoE Specialization: Functional Decorrelation and Representational Overlap

arXiv cs.LG · 2026-05-19

This paper introduces a Jacobian-PCA-Grassmann framework to analyze the geometric structure of expert specialization in Mixture-of-Experts (MoE) Transformers. It finds that experts exhibit strong functional decorrelation while their representations overlap, and that routing sparsity significantly influences this geometry.

Most injection detectors score each prompt in isolation. I built one that tracks the geometric trajectory of the full session. Here is a concrete result.

Reddit r/artificial · 2026-04-20

A developer built Arc Gate, a monitoring proxy for LLMs that uses Fisher information manifold geometry to detect session-level prompt injection attacks, identifying Crescendo-style gradual manipulation by tracking t-values against a phase transition threshold t* = 1.2247 rather than per-turn phrase detection.

Geometric coherence of single-cell CRISPR perturbations reveals regulatory architecture and predicts cellular stress

Hugging Face Daily Papers · 2026-04-17

This paper introduces Shesha, a geometric stability metric that quantifies directional coherence of single-cell CRISPR perturbation responses using mean cosine similarity, revealing regulatory architecture and predicting cellular stress across 2,200+ perturbations in five CRISPR datasets.
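As a rough illustration of scoring directional coherence with mean cosine similarity, the sketch below averages the cosine similarity over all pairs of response vectors. The pairwise averaging and the name `mean_pairwise_cosine` are assumptions; the summary does not say which vectors the paper compares or how it aggregates them.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two nonzero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def mean_pairwise_cosine(shifts):
    """Average cosine similarity over all pairs of response vectors.

    shifts: one vector per cell, e.g. perturbed expression minus a control mean
            (an assumed preprocessing step, not taken from the paper).
    Values near 1 mean the cells moved in a shared direction; values near 0
    or below mean the responses are scattered or opposed.
    """
    pairs = list(combinations(shifts, 2))
    if not pairs:
        raise ValueError("need at least two response vectors")
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


coherent = mean_pairwise_cosine([[1.0, 0.0], [2.0, 0.0], [3.0, 0.0]])
opposed = mean_pairwise_cosine([[1.0, 0.0], [-1.0, 0.0]])
```

Because cosine similarity ignores vector length, this score reflects only whether responses point the same way, not how strong they are.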
