ablation

Tag

Cards List
#ablation

Recursive Self-Improvement for Skills (Skill RSI)

Reddit r/AI_Agents · yesterday

Skill RSI is a free tool that recursively evaluates and improves AI skills via procedural evaluations and a research agent, supporting standalone or Codex plugin usage.

0 favorites 0 likes
#ablation

Why our #1 LightGBM feature by importance made predictions worse [D]

Reddit r/MachineLearning · 3d ago

A blog post from Flyback demonstrates how a LightGBM feature that ranked #1 in importance actually worsened predictions due to target encoding leakage, highlighting the danger of relying solely on feature importance metrics.

0 favorites 0 likes
#ablation

Measuring, Localizing, and Ablating Alignment Signatures in LLMs

arXiv cs.LG · 3d ago Cached

This paper investigates how post-training of LLMs introduces AI-like stylistic regularities and proposes PASTA, a training-free method to localize and ablate these alignment signatures, reducing AI detection rates while maintaining coherence across 11 models and 6 detectors.

0 favorites 0 likes
#ablation

@NousResearch: To check that CNA isolates only the intended behavior, we evaluate steered models on MMLU across a range of steering st…

X AI KOLs Following · 2026-05-19 Cached

Nous Research released Contrastive Neuron Attribution (CNA), a method to steer LLM behavior by identifying and ablating sparse circuits in MLP neurons without training sparse autoencoders or degrading general benchmarks, validated on multiple large language models.

0 favorites 0 likes
← Back to home

Submit Feedback