out-of-distribution

Tag

Cards List
#out-of-distribution

At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization

arXiv cs.LG · 2d ago Cached

This paper proposes using sparse autoencoders to detect out-of-distribution inputs for transformers, including typos and jailbreak prompts, by analyzing spurious concept activations. The method enables a mechanistically grounded fine-tuning strategy to improve LLM robustness.

0 favorites 0 likes
#out-of-distribution

Catastrophic Compositional Generation: Why Vanilla Diffusion Models Fail to Extrapolate

arXiv cs.LG · 4d ago Cached

This paper argues that vanilla conditional diffusion models fundamentally fail at compositional generation when the target distribution is out-of-distribution, due to score estimation error, and that inference-time corrections cannot fully compensate.

0 favorites 0 likes
#out-of-distribution

Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City

Hugging Face Daily Papers · 2026-06-18 Cached

This paper studies how self-driving car systems and humans perform on visual question answering tasks across different geographic locations (Lima and New York City), finding that both humans and VLMs show similar performance regardless of location but diverge based on question type.

0 favorites 0 likes
#out-of-distribution

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Hugging Face Daily Papers · 2026-06-18 Cached

This paper argues that aggregate-score leaderboards for LLM agent benchmarks fail to capture deployment-relevant dimensions and show rank instability. It proposes ranking configurations by predictive validity—the correlation between in-sample and out-of-sample rank—and introduces a twelve-tier measurement apparatus along with falsifiable out-of-distribution criteria.

0 favorites 0 likes
#out-of-distribution

Nothing from Something: Can a Language Model Discover 0?

arXiv cs.AI · 2026-06-17 Cached

This paper examines whether language models can independently discover the concept of zero as a form of out-of-distribution generalization, finding that GPT-2 sized models cannot at test time but improve with training on examples of zero, and that language pretraining reduces the number of required examples.

0 favorites 0 likes
#out-of-distribution

Non-Parametric Machine Text Detection via Multi-View Gaussian Processes

arXiv cs.LG · 2026-06-15 Cached

This paper introduces a non-parametric multi-view Gaussian process framework for detecting machine-generated text that is robust to adversarial manipulations like paraphrasing. By combining complementary features and providing calibrated uncertainty, it outperforms existing detectors on held-out attacks.

0 favorites 0 likes
#out-of-distribution

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

arXiv cs.LG · 2026-06-04 Cached

ADAPTOOD is a novel framework that uses data uncertainty to quantify distribution shift severity and guide fine-tuning of ECG time series models for out-of-distribution settings. It combines uncertainty estimation with low-rank model updates and adaptive hyperparameter optimization, achieving up to 7% higher accuracy and 12.9% higher precision than existing OOD adaptation methods.

0 favorites 0 likes
#out-of-distribution

Outsmarting the Chameleon: Counterfactual Decoupling for Tactical OOD Shifts in Live Streaming Risk Assessment

arXiv cs.LG · 2026-06-03 Cached

Proposes Latent-Predictive Counterfactual Decoupling (LPCD) to address tactical out-of-distribution shifts in live streaming risk assessment by decoupling stable malicious intent from evolving narrative tactics at the latent level, achieving superior performance on large-scale industrial datasets.

0 favorites 0 likes
#out-of-distribution

Toward Robust In-Context Learning: Leveraging Out-of-distribution Proxies for Target Inaccessible Demonstration Retrieval

arXiv cs.CL · 2026-06-02 Cached

This paper introduces DOPA, a demonstration search framework that uses an out-of-distribution proxy to retrieve robust demonstrations for LLMs when the target domain is inaccessible, enhancing in-context learning performance under distribution shift.

0 favorites 0 likes
#out-of-distribution

Curriculum Learning for Safety Alignment

arXiv cs.LG · 2026-05-27 Cached

This paper proposes Staged-Competence, a curriculum learning framework for DPO-based safety alignment that organizes preference data by difficulty, improving robustness and data efficiency while preserving general capabilities.

0 favorites 0 likes
#out-of-distribution

Generative OOD-regularized Model-based Policy Optimization

arXiv cs.LG · 2026-05-26 Cached

Introduces GORMPO, a density-regularized offline RL algorithm that uses generative density modeling to restrict policy updates to high-density areas, achieving 17% improvement on a real-world medical dataset and outperforming state-of-the-art baselines.

0 favorites 0 likes
#out-of-distribution

Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

arXiv cs.LG · 2026-05-21 Cached

This paper presents the first theoretical model for out-of-distribution generalization in reinforcement learning, showing that smaller abstract state spaces enable cross-scale generalization in POMDPs.

0 favorites 0 likes
#out-of-distribution

Spectral Gradient Surgery for Domain-Generalizable Dataset Distillation

arXiv cs.LG · 2026-05-20

This paper introduces Domain Generalizable Dataset Distillation (DGDD), a new problem setting that targets out-of-distribution generalization of distilled datasets, and proposes Spectral Gradient Surgery (SGS) to disentangle class-discriminative and domain-specific information by leveraging cross-domain gradient agreement in the spectral domain.

0 favorites 0 likes
#out-of-distribution

@omarsar0: Every time I ask my 10-year-old to use coding agents, he gets extremely disappointed. It turns out that all he wants is…

X AI KOLs Following · 2026-05-18 Cached

A developer notes that coding agents consistently fail to help his 10-year-old build creative simulators, revealing LLMs' inability to handle out-of-distribution use cases and arguing that claims of imminent AGI are overstated.

0 favorites 0 likes
#out-of-distribution

Backbone-Equated Diffusion OOD via Sparse Internal Snapshots

arXiv cs.LG · 2026-05-13 Cached

This paper introduces a protocol for fair comparison of diffusion-based OOD detectors and proposes Canonical Feature Snapshots (CFS), which leverage sparse internal activations for efficient detection.

0 favorites 0 likes
#out-of-distribution

CPCANet: Deep Unfolding Common Principal Component Analysis for Domain Generalization

Hugging Face Daily Papers · 2026-05-07 Cached

CPCANet is a domain generalization framework that uses Common Principal Component Analysis to discover structured domain-invariant subspaces, achieving state-of-the-art performance in zero-shot transfer.

0 favorites 0 likes
#out-of-distribution

Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation

arXiv cs.CL · 2026-04-20 Cached

This paper proposes CAP-TTA, a test-time adaptation framework that uses preconditioned LoRA updates triggered by bias-risk scores to mitigate toxicity and bias in large language models during narrative generation, achieving faster optimization and better fluency than standard baselines.

0 favorites 0 likes
← Back to home

Submit Feedback