bayesian

#bayesian

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper presents the first implementation of an infra-Bayesian reinforcement learning agent, demonstrating that it outperforms classical RL in worst-case regret and handles Newcomb's problem optimally, offering a step toward robustness under model misspecification.

0 favorites 0 likes

#bayesian

Probabilistic Attribution For Large Language Models

arXiv cs.CL ↗ · 2026-05-22 Cached

This paper proposes a model-agnostic probabilistic token attribution measure for LLMs using Bayes' rule to invert next-token log probabilities, capturing the model's internal representation of token sequences and improving interpretability through entropy analysis.

0 favorites 0 likes

#bayesian

Precision Tracked Transformer via Kalman Filtering, Kriging and Process Noise

arXiv cs.LG ↗ · 2026-05-20

The paper introduces the Bayesian Filtering Transformer (BFT), which incorporates uncertainty into Transformers via precision-weighted attention and Kalman update residuals, improving performance on sequential recommendation and noisy LLM fine-tuning.

0 favorites 0 likes

#bayesian

Learning Normalized Energy Models for Linear Inverse Problems

arXiv cs.LG ↗ · 2026-05-18 Cached

This paper introduces a new energy-based model for linear inverse problems that learns normalized posterior densities, overcoming limitations of diffusion models. It enables unbiased sampling, adaptive sampling, and blind degradation estimation, with competitive performance on ImageNet, CelebA, and AFHQ.

0 favorites 0 likes

#bayesian

Bayesian Model Merging

arXiv cs.LG ↗ · 2026-05-14 Cached

Introduces Bayesian Model Merging (BMM), a plug-and-play bi-level optimization framework for combining multiple task-specific experts into a single model, achieving state-of-the-art performance on vision and language benchmarks.

0 favorites 0 likes

bayesian

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

Probabilistic Attribution For Large Language Models

Precision Tracked Transformer via Kalman Filtering, Kriging and Process Noise

Learning Normalized Energy Models for Linear Inverse Problems

Bayesian Model Merging

Submit Feedback