Efficient Conditioning Why Pseudo Observation Batch Bayesian Optimization Works When It Does not
Summary
This paper provides a unified theoretical framework for pseudo observation batch Bayesian optimization, proving that Gaussian processes produce distinct batch points and that common methods like Constant Liar and Kriging Believer are instances of a single conditioning mechanism. It introduces the Structural Diversity Diagnostic (SDD) for testing surrogate compatibility and validates predictions across multiple benchmark functions and hyperparameter tuning.
Similar Articles
Accelerating Multi-Objective Bayesian Optimisation via Predictive-Gradient Catalysts
This paper introduces a general acceleration mechanism for multi-objective Bayesian optimisation that uses Gaussian process predictive gradients as auxiliary signals to augment existing acquisition functions, enabling faster convergence to the global Pareto set under limited evaluation budgets.
Regularized Offline Policy Optimization with Posterior Hybrid Bayesian Belief
This paper introduces Posterior Hybrid Bayesian Belief (PhyB), a framework that reformulates the expectation in Bayesian RL as a convex combination over dynamics models, enabling efficient regularized offline policy optimization with bounded objective discrepancy and state-of-the-art performance.
PBSD: Privileged Bayesian Self-Distillation for Long-Horizon Credit Assignment
PBSD proposes a Bayesian self-distillation method that converts sparse final rewards into calibrated turn-level credit signals for long-horizon agentic tasks, improving policy learning and generalization.
Automated Kernel Discovery Towards Understanding High-dimensional Bayesian Optimization
The paper introduces Kernel Discovery, an LLM-driven evolutionary framework for high-dimensional Bayesian optimization that searches a broader kernel space and achieves state-of-the-art results on benchmarks.
When Determinants Are Not Enough: Private Rare Switching
This note presents a research moment where Codex helped find a new rare-switching rule for private linear bandits, using the generalized Rayleigh quotient to overcome the failure of determinant-based monotonicity due to Gaussian noise.