generalized-linear-models

Contextual Slate GLM Bandits with Limited Adaptivity

arXiv cs.LG · 4h ago

Proposes algorithms for contextual slate bandits with generalized linear rewards under limited adaptivity, achieving regret bounds independent of the non-linearity parameter. The batched and rarely-switching algorithms are computationally efficient and empirically outperform baselines, including on an example-selection task for language models.

Best Arm Identification in Generalized Linear Bandits via Hybrid Feedback

arXiv cs.AI · 2026-05-08

This paper introduces a hybrid Track-and-Stop algorithm for best arm identification in generalized linear bandits that unifies absolute and relative feedback. The authors propose a likelihood-ratio-based confidence sequence to adaptively allocate queries, demonstrating improved sample efficiency over baseline methods.
