contextual-bandit

Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning

arXiv cs.AI · 2026-05-25

This paper presents VDSS, a human-in-the-loop multi-agent framework for ventilator decision support that uses contextual bandit preference learning to adapt to clinician-specific tuning styles. Retrospective replays of ICU trajectories show improved recommendation acceptability and fewer interaction rounds.

Built a self-hosted contextual bandit appliance in Rust. Deployed it against a live AI trading product. Found two bugs in my own configuration before I found any in the runtime.

Reddit r/ArtificialInteligence · 2026-05-15

The post announces two open-source Rust projects: Lycan (a graph execution language for contextual bandits) and Syntra (a self-hosted Docker appliance for serving Lycan capsules). The author dogfoods them on a live AI trading product and finds that data pipeline bugs, not algorithm issues, dominated the adaptation work.

Latency-Quality Routing for Functionally Equivalent Tools in LLM Agents

arXiv cs.LG · 2026-05-15

This paper introduces LQM-ContextRoute, a contextual bandit router for selecting between functionally equivalent tool providers in LLM agents, balancing latency and answer quality. It outperforms baselines on web-search and retriever benchmarks.
