adaptive-control

#adaptive-control

Max Out GRPO Signal: Adaptive Trace Prefix Control for Hard Reasoning Problems

arXiv cs.CL ↗ · 20h ago Cached

This paper introduces AdaPrefix-GRPO, a method that adaptively controls the length of correct solution prefixes provided to a model during GRPO training, maintaining a 50% success rate to maximize gradient signal. It significantly improves accuracy on hard math reasoning problems while reducing computational cost.

0 favorites 0 likes

#adaptive-control

Active Inference for Adaptive Traffic Signal Control in Noisy Nonstationary IoT Environments

arXiv cs.AI ↗ · 2026-06-15 Cached

The paper proposes an active inference controller for adaptive traffic signal control in noisy IoT environments, outperforming DQN in idle times and CO2 emissions under sensor occlusion and adverse weather conditions.

0 favorites 0 likes

#adaptive-control

From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

arXiv cs.LG ↗ · 2026-05-20

Proposes CPSS, a runtime safety mechanism that converts cumulative cost constraints into adaptive state-level thresholds for safe reinforcement learning in nonstationary environments, demonstrating reduced violations in highway merging scenarios.

0 favorites 0 likes

#adaptive-control

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network

arXiv cs.LG ↗ · 2026-05-13 Cached

This paper introduces ACSAC, a reinforcement learning method that uses an adaptive chunk size actor-critic algorithm with a causal Transformer Q-network to handle long-horizon, sparse-reward tasks. It demonstrates state-of-the-art performance on manipulation tasks by dynamically adjusting action chunk sizes based on state-dependent needs.

0 favorites 0 likes

#adaptive-control

Temporal Attention for Adaptive Control of Euler-Lagrange Systems with Unobservable Memory

arXiv cs.LG ↗ · 2026-05-11 Cached

This paper proposes a meta-control architecture using temporal self-attention for adaptive control of Euler-Lagrange systems with unobservable memory states. It demonstrates improved tracking performance over baseline methods on a 2-DOF manipulator while identifying failure modes in long-memory regimes.

0 favorites 0 likes

adaptive-control

Max Out GRPO Signal: Adaptive Trace Prefix Control for Hard Reasoning Problems

Active Inference for Adaptive Traffic Signal Control in Noisy Nonstationary IoT Environments

From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network

Temporal Attention for Adaptive Control of Euler-Lagrange Systems with Unobservable Memory

Submit Feedback