This paper introduces FATE, an on-policy framework that leverages failure trajectories to enhance the safety and performance of tool-using LLM agents through self-evolution and Pareto-aware optimization.
OpenAI proposes POLO (Plan Online, Learn Offline), a framework that combines model-based trajectory optimization with global value function learning and coordinated exploration, enabling efficient learning on complex control tasks such as humanoid locomotion and dexterous manipulation with minimal real-world experience.
OpenAI introduces a method for learning complex nonlinear system dynamics with deep generative models over temporal segments, yielding stable long-horizon predictions and supporting differentiable trajectory optimization for model-based control.