recipe

#recipe

Tmax: A simple recipe for terminal agents

Hugging Face Daily Papers ↗ · 3d ago Cached

Tmax introduces a simplified RL training recipe for terminal agents, achieving state-of-the-art performance with a 9B parameter model using a novel data generation taxonomy and an expanded open-source dataset.

0 favorites 0 likes

recipe

Tmax: A simple recipe for terminal agents

Submit Feedback