Tag
Hermes Agent has added native support for skill bundles, allowing multiple skills to be triggered together. The author advises bundling only logically chained workflows to avoid conflicting instructions.
The article highlights how verbose CLI output wastes tokens for LLM coding agents and introduces a pattern-based compressor that reduces shell command output noise while preserving essential information.
This paper addresses the missing old logits problem in asynchronous reinforcement learning for LLMs, proposing exact and approximate correction methods to improve training stability and performance.