token-reduction

Tag

Cards List
#token-reduction

Good to Go: The LOOP Skill Engine That Hits 99% Success and Slashes Token Usage by 99% via One-Shot Recording and Deterministic Replay

arXiv cs.AI · 22h ago Cached

The LOOP Skill Engine achieves 99% success and 99% token reduction for periodic AI agent tasks by recording a single LLM-driven execution and replaying it deterministically via a parameterized, branch-free skill, eliminating stochastic failures and high costs.

0 favorites 0 likes
#token-reduction

@berryxia: Agent memory is incredibly competitive! I have to say, the more people join this track, the better it gets! The Tencent AI team spent a full 6 months tackling just one problem: AI agents frequently dropping context in long conversations. They ended up building a complete memory system and open-sourced it directly. After reading their sharing, my biggest takeaway is...

X AI KOLs Timeline · yesterday Cached

Tencent AI has open-sourced an Agent memory system that significantly improves token efficiency and agent consistency in long dialogues through three methods: real-time context compression, Mermaid task maps, and Persona memory. Token consumption is reduced by 61%, and persona consistency jumps from 48% to 76%.

0 favorites 0 likes
#token-reduction

Hint Tuning: Less Data Makes Better Reasoners

arXiv cs.CL · 3d ago Cached

This paper introduces 'Hint Tuning,' a data-efficient method that reduces token usage in reasoning models by calibrating reasoning depth based on problem difficulty. It achieves significant token reduction (24–66%) on models like Qwen3-Thinking and DeepSeek-R1-Distill using only 1K self-annotated samples.

0 favorites 0 likes
#token-reduction

Learning Adaptive Reasoning Paths for Efficient Visual Reasoning

Hugging Face Daily Papers · 2026-04-16 Cached

AVR is an adaptive visual reasoning framework that dynamically selects optimal reasoning formats to reduce token usage by 50-90% while maintaining accuracy in visual reasoning tasks. The method addresses reasoning path redundancy by decomposing visual reasoning into three cognitive functions and using FS-GRPO training to encourage efficient format selection.

0 favorites 0 likes
← Back to home

Submit Feedback