agentic-llm

#agentic-llm

@jinyuhou0: On popular benchmarks, our 30B model matches systems 20-30x its size (gpt-5.4-xhigh, DeepSeek-V3.2, Kimi-K2.5), while u…

X AI KOLs Timeline ↗ · 2026-05-22 Cached

A new 30B model matches systems 20-30x its size on popular benchmarks while using up to 95% fewer reasoning tokens than comparable agentic LLMs, achieved through a learned configurator that decides when and how to reason. Model and code are openly available.

0 favorites 0 likes

#agentic-llm

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

Hugging Face Daily Papers ↗ · 2026-05-15 Cached

AstraFlow is a dataflow-oriented RL system that enables efficient multi-policy collaborative training and elastic scaling for agentic LLMs, achieving a 2.7x training speedup over existing systems.

0 favorites 0 likes

#agentic-llm

HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

HAGE introduces a weighted multi-relational memory framework that enables query-conditioned traversal over unified relational memory graphs, improving long-horizon reasoning accuracy through adaptive memory retrieval and reinforcement learning-based optimization.

0 favorites 0 likes

agentic-llm

@jinyuhou0: On popular benchmarks, our 30B model matches systems 20-30x its size (gpt-5.4-xhigh, DeepSeek-V3.2, Kimi-K2.5), while u…

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Submit Feedback