@HuggingPapers: Ai2 just released TMax 27B on Hugging Face A 27B terminal agent that hits 42.7% on Terminal Bench 2.0, rivaling models …

X AI KOLs Following 06/22/26, 01:21 PM Models

ai2 tmax-27b terminal-agent hugging-face benchmark open-source

Summary

Ai2 released TMax 27B, a 27B terminal agent that achieves 42.7% on Terminal Bench 2.0, rivaling models 40 times its size.

Ai2 just released TMax 27B on Hugging Face A 27B terminal agent that hits 42.7% on Terminal Bench 2.0, rivaling models 40× its size. https://t.co/LfCksOXL9L

Original Article

View Cached Full Text

Cached at: 06/23/26, 01:51 PM

Ai2 just released TMax 27B on Hugging Face

A 27B terminal agent that hits 42.7% on Terminal Bench 2.0, rivaling models 40× its size. https://t.co/LfCksOXL9L

Similar Articles

Tmax-27b - a Qwen3.6-27b terminal agent for small GPUs trained with DPPO (RL)

Reddit r/LocalLLaMA

Ai2 released Tmax-27B, a terminal-agent LLM trained with DPPO (RL) on Qwen3.6-27B, and the author provides importance-matrix-calibrated GGUF quantizations that achieve competitive performance on agentic benchmarks even at very low bit-widths, with a grafted MTP draft head for speculative decoding.

TMax: A Simple Recipe for Terminal Agents

Reddit r/LocalLLaMA

TMax presents a straightforward method for building AI agents that operate in terminal environments, combining practical design principles for effective command-line automation.

MiniMaxAI/MiniMax-M2.7

Hugging Face Models Trending

MiniMaxAI releases MiniMax-M2.7, an open-weight model featuring self-evolution capabilities, advanced agent team support, and strong performance on software engineering benchmarks (56.22% on SWE-Pro, 66.6% medal rate on MLE Bench Lite), with notable applications in production incident recovery and professional work tasks.

@LottoLabs: Interesting model here 35b a3b trained for agentic use It gets 60.7 on Terminal Bench2 qwen 3.6 27b gets 59.3 Essential…

X AI KOLs Following

Nex-AGI releases Nex-N2, an open-source agentic model series (Nex-N2-Pro and Nex-N2-mini) with an Agentic Thinking framework that unifies reasoning, tool use, and environment execution, achieving top-tier performance on agentic and coding benchmarks.

Qwen3.6-35B-A3B and 9B are officially on the public Terminal-Bench 2.0 leaderboard!

Reddit r/LocalLLaMA

Qwen3.6-35B-A3B and Qwen3.5-9B models are officially on the Terminal-Bench 2.0 leaderboard, with little-coder achieving 24.6% on the 35B variant, surpassing Gemini 2.5 Pro and Qwen3-Coder-480B, while the 9B model shows that sub-10B local models can compete on hard agentic benchmarks.

Similar Articles

Tmax-27b - a Qwen3.6-27b terminal agent for small GPUs trained with DPPO (RL)

TMax: A Simple Recipe for Terminal Agents

MiniMaxAI/MiniMax-M2.7

@LottoLabs: Interesting model here 35b a3b trained for agentic use It gets 60.7 on Terminal Bench2 qwen 3.6 27b gets 59.3 Essential…

Qwen3.6-35B-A3B and 9B are officially on the public Terminal-Bench 2.0 leaderboard!

Submit Feedback