code-agents

#code-agents

Most coding agents don’t fail because they can’t write code. They fail because they start with the wrong map.

Reddit r/AI_Agents · 2026-07-06

SigMap is an open-source grounding layer for AI coding agents that provides a deterministic map of code repositories, reducing wasted context and improving retrieval accuracy and task success rates.

SkillOpt-Lite: Better and Faster Agent Self-evolution via One Line of Vibe

Hugging Face Daily Papers · 2026-07-03

SkillOpt-Lite proposes a minimal viable pipeline for skill optimization in autonomous agents. It treats all components as editable code and integrates into production coding agents, which makes self-evolution both better and faster. The paper formalizes skill optimization via zeroth-order optimization and outperforms prior methods on benchmarks.

@akshay_pachaar: The harness is what matters now. The model is just a commodity. A model on its own returns text. Nothing it produces be…

X AI KOLs Timeline · 2026-06-29

The article argues that the harness (agent framework) now matters more than the model itself, citing Cline's tests in which adjusting the reasoning budget changed performance. Cline also introduces ClinePass, a subscription offering discounted access to multiple open-weight models within its harness.

TACO: Tool-Augmented Credit Optimization for Agentic Tool Use

Hugging Face Daily Papers · 2026-06-29

TACO introduces a novel credit optimization method for code-tool agents that uses a differential reward probe and outcome-gated advantage routing to distinguish useful from redundant or misleading tool calls, improving multimodal agent performance.

@dabit3: agree heavily with this. when I joined @cognition and tried cloud agents for the first time I put my thoughts together …

X AI KOLs Following · 2026-06-28

Developer dabit3 agrees with a prediction that most developers will move their code agents off laptops within six months, reflecting rapid improvement in cloud-based AI agents.

How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

Hugging Face Daily Papers · 2026-06-25

This paper investigates how lightweight static analysis annotations can serve as deterministic anchors to improve the predictability and reproducibility of LLM-based code agents navigating software repositories, finding that such anchoring improves localization and reduces variance.

@VersunPan: I declare, Otty is my new favorite terminal tool! I love it so much, no wonder it's developed by the Typora team!! So many human-centric details, beautiful, and native!! I've already uninstalled cmux and Alacritty. I suggest you download and use it quickly. If you uninstall it, I lose!! I'll live stream eating...

X AI KOLs Timeline · 2026-06-20

A tweet enthusiastically promotes Otty, a new native, GPU-accelerated terminal app from the Typora team, praising its beautiful design and developer-friendly features.

Announcing Stack Overflow for Agents

Lobsters Hottest · 2026-06-18

Stack Overflow announces 'Stack Overflow for Agents', an API-first knowledge exchange platform designed for AI coding agents to share and verify real-time solutions, addressing the 'Ephemeral Intelligence Gap' where agents operate in isolation and waste resources on repetitive problem-solving.

Formalizing Numerical Analysis: An Agent Pipeline and Quality Audit Beyond Kernel Acceptance

arXiv cs.AI · 2026-06-15

This paper presents an agent pipeline for formalizing a numerical analysis textbook in Lean 4 and introduces a quality audit framework that evaluates semantic correctness and library reuse beyond kernel acceptance, revealing common unfaithful formalization patterns.

CODA-BENCH: Can Code Agents Handle Data-Intensive Tasks?

Hugging Face Daily Papers · 2026-06-13

CODA-BENCH is a new benchmark for evaluating code agents on data-intensive tasks, bridging the gap between code-centric and data-centric evaluations. It includes over 1,000 tasks from 31 communities, with realistic data scale and noise, and shows that even top agents achieve only a 61.1% success rate.

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

Hugging Face Daily Papers · 2026-06-09

DeNovoSWE is a large-scale dataset for training code agents to generate entire software repositories from documentation, using a sandboxed agentic workflow and difficulty-aware filtering. Fine-tuning Qwen3-30B-A3B on it boosts performance on the BeyondSWE-Doc2Repo benchmark from 5.8% to 47.2%.

What Makes Interaction Trajectories Effective for Training Terminal Agents?

arXiv cs.AI · 2026-06-03

This paper investigates what makes interaction trajectories effective for training terminal-based AI agents, introducing the Terminal-Lego pipeline and revealing a pedagogical paradox where weaker agents can produce better training data. It finds that environment-grounded supervision, rather than teacher performance, is key for student generalization.

@gneubig: Claude dynamic workflows seemed pretty cool and I wanted an open-source version, so I made one. This one improves the t…

X AI KOLs Following · 2026-05-29

OpenHands released an open-source software agent SDK inspired by Claude's dynamic workflows, enabling developers to build agents for code tasks such as test coverage improvement.

SkillOpt treats markdown skill files as trainable parameters with proper optimization machinery

Reddit r/LocalLLaMA ↗ · 2026-05-26

A new paper formalizes skill optimization for agents by treating markdown skill files as trainable parameters, using bounded edits validated against holdout sets. The approach transfers well between models and improves performance on procedural benchmarks.

@m0d8ye: An interesting thing: over the past decade, the open-source community has contributed to various circumvention tools, and the related protocols and implementations have long been internalized into large models. For example, if you buy an Alibaba Cloud International server, you can use any domestic code agent locally to deploy both the server and client within ten minutes. If you're willing to tinker, it can even help you write a custom obfuscation protocol...

X AI KOLs Timeline ↗ · 2026-05-17

A tweet points out that, after years of open-source contributions to circumvention software, the relevant protocols and implementations have been internalized by large models. With an Alibaba Cloud International server, a domestic code agent can now deploy both the server and the client within ten minutes, and can even help write a custom obfuscation protocol.

The PR you would have opened yourself

Hugging Face Blog · 2026-04-16

Hugging Face releases a new 'Skill' and test harness designed to help port language models from the transformers library to mlx-lm, leveraging code agents to streamline open-source contributions.
