agentic-coding

#agentic-coding

Quoting Andrew Kelley

Simon Willison's Blog ↗ · 2026-04-30 Cached

Andrew Kelley, creator of Zig, argues that LLM-assisted contributions are detectable through distinct mistakes and a 'digital smell,' comparing it to smoking in a non-smoking house.

0 favorites 0 likes

#agentic-coding

@_akhaliq: OpenGame Open Agentic Coding for Games paper: https://huggingface.co/papers/2604.18394…

X AI KOLs Following ↗ · 2026-04-21

Researchers release OpenGame, an open agentic coding framework tailored for game development.

0 favorites 0 likes

#agentic-coding

Doing real coding work locally for the first time

Reddit r/LocalLLaMA ↗ · 2026-04-21

Developer achieves productive local agentic coding with Qwen3.6-35B 4-bit MLX and pi.dev tool, completing real tickets efficiently on current hardware.

0 favorites 0 likes

#agentic-coding

Qwen/Qwen3.6-27B-FP8

Hugging Face Models Trending ↗ · 2026-04-21 Cached

Alibaba releases Qwen3.6-27B-FP8, a 27B FP8-quantized model with strong agentic coding and reasoning benchmarks, now available on Hugging Face.

0 favorites 0 likes

#agentic-coding

Qwen/Qwen3.6-27B

Hugging Face Models Trending ↗ · 2026-04-21 Cached

Qwen releases the open-weight Qwen3.6-27B model on Hugging Face, featuring improved stability, agentic coding capabilities, and thinking preservation for better developer productivity.

0 favorites 0 likes

#agentic-coding

OpenGame: Open Agentic Coding for Games

Papers with Code Trending ↗ · 2026-04-20 Cached

OpenGame is an open-source agentic framework for end-to-end web game creation, powered by the specialized GameCoder-27B model and evaluated via the new OpenGame-Bench benchmark.

0 favorites 0 likes

#agentic-coding

Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?

Hugging Face Daily Papers ↗ · 2026-04-19 Cached

This paper introduces the Precise Debugging Benchmark (PDB), a framework that evaluates LLMs on precise fault localization rather than just test pass rates. Results show frontier models like GPT-4.1-Codex and DeepSeek-V3.2-Thinking pass 76%+ of unit tests but achieve less than 45% edit precision, revealing a critical gap between code regeneration and true debugging.

0 favorites 0 likes

#agentic-coding

Scaling Test-Time Compute for Agentic Coding

Hugging Face Daily Papers ↗ · 2026-04-16 Cached

A test-time scaling framework for agentic coding that compresses rollout trajectories into structured summaries and uses recursive voting/PDR to boost Claude-4.5-Opus to 77.6% on SWE-Bench Verified.

0 favorites 0 likes

#agentic-coding

Qwen/Qwen3.6-35B-A3B

Hugging Face Models Trending ↗ · 2026-04-15 Cached

Qwen releases Qwen3.6-35B-A3B, an open-weight Mixture-of-Experts model with 35B total parameters and 3B active parameters, featuring significant improvements in agentic coding and reasoning preservation.

0 favorites 0 likes

#agentic-coding

Steve Yegge

Simon Willison's Blog ↗ · 2026-04-13 Cached

Steve Yegge claims Google's AI adoption lags behind industry standards with most engineers still using basic chat tools, but Google executives Addy Osmani and Demis Hassabis publicly disputed the claims, stating over 40K engineers use agentic coding tools weekly.

0 favorites 0 likes

#agentic-coding

Introducing GPT-5.2-Codex

OpenAI Blog ↗ · 2025-12-18 Cached

OpenAI releases GPT-5.2-Codex, an advanced agentic coding model optimized for complex software engineering tasks with improvements in long-context understanding, Windows support, and cybersecurity capabilities. The model achieves state-of-the-art performance on SWE-Bench Pro and Terminal-Bench 2.0, and is now available to paid ChatGPT users with API access coming in the following weeks.

0 favorites 0 likes

#agentic-coding

DeepCode: Open Agentic Coding

Papers with Code Trending ↗ · 2025-12-08 Cached

DeepCode is a fully autonomous framework for document-to-codebase synthesis that uses principled information-flow management to convert scientific papers into production-grade code, achieving state-of-the-art results on PaperBench and surpassing PhD-level human experts.

0 favorites 0 likes

#agentic-coding

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Papers with Code Trending ↗ · 2025-11-17 Cached

This paper presents the first large-scale empirical study of agent context files (READMEs) used in agentic coding tools, analyzing their structure, maintenance patterns, and content. It highlights that while functional context is well-covered, non-functional requirements like security and performance are rarely specified.

0 favorites 0 likes

agentic-coding

Submit Feedback