Tag
Treedocs is a Swift CLI tool that generates a documented tree view of a repository and automatically checks for stale documentation entries, helping teams keep docs up to date.
Linux 7.2 has finally removed the deprecated strncpy API after six years and over 360 patches due to its bug-prone nature, replacing it with safer alternatives like strscpy and memcpy.
A team built an AI agent to automatically fix tech debt by scanning the codebase and opening PRs, finding that the hardest part was precisely defining the problem. They discuss challenges of running multiple agents on the same codebase and the need for guardrails.
HumanLayer launches an Agentic IDE and collaboration platform that enables engineers to ship 2-3x faster across the entire SDLC while maintaining rigorous code quality, already used by Fortune 500 companies like Block and Uber.
A reflection on the stages developers go through when adopting AI-assisted coding, from initial amazement to a balanced understanding, with concerns about how less experienced developers can learn to judge code quality when relying heavily on AI agents.
A story from a Windows x86 emulator team about encountering a program with a fully unrolled 64KB initialization loop (65,536 instructions) and adding a special optimization to replace it with a tight loop.
An analysis of how AI coding agents have shifted the bottleneck from writing code to reviewing it, with data showing a 861% increase in code churn and a rise in defect rates, making code review the most leveraged skill in software engineering.
The article argues that increasing the amount of AI-generated code does not necessarily improve team speed and may even reduce efficiency.
Discusses how AI agents enable rapid app creation but produce alien codebases lacking developer understanding, leading to a flood of apps with minimal user traction.
This article discusses how AI coding agent loops can inadvertently learn and propagate deprecated code patterns from existing codebases, leading to technical debt despite appearing successful.
The article explores the phenomenon of 'rockstar developers' who write clever but unmaintainable code, and draws parallels to the challenges introduced by AI-generated code, emphasizing the need for maintainability and team cohesion.
A thread from Charlie Marsh listing a series of pull requests to the Ruff project on GitHub.
FrontierCode is a new benchmark from Cognition AI that measures AI models' ability to write high-quality, maintainable code by evaluating mergeability. Results show even top models like Claude Opus 4.8 score only 13.4% on the hardest subset, highlighting a significant gap in code quality.
FrontierCode is a new coding benchmark from METR and Cognition that evaluates AI models on code maintainability and quality, revealing that many models produce unmergeable code. It includes over 1000 hours of work and shows that even top models struggle, with Opus 4.8 achieving only 13.8% on the hardest tier.
Cognition announces FrontierCode, a new coding evaluation benchmark that goes beyond unit tests to measure code quality, scope, test correctness, and human reviewer approval, addressing the issue of agents writing sloppy code that passes tests but is not maintainable.
compound-engineering-plugin is an AI coding plugin that avoids technical debt by allocating 80% of resources to planning and review, and 20% to execution. It includes 37 skills and 51 agents, supporting three major platforms: Claude Code, Codex, and Cursor.
A blog post arguing that while users don't directly care about code internals, good code quality is essential for performance, bug fixes, and feature delivery, contrary to the common cliché that only user-facing results matter.
A developer blog post argues against the idea of shipping AI-generated code to production without reading it, emphasizing that code review serves critical purposes: diffusing responsibility, reducing bus factor, and maintaining team knowledge of the codebase.
A developer reflects on a messy core data structure in their app, acknowledging technical debt but accepting they will never prioritize fixing it unless it hinders AI code generation.
A repository of Clean Code Skills for AI agents that enforce Robert C. Martin's principles to improve AI-generated code maintainability and reduce technical debt. It provides modular skills for Python and TypeScript to guide agents in writing cleaner, more structured code.