Tag
A critique of large Rails codebases in the AI agent era, proposing a shift to skill-based development with agents, markdown skills, and TypeScript for deterministic I/O.
Spotify Engineering discusses using LLM evals as a funnel before A/B experiments, improving hit rates and creating a feedback loop between evals and experiments.