@pauliusztin_: Every day, 100+ people ask me, "How can I learn AI evals?" I copy-paste these 11 links (every time): 1. AI evals & obse…
Summary
A curated list of 11 links shared daily to help people learn AI evaluation techniques, covering evals, observability, LLM-as-judge, and agent evaluation.
Similar Articles
@xdotli: sharing my personal library on evals 1/n i put together the highest quality blogs, podcasts, papers, and projects on ev…
A Twitter thread sharing a curated personal library of high-quality blogs, podcasts, papers, and projects on AI evaluations (evals), inviting additions.
@MaxForAI: You'd be hard-pressed to find a better eval resource library. If you're interested in eval, these are what you should read. Thanks to @xdotli for sharing.
Share a curated AI evaluation (evals) resource library, including high-quality blogs, podcasts, papers, and projects, compiled by Xiangyi Li.
owainlewis/awesome-artificial-intelligence
A curated collection of must-use, actively maintained resources for building and shipping AI systems, covering AI engineering topics like RAG, agents, evals, guardrails, and deployment, along with recommended books, courses, and landmark papers.
@OpenAI: Let’s talk about evals. We’re always looking for better ways to measure and forecast model progress, especially as benc…
OpenAI discusses the importance of evals (evaluations) for measuring and forecasting model progress, especially as benchmarks become saturated or gamed, featuring insights from Tejal Patwardhan and Andrew Mayne.
@ajitcodes: Stop wasting hours trying to learn AI. I have already done it for you. With one list. Zero confusion. And no fluff. Vid…
A curated collection of links to videos, repositories, guides, books, and papers for learning about AI, LLMs, and building AI agents.