The author predicts that in 2026 evals and analytics startups will transition into continual learning platforms; some will fail, and the ones with good taste will succeed.
Abliteration launches a made-to-order synthetic training data workflow that generates negative, rare, and adversarial examples for classifiers. The workflow covers schema, real-world facts, labels, provenance, and export to platforms like Hugging Face.
Arize AI is hosting Observe 2026, a conference in San Francisco focused on AI agents and evaluations, with speakers from OpenAI, Cursor, and Uber. The event features talks on multi-agent systems and frontier agentic AI.
A curated list of 11 links shared daily to help people learn AI evaluation techniques, covering evals, observability, LLM-as-judge, and agent evaluation.