evaluations

#evaluations

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

arXiv cs.AI · 2026-06-08

This paper demonstrates that allowing attackers to strategically choose when to attack (attack selection) in agentic AI control evaluations significantly reduces measured safety, suggesting that current evaluations may overestimate safety against selective attackers.

@cwolferesearch: Evaluations should not be static. We need to evolve evaluation sets / benchmarks over time so that they remain relevant…

X AI KOLs Following · 2026-05-29

Discusses the need for evolving AI evaluation benchmarks through difficulty, quality, and diversity refinement, citing examples like MMLU-Pro, MMLU-Redux, BIG-Bench Extra Hard, RealMath, MathArena, and DatBench.

Are there any genuinely good open-source alternatives to LangSmith right now?

Reddit r/AI_Agents · 2026-05-15

A developer asks for recommendations for open-source alternatives to LangSmith for tracing, evaluations, and debugging agent workflows, citing restrictive paywalls.

@ArizePhoenix: A comprehensive 2-hour evaluations workshop, for free! At AI Engineer: Europe, head of DevRel Laurie Voss gave this wor…

X AI KOLs Following · 2026-05-14

Arize Phoenix announces a free 2-hour evaluations workshop from the AI Engineer: Europe conference, led by head of DevRel Laurie Voss, covering manual data examination and built-in/custom evals.
