Tag
The article argues that agentic AI tools shift the bottleneck from coding ability to domain expertise, so the most valuable people are those who can verify correctness in both the code and the domain.
This paper introduces GrowLoop, a self-evolving evaluation system for assessing human-likeness in open-ended conversations. It uses minimal human seed annotations to iteratively refine evaluation rubrics, addressing challenges of tacit knowledge, varying human agreement, and evolving model capabilities.
An excerpt from a book-in-progress, drawing on anthropologist Julian Orr's ethnographic research published in 'Talking About Machines' (1996), explores how Xerox repair technicians in the 1980s relied on social knowledge-sharing and storytelling ('war stories') to maintain complex photocopiers.