Tag
TUA-Bench is a comprehensive benchmark for evaluating general-purpose terminal-use agents across diverse digital activities and specialized workflows, revealing significant performance gaps among current frontier agents.
Genesis AI unveiled Eno, its first general-purpose wheeled robot featuring human-scale dexterous hands and an optional cognitive interface screen instead of a head.
OpenAI announces the release of an API for accessing its AI models with a general-purpose text interface, launching in private beta with strict safety measures including mandatory production reviews and content restrictions to prevent harmful use cases.
OpenAI's Codex has evolved from a coding tool into a general-purpose AI agent, now used by knowledge workers for research, coordination, and data analysis, reducing hours of work into minutes.