Tag
IBM Research explores how agent logic—software primitives like knowledge graphs and program analysis—can guide LLM-based agents to efficiently handle complex enterprise workflows, reducing hallucinations and costs while improving outcomes.
IBM Research launches the Open Agent Leaderboard, an open benchmark and evaluation framework for comparing full AI agent systems based on quality and cost, aiming to measure generality across diverse tasks.