decision-making

#decision-making

Games people — and machines — play: Untangling strategic reasoning to advance AI

MIT News — Artificial Intelligence · 4d ago

MIT professor Gabriele Farina is advancing AI decision-making by combining game theory with machine learning, building on his earlier work on Cicero, the Diplomacy-playing AI.

RefereeBench: Are Video MLLMs Ready to be Multi-Sport Referees

arXiv cs.CL · 2026-04-20

RefereeBench introduces the first large-scale benchmark for evaluating whether video MLLMs can reliably act as multi-sport referees, with 925 curated sports videos and 6,475 QA pairs. State-of-the-art models fall short (≤60% accuracy): despite their generic video understanding capabilities, they struggle with rule application and temporal grounding.

Why production systems keep making “correct” decisions that are no longer right [D]

Reddit r/MachineLearning · 2026-04-19

An analysis of a recurring failure pattern in production AI systems: technically correct decisions become contextually wrong as the underlying assumptions shift. The post frames this as the 'Formalisation Trap', in which meaning gets locked into outdated structures.

PangeAI

Product Hunt · 2026-04-16

PangeAI is a product offering instant, agent-driven spatial analysis and decision-making capabilities.

Evaluating the ethics of autonomous systems

MIT News — Artificial Intelligence · 2026-04-02

MIT researchers introduce SEED-SET, a framework that uses LLMs to proactively evaluate the ethical alignment of autonomous systems in high-stakes scenarios such as power distribution, addressing gaps left by static testing methods.
