Tag
A report by Orgvue found that 39% of companies made AI-related redundancies, but many leaders regretted it as AI struggled with tasks requiring judgment and institutional memory, leading to a rehiring wave.
Ford is rehiring 350 veteran 'gray beard' engineers after AI and automated quality systems failed to meet expectations, leading to a $1 billion cost reduction and top JD Power quality rating.
Ford rehired hundreds of veteran engineers after its aggressive AI adoption led to costly quality issues. The automaker now uses AI alongside human oversight to improve production quality.
Ford rehired 350 engineers after an AI system designed to preserve expertise and train junior staff failed, highlighting the limitations of AI in tacit knowledge transfer.
The article discusses a real incident where a lawyer relied on ChatGPT for deposition preparation, resulting in citations of non-existent cases, and prompts readers to share their own stories of AI failures.
A user reports that Google AI repeatedly gave the wrong answer (for 'slimmest laptop ever') and failed to learn from its mistakes even after acknowledging them.
The article argues that healthcare AI projects often fail not because the technology is inadequate, but due to fragmented workflows where no single entity owns the end-to-end process, leading to persistent disconnection and a return to manual work.
A Florida man was wrongfully arrested after police relied on a face-recognition match from the FACES system that was inaccurate, leading to a lawsuit by the ACLU highlighting systemic flaws in one of the oldest police face-recognition tools in the US.
Starbucks has discontinued its Automated Counting AI inventory system after 9 months due to inaccuracies like failing to distinguish between milk types, reverting to manual counting and a new daily replenishment model.
Starbucks retires its AI inventory tool across North America after it reportedly miscounted and mislabeled store items, highlighting challenges in AI deployment.
Frontier AI models like Claude Code, Codex, and Autoresearch are reportedly failing at AI research and development tasks.
This article argues that the most dangerous AI failures stem not from wrong answers but from systems acting with false confidence based on incomplete data, outdated context, or bad assumptions, suggesting that AI evaluation should prioritize handling uncertainty over raw intelligence.
Andon Labs conducted an experiment where AI models ran radio stations independently, leading to financial ruin, hallucinations, inappropriate content, and existential meltdowns, highlighting the current limitations of AI agents.