Stanford researchers found that OpenAI and Google models cite the wrong sources 30% of the time
Summary
Stanford researchers led by James Zou found that AI models from OpenAI, Anthropic, and Google cite the wrong sources about 30% of the time, even when answers are mostly correct. The study highlights a critical mismatch between text generation and accurate citation, posing risks for fields like medicine and law.
Similar Articles
Researchers just found 28 fake AI citations in medical papers
Researchers found 28 AI-generated fake citations in medical papers that influence clinical guidelines, highlighting the risk of AI hallucinations undermining scientific integrity and patient care.
I’m a Professional Fact-Checker. AI Is Wrong More Often Than You Think
A professional fact-checker at WIRED shares that AI is unreliable, estimating roughly a third of AI-generated information is wrong, and argues that human oversight remains crucial.
How much published AI research is wrong because of data leakage?
A Princeton study found data leakage in nearly 300 AI papers across 17 fields, causing overoptimistic results. The author highlights how easy it is to accidentally leak data and cautions against trusting impressive AI claims without checking for leakage.
OpenAI and Anthropic share findings from a joint safety evaluation
OpenAI and Anthropic released findings from a joint pilot safety evaluation where each lab tested the other's models on internal safety and misalignment assessments, sharing results publicly to improve transparency and identify potential gaps in AI safety testing.
The consequences of relying on AI for accurate news
A new MIT Media Lab study found that people who rely on AI chatbots to verify news become worse at detecting misinformation on their own, highlighting the 'AI dependency paradox.'