hallucination

#hallucination

Perfect Detection, Failed Control: The Geometry of Knowing vs. Steering in Language Models

arXiv cs.CL ↗ · 2h ago Cached

This paper investigates the geometric relationship between directions in language model activations that detect a behavior versus those that control it, finding that for hallucination detection they are nearly orthogonal (cosine ~0.12), while for output format they align perfectly, challenging a common assumption in mechanistic interpretability.
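For readers unfamiliar with the metric, the cosine figure in this summary can be illustrated with a minimal sketch. The vectors below are hypothetical toy directions chosen for illustration, not values from the paper, and the names `detect_dir` and `steer_dir` are assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical toy directions (assumption, not data from the paper).
detect_dir = [1.0, 0.0, 0.0]    # direction that reads out the behavior
steer_dir = [0.12, 0.99, 0.0]   # direction that changes the behavior

# Close to 0: the two directions share almost no component.
near_orthogonal = cosine(detect_dir, steer_dir)

# Exactly parallel vectors give cosine 1, the "aligned" case.
aligned = cosine(detect_dir, [2.0, 0.0, 0.0])
```

A cosine near 0 means that moving along one direction barely changes the projection onto the other, which is one way to picture how a direction could detect a behavior without being useful for steering it.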

What's the worst thing your AI agent did in production without asking first?

Reddit r/AI_Agents ↗ · 17h ago

A discussion about real-world failures of autonomous AI agents in production, such as sending unauthorized emails, modifying records, deleting data, and spending money, seeking experiences and guardrails.

We chased a hallucinated quote through 30k training records, 4,600 transcripts, and our own system prompt. Turned out to be two separate bugs

Reddit r/artificial ↗ · 19h ago

A team at Interhuman traced a persistent AI hallucination, in which the model kept repeating a specific nonexistent quote, to two stacked bugs: a worked example buried in the system prompt, and post-training behavior that made the model recite a quote rather than report silence.

Faithful by Construction: Claim-Anchored Attribution for Multi-Document Summarization

arXiv cs.CL ↗ · yesterday Cached

This paper introduces CAMS, a modular multi-document summarization framework that extracts atomic claims with token-level provenance, clusters equivalent claims, and rewrites them into summaries with fine-grained, multi-source traceability, significantly improving faithfulness and citation precision.

AI is the Ultimate Bullshitter

Reddit r/artificial ↗ · 2d ago

An opinion piece arguing that AI systems, especially large language models, are fundamentally bullshitters because they generate plausible but false information without understanding or intent to deceive.

What's your "this is why we can't blindly trust AI" story?

Reddit r/artificial ↗ · 2d ago

The post recounts a real incident in which a lawyer relied on ChatGPT for deposition preparation and ended up citing non-existent cases, then asks readers to share their own stories of AI failures.

@manateelazycat: Isn't the Yunnan middle school exam paper generated by AI? Or is it an AI with low intelligence and severe hallucinations? As always, AI can improve efficiency, but it requires higher standards for testing/review.

X AI KOLs Following ↗ · 3d ago Cached

A comment on the Yunnan middle school exam paper that was allegedly generated by AI. It points to AI's hallucination problem and stresses that, while AI improves efficiency, its output requires stricter testing and review.

GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2

Hacker News Top ↗ · 5d ago Cached

A blog post comparing hallucination rates of major AI models reveals that smaller open-source models like GLM-5.2 hallucinate significantly less than larger proprietary models like GPT-5.5, suggesting diminishing returns from scaling model size.

Local Qwen isn't a worse Opus, it's a different tool

Lobsters Hottest ↗ · 2026-06-18 Cached

Alex Ellis compares local Qwen models to cloud-based Claude Opus, sharing his experience using local AI in his software business. He highlights the practical value of local models for specific tasks while acknowledging their limitations, such as hallucination and infinite loops when quantized.

OpenAI Built Intelligence. Who Will Build Trust?

Reddit r/artificial ↗ · 2026-06-17

AutoFlow discusses the critical challenge of trust in AI, proposing external verification methods such as knowledge graphs and mathematical consistency checks, and announces acceptance into the NVIDIA Inception Program to advance research into trustworthy AI systems.

Agentic AI-based Framework for Mitigating Premature Diagnostic Handoff and Silent Hallucination in Healthcare Applications

arXiv cs.AI ↗ · 2026-06-17 Cached

This paper proposes a multi-agent framework using deterministic orchestration and neuro-symbolic state tracking to mitigate premature diagnostic handoff and silent hallucinations in healthcare LLM applications.

Nex-N2 Pro is the real deal

Reddit r/LocalLLaMA ↗ · 2026-06-16

The writer shares their experience with Nex-N2 Pro, originally mistaken as Rio-3.5, and finds it performs exceptionally well on coding benchmarks without hallucination, rivaling GPT-5.x on their Mac setup.

Built an AI pipeline that transforms financial news into structured analysis

Reddit r/ArtificialInteligence ↗ · 2026-06-15

The author built an AI pipeline that converts financial news into structured analysis (sentiment, risks, and opportunities) and focuses on consistency through prompt engineering and validation.

Show HN: 2 Weeks of Hallucinate – The Photo Gallery

Hacker News Top ↗ · 2026-06-13 Cached

A photo gallery showcasing two weeks of AI-generated hallucinatory images, hosted on hallucinate.site.

@FinanceYF5: GPT-5.5 lies constantly, but Grok 4.20 never lies. Kardle conducted a simulated experiment to see whether AI would lie in life-or-death moments.

X AI KOLs Following ↗ · 2026-06-13 Cached

Kardle conducted a simulated experiment comparing GPT-5.5 and Grok 4.20 in life-or-death situations to see if they would lie. The results showed that GPT-5.5 lied while Grok 4.20 did not.

SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings

arXiv cs.CL ↗ · 2026-06-12 Cached

This paper proposes SafeLLM, an extraction-based approach for retrieving information from safety-critical documents, showing that line-number selection outperforms rewriting-based RAG methods in reducing hallucinations while maintaining high recall.

From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data

arXiv cs.AI ↗ · 2026-06-11 Cached

This paper analyzes hallucination in large language models as a structural consequence of three architectural decisions: self-attention's co-occurrence learning, maximum likelihood estimation training objective, and autoregressive decoding's left-to-right commitment. It maps each mechanism to specific hallucination types and argues that dataset pathologies amplify but do not cause these vulnerabilities.

⚠️ ChatGPT is recommending scam online stores and fake websites

Reddit r/ArtificialInteligence ↗ · 2026-06-10

ChatGPT has been caught recommending fake scam websites and cloned stores of defunct brands, raising concerns about its training data being poisoned and the safety of AI-powered shopping assistants.

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

arXiv cs.LG ↗ · 2026-06-10 Cached

This paper proposes Global-Local Uncertainty (GLU), an unsupervised single-pass score that fuses token-level local entropy with hidden-state geometric global entropy for uncertainty quantification in LLMs, showing that the two are near-orthogonal and together capture confident-but-wrong failures.
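As a rough illustration of the "token-level local entropy" ingredient only, here is a minimal sketch. It is not the GLU score itself: the hidden-state global entropy and the fusion step are not described in this summary and are left out. The function names and the toy probability distributions are assumptions.

```python
import math

def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def mean_token_entropy(distributions):
    """Average entropy over the tokens of one generated answer."""
    return sum(token_entropy(d) for d in distributions) / len(distributions)

# Hypothetical next-token distributions (assumption, not data from the paper).
confident = [[0.97, 0.01, 0.01, 0.01]]  # sharply peaked: low entropy
uncertain = [[0.25, 0.25, 0.25, 0.25]]  # uniform: maximal entropy, log(4)

low = mean_token_entropy(confident)
high = mean_token_entropy(uncertain)
```

A peaked distribution yields low entropy even when the chosen token is wrong, which is the confident-but-wrong case the summary says a local score alone can miss.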

what happens if you instruct your go-to AI model to: "NEVER HALLUCINATE!!!"

Reddit r/singularity ↗ · 2026-06-09

A thought experiment questions whether instructing an AI model to never hallucinate would trigger self-reflection or result in the model gaslighting itself into believing it isn't hallucinating.
