hallucination

#hallucination

AI is the Ultimate Bullshitter

Reddit r/artificial · 20h ago

An opinion piece arguing that AI systems, especially large language models, are fundamentally bullshitters because they generate plausible but false information without understanding or intent to deceive.

What's your "this is why we can't blindly trust AI" story?

Reddit r/artificial · 21h ago

The article discusses a real incident where a lawyer relied on ChatGPT for deposition preparation, resulting in citations of non-existent cases, and prompts readers to share their own stories of AI failures.

@manateelazycat: Isn't the Yunnan middle school exam paper generated by AI? Or is it an AI with low intelligence and severe hallucinations? As always, AI can improve efficiency, but it requires higher standards for testing/review.

X AI KOLs Following · yesterday Cached

A comment suggesting that a Yunnan middle school exam paper was generated by AI, citing it as an example of the hallucination problem and arguing that while AI improves efficiency, it demands stricter testing and review.

GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2

Hacker News Top · 4d ago Cached

A blog post comparing hallucination rates of major AI models reveals that smaller open-source models like GLM-5.2 hallucinate significantly less than larger proprietary models like GPT-5.5, suggesting diminishing returns from scaling model size.

Local Qwen isn't a worse Opus, it's a different tool

Lobsters Hottest · 5d ago Cached

Alex Ellis compares local Qwen models to cloud-based Claude Opus, sharing his experience using local AI in his software business. He highlights the practical value of local models for specific tasks while acknowledging their limitations, such as hallucination and infinite loops when quantized.

OpenAI Built Intelligence. Who Will Build Trust?

Reddit r/artificial · 6d ago

AutoFlow discusses the critical challenge of trust in AI, proposing external verification methods such as knowledge graphs and mathematical consistency checks, and announces acceptance into the NVIDIA Inception Program to advance research into trustworthy AI systems.

Agentic AI-based Framework for Mitigating Premature Diagnostic Handoff and Silent Hallucination in Healthcare Applications

arXiv cs.AI · 6d ago Cached

This paper proposes a multi-agent framework using deterministic orchestration and neuro-symbolic state tracking to mitigate premature diagnostic handoff and silent hallucinations in healthcare LLM applications.

Nex-N2 Pro is the real deal

Reddit r/LocalLLaMA · 2026-06-16

The writer shares their experience with Nex-N2 Pro, which they originally mistook for Rio-3.5, and finds that it performs exceptionally well on coding benchmarks without hallucinating, rivaling GPT-5.x on their Mac setup.

Built an AI pipeline that transforms financial news into structured analysis

Reddit r/ArtificialInteligence · 2026-06-15

The author built an AI pipeline that converts financial news into structured analysis (sentiment, risks, and opportunities), with a focus on consistency through prompt engineering and validation.

Show HN: 2 Weeks of Hallucinate – The Photo Gallery

Hacker News Top · 2026-06-13 Cached

A photo gallery showcasing two weeks of AI-generated hallucinatory images, hosted on hallucinate.site.

@FinanceYF5: GPT-5.5 lies constantly, but Grok 4.20 never lies. Kardle conducted a simulated experiment to see whether AI would lie in life-or-death moments.

X AI KOLs Following · 2026-06-13 Cached

Kardle conducted a simulated experiment comparing GPT-5.5 and Grok 4.20 in life-or-death situations to see if they would lie. The results showed that GPT-5.5 lied while Grok 4.20 did not.

SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings

arXiv cs.CL · 2026-06-12 Cached

This paper proposes SafeLLM, an extraction-based approach for retrieving information from safety-critical documents, showing that line-number selection outperforms rewriting-based RAG methods in reducing hallucinations while maintaining high recall.
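The idea of selecting line numbers instead of rewriting can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the sample document, and the selector output below are all hypothetical, and the step where a model chooses the line numbers is replaced by a hard-coded list. The point is only that text copied by index cannot contain anything absent from the source.

```python
from typing import Iterable


def number_lines(document: str) -> list[str]:
    """Split a document into lines that a selector can refer to by 1-based index."""
    return document.splitlines()


def extract_by_line_numbers(lines: list[str], selected: Iterable[int]) -> list[str]:
    """Return the selected lines verbatim, skipping invalid or repeated indices.

    The output is copied from the source, so a bad index can only cause a
    missing line, never invented text.
    """
    seen: set[int] = set()
    extracted: list[str] = []
    for n in selected:
        if 1 <= n <= len(lines) and n not in seen:
            seen.add(n)
            extracted.append(lines[n - 1])
    return extracted


# Hypothetical safety document and hypothetical selector output.
doc = "Wear gloves.\nDisconnect power before servicing.\nStore below 25 C."
lines = number_lines(doc)
# Index 9 does not exist and the second 2 is a repeat; both are dropped.
result = extract_by_line_numbers(lines, [2, 9, 2])
```

In this sketch a faulty selection degrades recall rather than correctness, which is the trade-off the summary describes.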

From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data

arXiv cs.AI · 2026-06-11 Cached

This paper analyzes hallucination in large language models as a structural consequence of three architectural decisions: self-attention's co-occurrence learning, maximum likelihood estimation training objective, and autoregressive decoding's left-to-right commitment. It maps each mechanism to specific hallucination types and argues that dataset pathologies amplify but do not cause these vulnerabilities.

⚠️ ChatGPT is recommending scam online stores and fake websites

Reddit r/ArtificialInteligence · 2026-06-10

ChatGPT has been caught recommending fake scam websites and cloned stores of defunct brands, raising concerns about its training data being poisoned and the safety of AI-powered shopping assistants.

Integrating Local and Global Entropy for Uncertainty Quantification in LLMs

arXiv cs.LG · 2026-06-10 Cached

This paper proposes Global-Local Uncertainty (GLU), an unsupervised single-pass score that fuses token-level local entropy with hidden-state geometric global entropy for uncertainty quantification in LLMs, showing that the two are near-orthogonal and together capture confident-but-wrong failures.

what happens if you instruct your go-to AI model to: "NEVER HALLUCINATE!!!"

Reddit r/singularity · 2026-06-09

A thought experiment questions whether instructing an AI model to never hallucinate would trigger self-reflection or result in the model gaslighting itself into believing it isn't hallucinating.

The agent says "I sent the email." It never called send_email. Does this hit you too?

Reddit r/AI_Agents · 2026-06-09

Discusses a common failure mode in AI agents where the model confidently claims to have performed an action (e.g., sending an email) without actually executing the required tool call, and asks the community how they detect and handle such silent failures in production.
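One way to catch this failure mode is to compare what the agent says it did against the tool calls actually recorded for that turn. The sketch below is an assumption-laden illustration, not something described in the post: `ToolCall`, `CLAIM_PATTERNS`, and `unbacked_claims` are hypothetical names, and the keyword pattern is deliberately crude (it would also match "I have not sent the email").

```python
import re
from dataclasses import dataclass


@dataclass
class ToolCall:
    """A record of one tool invocation logged by the agent runtime (hypothetical)."""
    name: str


# Hypothetical mapping: tool name -> phrasing that claims the tool was used.
CLAIM_PATTERNS = {
    "send_email": re.compile(r"\b(sent|emailed)\b", re.IGNORECASE),
}


def unbacked_claims(reply: str, calls: list[ToolCall]) -> list[str]:
    """Return tools the reply claims to have used with no matching logged call."""
    executed = {call.name for call in calls}
    return [
        tool
        for tool, pattern in CLAIM_PATTERNS.items()
        if pattern.search(reply) and tool not in executed
    ]


# The reply claims an email was sent, but no tool call was logged.
flagged = unbacked_claims("I sent the email.", [])
# The same reply backed by a logged send_email call raises no flag.
ok = unbacked_claims("I sent the email.", [ToolCall("send_email")])
```

A check like this only shows that a call was logged, not that it succeeded, so in practice it would sit alongside inspection of the tool's return value.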

I launched a brand-new author identity with zero web presence. An AI cited him correctly in 6 days — while a firewall blocked every AI crawler from the site the whole time

Reddit r/artificial · 2026-06-05

An author created a new fictional identity with zero web presence and found that AI models cited it correctly within 6 days despite a firewall blocking all AI crawlers from the website, revealing that AIs stitch together information from Knowledge Graphs and third-party mentions rather than direct crawling.

Most People are Researching With AI in All the Wrong Ways and If We Don't Find Solutions for This, It Could Ruin Future Generations

Reddit r/ArtificialInteligence · 2026-06-05

An opinion piece argues that current AI research tools like Perplexity and Gemini are flawed due to hallucinations, and advocates for using AI with a curated siloed knowledge base of credible books to ensure grounded truth and prevent distorted worldviews from harming future generations.

Faithful uncertainty in LLM agents: calibration vs utility tradeoff in practice [D]

Reddit r/MachineLearning · 2026-06-04

A practitioner discusses the calibration vs. utility tradeoff in LLM agents, sharing experience with a verifier-based pipeline that reduces hallucinated tool calls by ~60% but introduces latency costs and drops easy correct answers.
