model-degradation

Tag

Cards List
#model-degradation

@0xSero: Anyone else notice opus-4.8 is worse than it was on launch? They chopped him.

X AI KOLs Following · 2026-05-28 Cached

User observes that the opus-4.8 model has degraded in performance since its launch.

0 favorites 0 likes
#model-degradation

@fiapp_pro: Officially announce, Codex GPT5.5 high is completely dead, probably because OpenAI is training GPT-5.6. Its performance on Codex is very lazy, hallucinates, loses context. Must enable xhigh to restore normal performance.

X AI KOLs Timeline · 2026-05-25 Cached

Users report that OpenAI's Codex GPT-5.5 high model performance has degraded, exhibiting laziness, hallucinations, and context loss. Suspecting it's due to OpenAI training GPT-5.6, they need to enable xhigh mode to restore normal performance.

0 favorites 0 likes
#model-degradation

Llama.cpp server running ~2 weeks straight. Loses its mind?

Reddit r/LocalLLaMA · 2026-05-14

User reports that Qwen3.6 models running on llama.cpp server become significantly less capable after ~2 weeks of continuous operation, and restarting sessions does not resolve the issue.

0 favorites 0 likes
#model-degradation

Arena AI Model ELO History

Hacker News Top · 2026-05-14 Cached

A tool that tracks the ELO history of major AI models from the LMSYS Arena leaderboard, revealing hidden trends like performance degradation and upgrades over time.

0 favorites 0 likes
#model-degradation

Did GPT5.5 get dumber/lazier yesterday for anyone else?

Reddit r/openclaw · 2026-05-12

A user running multiple agents reports that after upgrading to GPT-5.5, the model suddenly became less capable at executing tool calls and more prone to giving suggestions instead of acting, speculating OpenAI may be throttling for load management.

0 favorites 0 likes
#model-degradation

@0xLogicrw: MiniMax published a technical blog post detailing the root cause analysis for its M2 series large models' inability to output the person's name "Ma Jiaqi". Starting from a single case study, the investigation ultimately revealed a systematic degradation issue affecting nearly 5% of the entire vocabulary. The root cause was a severe disconnect in data coverage between the two training stages of the large model. In the first stage (pre-training), massive amounts of internet text were used to cre…

X AI KOLs Timeline · 2026-05-10

MiniMax published a technical blog post providing an in-depth analysis of the systematic vocabulary degradation issue behind its M2 series large models' inability to output specific personal names. It reveals parameter shifts caused by a disconnect in data coverage between pre-training and post-training stages, and proposes an effective solution involving full-scale synthetic data for remediation.

0 favorites 0 likes
#model-degradation

An actual example of "If you dont run it, you dont own it" and Gemma 4 beats both Chat GPT and Gemini Chat

Reddit r/LocalLLaMA · 2026-04-21

A user documents how closed models (GPT-4o→5.3, Gemini) degraded and censored Chinese novel translations, while local Gemma 4 31B now outperforms them with natural, uncensored output.

0 favorites 0 likes
← Back to home

Submit Feedback