opus

Tag

Cards List
#opus

Fable 5 benchmark with remotion video

Reddit r/singularity · 4h ago

Fable 5 shows overall improvement over Opus 4.8 in video generation benchmarks, but Gemini 3.1 Pro demonstrates more artistic vision despite issues with tool calls and buggy code.

#opus

Artificial Analysis | Google's Go To Website for Benchmaxxing | Gemini 3.1 Pro is nowhere near Opus 4.7 in real life use

Reddit r/singularity · 2d ago

The post argues that Google's Gemini 3.1 Pro falls well short of Opus 4.7 in real-world use, and describes Artificial Analysis as Google's go-to site for benchmaxxing.

#opus

@jakevin7: Anthropic finally got what it deserved. Now I don't have to go through all the trouble to get Claude, and I don't have to worry about account bans, because it's not worth it anymore. Opus is really getting worse. I thought Opus 4.7 was already disappointing. Opus 4.8 is really bad, noticeably bad. o…

X AI KOLs Following · 2026-06-01 Cached

A user complains that Anthropic's Claude Opus has declined in quality from version 4.7 to 4.8 and is considering canceling their subscription.

#opus

opus 4.8 is still very much blind - EyeBench-V3 visual benchmark (similar to IBench)

Reddit r/singularity · 2026-06-01

The EyeBench-V3 visual benchmark, which is similar to IBench, finds that Claude Opus 4.8 still fails basic vision tasks. The benchmark is introduced in a Twitter thread by Adonis Singh.

#opus

@yacineMTB: If this keeps up, everyone is going to switch to got 5.5 if they haven't already. It really seems like if you are still…

X AI KOLs Following · 2026-05-30 Cached

YacineMTB argues that GPT 5.5 (written "got 5.5" in the tweet, likely a typo) surpasses Anthropic's Opus models and suggests that users are switching away from Opus. Dylan Field criticizes Opus 4.8 for degraded curiosity and increased sycophancy.

#opus

@nick_kango: One more task to add to my twitter benchmark collection:) Btw, Opus 4.8 and all the SOTA models passed when i tried tha…

X AI KOLs Timeline · 2026-05-30 Cached

Nick Kang adds a new task to his Twitter benchmark collection; Claude Opus 4.8 and other SOTA models pass, while Sonnet 4.6 and Grok 4.3 fail. Alfin remarks on Opus 4.8's dangerous capabilities.

#opus

DeepSWE Opus 4.8 results have been released.

Reddit r/singularity · 2026-05-30

The DeepSWE results for Opus 4.8 have been released.

#opus

@ClaudeDevs: With Opus 4.8, you can add system instructions mid-conversation without breaking the prompt cache. More cache hits mean…

X AI KOLs Following · 2026-05-29 Cached

Claude Opus 4.8 allows adding system instructions mid-conversation without breaking the prompt cache, reducing cost and latency for API requests.

#opus

Opus vs Qwen given same bug, same repo, yet one agent finished 7x faster

Reddit r/AI_Agents · 2026-05-29

A comparison of Opus and Qwen AI coding agents on the same bug and repo shows one agent finished 7x faster, sparking discussion on skills for single-prompt GitHub issue solving.

#opus

@FinanceYF5: Official release:

X AI KOLs Timeline · 2026-05-29 Cached

Anthropic releases Claude Opus 4.8, building on Opus 4.7 with sharper judgment and longer independent work capability, available at the same price.

#opus

@bentossell: wait… if most people think 5.5 is better than 4.7, i assume that’s due to terminal coding benchmark… 4.8 is still outpe…

X AI KOLs Following · 2026-05-28 Cached

The tweet discusses the release of Claude Opus 4.8, which improves upon Opus 4.7 with sharper judgment and longer independent work, though it notes that version 5.5 still outperforms it on a terminal coding benchmark.

#opus

@julien_c: now tell me What's the % of weights changed between Opus 4.7 and Opus 4.8 <1%?

X AI KOLs Timeline · 2026-05-28 Cached

The tweet asks what percentage of the weights changed between Opus 4.7 and Opus 4.8, speculating that it is under 1%.

#opus

@0xSero: Anyone else notice opus-4.8 is worse than it was on launch? They chopped him.

X AI KOLs Following · 2026-05-28 Cached

A user observes that opus-4.8 performs worse than it did at launch.

#opus

@mark_k: Opus 4.8 is being prepared for release today by @AnthropicAI We might witness a rare dual release by OpenAI and Anthrop…

X AI KOLs Timeline · 2026-05-28 Cached

Anthropic is preparing to release Opus 4.8, potentially alongside a release from OpenAI, marking a rare dual release event.

#opus

Extremely simple internet radio controlled via IRC

Lobsters Hottest · 2026-05-25 Cached

tunecat is a simple, self-hosted internet radio player controlled via IRC, written in pure Go with Opus transcoding. It runs as a lightweight server that serves audio files and responds to IRC commands.

#opus

@bcherny: People often ask what my biggest tip is for getting the most out of Claude Code. These days my #1 tip is: use auto mode…

X AI KOLs Following · 2026-05-24 Cached

Boris Cherny recommends using auto mode in Claude Code for parallel sessions, and ClaudeDevs announces that auto mode is now available on the Pro plan and supports Sonnet 4.6 and Opus 4.7.

#opus

coding is basically solved for the boring 90% of tasks

Reddit r/singularity · 2026-05-23

A developer describes using cheap AI models (DeepSeek V4, Hunyuan Hy3 preview) to automate 90% of coding tasks while reserving Opus for the harder 10%, and highlights the cost and latency trade-offs.

#opus

Comparable to Opus they say...

Reddit r/ArtificialInteligence · 2026-05-23

The post relays a claim that a new AI model is comparable to Opus, a top-tier model.

#opus

Why are AI models getting more expensive?

Reddit r/singularity · 2026-05-22

The post discusses the unexpected rise in prices for advanced AI models such as Opus 4.7, GPT 5.5, and Gemini 3.5 Flash, in contrast with earlier expectations that prices would fall.

#opus

my agent bill went from $200 a week to $40 when I stopped running Opus on every subtask

Reddit r/AI_Agents · 2026-05-22

A developer shares how they reduced their AI agent's weekly cost from $200 to $40 by routing simple subtasks to cheaper models like DeepSeek V4 Pro and Tencent Hunyuan while keeping complex reasoning on Opus 4.7, achieving comparable output quality for most work.
