experiment

#experiment

I asked 4 AIs to pick a number. Why they all said 7?

Reddit r/artificial ↗ · 7h ago

An article exploring why four different AI models all chose the number 7 when asked to pick a number, highlighting potential biases in training data.

0 favorites 0 likes

#experiment

AI agent security is a small prayer the model says no. How are you routing models?

Reddit r/AI_Agents ↗ · 15h ago

The author conducted an experiment on Gmail with AI agents connected via OAuth, sending obfuscated prompt injection emails. Frontier models sometimes caught the attacks, while cheap models silently executed them, revealing that agent security largely depends on model cost and token budget rather than architectural safeguards.

0 favorites 0 likes

#experiment

How Fast Does Claude, Acting as a User Space IP Stack, Respond to Pings?

Hacker News Top ↗ · 3d ago Cached

The article describes a fun experiment using Claude Code to act as a user-space IP stack to process ICMP ping requests and measure response latency.

0 favorites 0 likes

#experiment

@FinanceYF5: Anthropic just quietly completed a magical experiment. They had Claude buy and sell second-hand items for employees for a whole week. Results: >186 transactions >Total $4000+ >Items from snowboards to a bag of ping pong balls >Opus users got better deals—but Haiku users were completely unaware…

X AI KOLs Following ↗ · 6d ago Cached

Anthropic conducted an internal experiment where they had Claude act as an agent for employees to buy and sell second-hand items over a week, successfully completing 186 transactions. The results showed that Opus users could negotiate better prices, while Haiku users were at a disadvantage, demonstrating the initial feasibility of an Agent-to-Agent economy.

0 favorites 0 likes

#experiment

Granite 4.1 3B SVG Pelican Gallery

Simon Willison's Blog ↗ · 2026-05-04 Cached

IBM released the Granite 4.1 family of LLMs under Apache 2.0, and Simon Willison experimented with generating SVG images of a pelican riding a bicycle using 21 different quantized variants of the 3B model.

0 favorites 0 likes

experiment

I asked 4 AIs to pick a number. Why they all said 7?

AI agent security is a small prayer the model says no. How are you routing models?

How Fast Does Claude, Acting as a User Space IP Stack, Respond to Pings?

Granite 4.1 3B SVG Pelican Gallery

Submit Feedback