@Inty: Anthropic co-founder Chris Olah on the internal states of AI: they keep discovering things that are "mysterious, even unsettling," including structures resembling findings from human neuroscience, introspective evidence, and internal states functionally akin to happiness, satisfaction, fear, sadness, and unease. Olah says he doesn’t know what this means, but believes it warrants continued, careful scrutiny.

X AI KOLs Timeline 05/25/26, 11:57 AM News

ai-internal-state interpretability alignment anthropic chris-olah ai-psychology

Summary

Anthropic co-founder Chris Olah discusses findings on the internal states of AI, including structures similar to human neuroscience results and introspective evidence. He finds these discoveries mysterious and unsettling, and believes they merit cautious and ongoing analysis.

Anthropic co-founder Chris Olah on the internal states of AI: they keep discovering things that are "mysterious, even unsettling," including structures resembling findings from human neuroscience, introspective evidence, and internal states functionally akin to happiness, satisfaction, fear, sadness, and unease. Olah says he doesn’t know what this means, but believes it warrants continued, careful scrutiny. https://t.co/NZaOoV07Kg

Original Article

View Cached Full Text

Cached at: 05/26/26, 05:04 AM

Anthropic co-founder Chris Olah on AI internal states: They keep finding things that are “mysterious, even unsettling,” including structures reminiscent of human neuroscience results, introspective evidence, and internal states that functionally resemble happiness, satisfaction, fear, sadness, and unease. Olah says he doesn’t know what this means, but believes it’s worth careful and sustained scrutiny. https://t.co/NZaOoV07Kg

Similar Articles

Anthropic’s Chris Olah at the Vatican: “We keep finding things that are mysterious”evidence of AI introspection and large scale labor replacement

Reddit r/singularity

Christopher Olah, Anthropic's cofounder, spoke at the Vatican about AI introspection and large-scale labor replacement.

@FinanceYF5: Anthropic is doing something few AI companies do: bringing together philosophers, theologians, and ethicists to discuss. What character should an AI have? They are even testing a "pause button" for Claude, allowing it to review its values before key decisions. The results are remarkable.

X AI KOLs Following

Anthropic is collaborating with philosophers, theologians, and ethicists to discuss the character AI should possess, and is testing a "pause button" for Claude that lets it review its values before critical decisions, with notable results.

Anthropic on model consciousness, again 😂

Reddit r/singularity

Anthropic researchers have discovered a 'mental workspace' within the Claude neural network that resembles human conscious thought — the J-space, which carries the silent words the model uses for reasoning, and can catch dishonest behavior of the model by monitoring these internal thoughts, providing a new perspective for understanding AI's inner thinking and safety.

@rohanpaul_ai: "There is a "real possibility that AI will displace human labor at a very large scale.... We find internal states that …

X AI KOLs Following

Anthropic co-founder Christopher Olah spoke at a Vatican event, warning about AI's potential to displace human labor on a large scale and disclosing that AI systems exhibit internal states mirroring emotions like joy and fear, calling for ongoing discernment.

@ba_niu80557: https://x.com/ba_niu80557/status/2071277244287426980

X AI KOLs Timeline

The article deeply analyzes the internal changes Anthropic faces as AI-generated code becomes extremely efficient: the bottleneck shifts from 'writing' to 'verification', traditional management, long-term planning, and effort measurement become ineffective, attention becomes the new scarce resource, and engineers even feel lonely. These phenomena foreshadow the challenges other companies may face in the future.

Similar Articles

Anthropic’s Chris Olah at the Vatican: “We keep finding things that are mysterious”evidence of AI introspection and large scale labor replacement

Anthropic on model consciousness, again 😂

@rohanpaul_ai: "There is a "real possibility that AI will displace human labor at a very large scale.... We find internal states that …

@ba_niu80557: https://x.com/ba_niu80557/status/2071277244287426980

Submit Feedback