Anthropic co-founder Chris Olah discusses findings on the internal states of AI systems, including structures that resemble results from human neuroscience, as well as introspective evidence. He finds these discoveries mysterious and unsettling, and believes they merit cautious, ongoing analysis.