double-pendulum

#double-pendulum

Mechanisms of Misgeneralization in Physical Sequence Modeling

arXiv cs.LG · 2026-05-21

This paper identifies and analyzes 'physical misgeneralization' in generative sequence models, where individual trajectories appear plausible but the aggregate distribution over physical quantities is incorrect, and proposes a kernel-informed mitigation.


#double-pendulum

Gave GPT-4o and Claude the exact same double pendulum prompt. They picked opposite angle conventions within seconds.

Reddit r/ArtificialInteligence · 2026-05-16

In an experiment, GPT-4o, Claude 3.5 Sonnet, and other models were given the same double pendulum prompt and chose opposite angle conventions, producing an immediately visible mismatch in a shared renderer. The split follows model family rather than occurring at random, which suggests a bias in the training data distribution for classical mechanics problems.

