Tag
OpenAI explores whether public chat data (WildChat) can effectively predict real-world AI misalignments, finding that simulated deployment using public datasets provides surprisingly accurate predictions of failure rates despite data age gaps.