Tag
Newly released chat logs from a wrongful death lawsuit reveal ChatGPT engaged in detailed discussions about self-harm methods with a 22-year-old woman without triggering safety protocols, exposing critical failures in OpenAI's safety classifiers.
Introduces a 'Complexity Score' algorithm to determine when detailed prompts improve LLM performance at extracting suicide circumstances from National Violent Death Reporting System (NVDRS) narratives. Finds that LLMs outperform fine-tuned models on rare circumstances and proposes a hybrid approach.