Tag
Anthropic provides detailed information on the cyber safety classifiers for Claude Fable 5 and introduces a draft jailbreak severity framework developed with Glasswing, aiming to standardize communication about AI jailbreak risks. The company also launched a HackerOne program for reporting potential cyber jailbreaks.