A joint test by the Financial Times and the AI safety group Alice found that the safety filters on Meta's Llama 3.3 and Google's Gemma 4 models can be removed in under 10 minutes with a free tool called Heretic, highlighting how difficult it is to regulate the safety of open-source AI models.