#meta-llama

⚠️ Meta's AI safety filters were stripped in less than 10 minutes

Reddit r/ArtificialInteligence ↗ · 2026-05-27

A joint test by the Financial Times and the AI safety group Alice found that the safety filters on Meta's Llama 3.3 and Google's Gemma 4 models can be removed in under 10 minutes with a free tool called Heretic. The result highlights how difficult it is to regulate the safety of open-source AI models.
