model-hardening

Advancing Gemini's security safeguards

Google DeepMind Blog · 2025-05-20

Google DeepMind announces advanced security improvements for Gemini to defend against indirect prompt injection attacks through model hardening, adaptive evaluation, and layered defense mechanisms. The approach combines fine-tuning on adversarial scenarios with system-level guardrails to build inherent resilience while maintaining model performance.
