model-defense

Tag

Cards List
#model-defense

The Distillation Game: Adaptive Attacks & Efficient Defenses

Hugging Face Daily Papers · 2026-05-29 Cached

This paper studies distillation attacks where model outputs can enable imitation, proposing a minimax game framework and a forward-pass-only defense called Product-of-Experts, showing that adaptive students recover more capability than passive evaluation suggests.

0 favorites 0 likes
← Back to home

Submit Feedback