explainability-attack


Can Subgraph Explanations Be Weaponized to Steal Graph Neural Networks?

arXiv cs.LG · 2026-06-01

This paper presents the first model extraction attack on graph classification under strict black-box constraints, exploiting subgraph explanations to estimate decision boundaries. The findings reveal that mandated explainability interfaces create exploitable security vulnerabilities in Graph Neural Network services.
