llm-red-teaming

Tag

Cards List
#llm-red-teaming

@HuggingPapers: Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance Naver AI eliminates unsta…

X AI KOLs Following · 2026-05-09 Cached

Naver AI introduces Stable-GFlowNet, a method to improve LLM red-teaming by eliminating unstable partition function estimation in Generative Flow Networks through contrastive trajectory balance.

0 favorites 0 likes
#llm-red-teaming

STAR-Teaming: A Strategy-Response Multiplex Network Approach to Automated LLM Red Teaming

arXiv cs.CL · 2026-04-22 Cached

STAR-Teaming introduces a multiplex-network-driven multi-agent framework that automates LLM red-teaming, achieving higher attack success rates with lower compute by organizing attack strategies into interpretable semantic communities.

0 favorites 0 likes
← Back to home

Submit Feedback