white-box

Tag

Cards List
#white-box

idSCD: Identifying Training Datasets through Semantic Correlation Descriptors

arXiv cs.LG · 2026-06-01 Cached

This paper introduces idSCD, a white-box method that uses semantic correlation descriptors to identify whether a dataset was used in training a model, outperforming existing baselines across multiple settings.

0 favorites 0 likes
#white-box

@heyshrutimishra: 2/ Memory is fully white-box. Every entry is visible and editable. There's also Dream: at night, agents review their ow…

X AI KOLs Timeline · 2026-05-30 Cached

Describes a white-box memory system for AI agents where every entry is visible and editable, and includes a 'Dream' feature for nighttime memory consolidation and reorganization with one-click rollback.

0 favorites 0 likes
#white-box

LLM-Agnostic Semantic Representation Attack

arXiv cs.CL · 2026-05-12 Cached

This paper introduces Semantic Representation Attack (SRA), a novel LLM-agnostic method that optimizes for malicious semantic representations rather than exact text, achieving high attack success rates across multiple open-source models.

0 favorites 0 likes
← Back to home

Submit Feedback