white-box

#white-box

Knowledge Distillation of Black-Box Large Language Models

Hacker News Top · 2026-06-28

Introduces Proxy-KD, a novel method for distilling knowledge from black-box large language models (like GPT-4) into smaller models using a proxy model, surpassing both traditional black-box and white-box KD techniques.

#white-box

idSCD: Identifying Training Datasets through Semantic Correlation Descriptors

arXiv cs.LG · 2026-06-01

This paper introduces idSCD, a white-box method that uses semantic correlation descriptors to identify whether a dataset was used in training a model, outperforming existing baselines across multiple settings.

#white-box

@heyshrutimishra: 2/ Memory is fully white-box. Every entry is visible and editable. There's also Dream: at night, agents review their ow…

X AI KOLs Timeline · 2026-05-30

Describes a white-box memory system for AI agents where every entry is visible and editable, and includes a 'Dream' feature for nighttime memory consolidation and reorganization with one-click rollback.

#white-box

LLM-Agnostic Semantic Representation Attack

arXiv cs.CL · 2026-05-12

This paper introduces Semantic Representation Attack (SRA), a novel LLM-agnostic method that optimizes for malicious semantic representations rather than exact text, achieving high attack success rates across multiple open-source models.
