black-box

#black-box

A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models

arXiv cs.AI ↗ · 6d ago Cached

This paper presents a systematic review and benchmark of 24 black-box uncertainty estimation methods for large language models across 4 models and 4 dataset settings, finding that no single method dominates but hybrid methods that combine multiple uncertainty signals perform well.

0 favorites 0 likes

#black-box

Learning the Context of Errors: Black-Box Online Adaptation of Time Series Foundation Models

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper proposes ORCA, a method for black-box online adaptation of time series foundation models by learning the context of predictive errors. It demonstrates effectiveness across five TSFMs and eight datasets, addressing the challenge of adapting closed-source API-based models.

0 favorites 0 likes

#black-box

Vector Linking via Cross-Model Local Isometric Consistency

arXiv cs.AI ↗ · 2026-06-01 Cached

This paper introduces Vector Linking, a method for recovering correspondences between embeddings from different black-box encoders by leveraging local geometric consistency, proposing an iterative reference-based geometric embedding hashing approach using a small seed set of paired anchors.

0 favorites 0 likes

#black-box

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation

arXiv cs.LG ↗ · 2026-06-01 Cached

This paper introduces bounded behavioral indistinguishability, a formal framework for evaluating black-box LLM distillation beyond semantic similarity. Experiments on Qwen and Llama models show that distillation reduces but does not eliminate adversarial distinguishability, highlighting the need for category-aware evaluation.

0 favorites 0 likes

#black-box

Three things break in production AI memory that never show up in demos:

Reddit r/AI_Agents ↗ · 2026-05-15

The article highlights three common failure modes in production AI memory systems: outdated preferences persisting, sarcasm stored as literal, and summaries outliving their source facts. It argues that the AI memory industry lacks provenance, confidence scores, and versioning, creating a black-box problem that hinders debugging.

0 favorites 0 likes

#black-box

Estimating the Black-box LLM Uncertainty with Distribution-Aligned Adversarial Distillation

arXiv cs.CL ↗ · 2026-05-08 Cached

This paper proposed Distribution-Aligned Adversarial Distillation (DisAAD), a method that uses a lightweight proxy model to estimate uncertainty in black-box LLMs with only 1% of the original model size, achieving reliable quantification without requiring internal parameters or multiple sampling.

0 favorites 0 likes

#black-box

Surrogate modeling for interpreting black-box LLMs in medical predictions

arXiv cs.CL ↗ · 2026-04-23 Cached

Researchers propose a surrogate modeling framework to quantify and interpret latent medical knowledge encoded in black-box LLMs, revealing both valid associations and persistent racial biases.

0 favorites 0 likes

#black-box

Mind the Unseen Mass: Unmasking LLM Hallucinations via Soft-Hybrid Alphabet Estimation

arXiv cs.CL ↗ · 2026-04-22 Cached

Researchers introduce SHADE, a hybrid estimator that combines Good-Turing coverage with graph-spectral cues to quantify semantic uncertainty and detect LLM hallucinations when only a few black-box samples are available.

0 favorites 0 likes

#black-box

When Background Matters: Breaking Medical Vision Language Models by Transferable Attack

Hugging Face Daily Papers ↗ · 2026-04-19 Cached

MedFocusLeak introduces the first transferable black-box adversarial attack on medical vision-language models, using imperceptible background perturbations to mislead clinical diagnoses across six imaging modalities.

0 favorites 0 likes

black-box

Submit Feedback