trustworthiness

Tag

Cards List
#trustworthiness

TrustLDM: Benchmarking Trustworthiness in Language Diffusion Models

arXiv cs.CL · 2d ago Cached

Introduces TrustLDM, a comprehensive benchmark for evaluating safety, privacy, and fairness of Language Diffusion Models, revealing that their alignment degrades with malicious post contexts. Proposes an automatic evaluation framework, TrustLDM-Auto, to identify vulnerable configurations.

0 favorites 0 likes
#trustworthiness

Smoothed Elicitation Complexity for Approximate $\Gamma$-calibration of Discrete Classification Tasks

arXiv cs.LG · 2026-05-25 Cached

This paper characterizes approximate property calibration for discrete properties in multiclass classification, using Lipschitz continuous properties as an intermediary to reduce complexity from the number of classes to the elicitation complexity dimension.

0 favorites 0 likes
#trustworthiness

The Expense of Seeing: Attaining Trustworthy Multimodal Reasoning Within the Monolithic Paradigm

Hugging Face Daily Papers · 2026-05-21 Cached

This paper challenges the assumption that current Vision-Language Models faithfully synthesize multimodal data, proposing an information-theoretic Modality Translation Protocol with new metrics (Toll, Curse, Fallacy of Seeing) to evaluate trustworthiness over traditional multimodal gain.

0 favorites 0 likes
#trustworthiness

Trustworthy Agent Network: Trust in Agent Networks Must Be Baked In, Not Bolted On

arXiv cs.AI · 2026-05-20 Cached

This vision paper argues that trust in Agent-to-Agent (A2A) networks must be integrated from the ground up, as existing agent alignment techniques are insufficient to address systemic vulnerabilities like adversarial composition and semantic misalignment.

0 favorites 0 likes
#trustworthiness

A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook

Hugging Face Daily Papers · 2026-05-18 Cached

A comprehensive survey reviewing the trustworthiness challenges of Large Audio Language Models (LALMs), including vulnerabilities like cross-modal jailbreaking and acoustic backdoors, and proposing a defense-in-depth roadmap.

0 favorites 0 likes
← Back to home

Submit Feedback