ai-robustness

#ai-robustness

If someone spoofs your IoT sensor data, does your AI even have a way to know it's been fooled?

Reddit r/AI_Agents ↗ · 2026-05-27

Discusses how AI systems often trust sensor inputs without validation, using an example of a logistics company where spoofed temperature sensor data led to cargo damage, and questions whether AI can detect such spoofing.

0 favorites 0 likes

#ai-robustness

FragileFlow: Spectral Control of Correct-but-Fragile Predictions for Foundation Model Robustness

arXiv cs.CL ↗ · 2026-05-12 Cached

This paper introduces FragileFlow, a plug-in regularizer that improves the robustness of LLMs and VLMs by controlling 'correct-but-fragile' predictions through spectral analysis and PAC-Bayes bounds.

0 favorites 0 likes

#ai-robustness

PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks

Hugging Face Daily Papers ↗ · 2026-05-09 Cached

The paper introduces PASA, a robust watermarking algorithm for LLM-generated text that operates at the semantic level using latent embedding spaces to resist semantic-invariant attacks like paraphrasing.

0 favorites 0 likes

#ai-robustness

How Maximum Entropy makes Reinforcement Learning Robust

ML at Berkeley ↗ · 2021-07-26 Cached

This article explains how incorporating Shannon entropy into reinforcement learning objectives creates more robust agents capable of handling unexpected or adversarial changes in rewards and dynamics.

0 favorites 0 likes

ai-robustness

If someone spoofs your IoT sensor data, does your AI even have a way to know it's been fooled?

FragileFlow: Spectral Control of Correct-but-Fragile Predictions for Foundation Model Robustness

PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks

How Maximum Entropy makes Reinforcement Learning Robust

Submit Feedback