safety-critical

#safety-critical

Built a Paninian Retrieval-Augmented Generation (PRAG) framework for safer medical AI — seeking feedback

Reddit r/artificial · 4d ago

The PRAG framework combines traditional RAG with a Paninian rule engine for safer medical AI, achieving a 71% reduction in unsafe answers on MedQA. It provides auditable rule traces and is open-sourced.

ESBMC-PLC: Formal Verification of IEC 61131-3 Ladder Diagram Programs Using SMT-Based Model Checking

arXiv cs.CL · 4d ago

This paper presents ESBMC-PLC, the first open-source formal verifier with native support for IEC 61131-3 Ladder Diagram programs using SMT-based model checking, enabling automated verification of safety-critical industrial control logic.

DiRecT: Safe Diffusion-Based Planning via Receding-Horizon Denoising

arXiv cs.LG · 4d ago

DiRecT introduces a training-free algorithm for safe diffusion-based planning that enforces constraints only on final clean trajectories using receding-horizon denoising, improving safety and performance over existing methods.

BadWorld: Adversarial Attacks on World Models

Hugging Face Daily Papers · 6d ago

BadWorld is a label-free adversarial framework that reveals structural vulnerabilities in visual world models by generating imperceptible perturbations that cause catastrophic failures in future rollouts.

SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings

arXiv cs.CL · 2026-06-12

This paper proposes SafeLLM, an extraction-based approach for retrieving information from safety-critical documents, showing that line-number selection outperforms rewriting-based RAG methods in reducing hallucinations while maintaining high recall.

ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving

arXiv cs.AI · 2026-05-22

ScenePilot proposes a feasibility-guided, boundary-driven framework for generating safety-critical scenarios for autonomous driving, using constrained multi-objective reinforcement learning to produce physically valid yet failure-inducing scenarios.

Precise Verification of Transformers through ReLU-Catalyzed Abstraction Refinement

arXiv cs.AI · 2026-05-15

This paper proposes a transformer verification approach that uses ReLU to represent precise but non-linear bounds for dot products, making verification both accurate and efficient. The method outperforms state-of-the-art baselines on sentiment analysis models.
