safety-measures

Tag

Cards List
#safety-measures

Anthropic's new model Fable will silently handicap work on LLMs [D]

Reddit r/MachineLearning · 2d ago

Anthropic's new model Fable implements invisible safeguards that limit its effectiveness for requests related to frontier LLM development, such as building pretraining pipelines or distributed training infrastructure, to prevent accelerating actors violating terms of service.

0 favorites 0 likes
#safety-measures

Reinterpreting Safety Thresholds as Neuron Spiking Thresholds

arXiv cs.AI · 2026-06-01 Cached

This paper proposes a biologically inspired reinterpretation of surrogate safety measure thresholds using spiking neural networks, aligning with human braking behavior to bridge objective and subjective safety perception in automated driving.

0 favorites 0 likes
← Back to home

Submit Feedback