safety-measures

#safety-measures

Anthropic's new model Fable will silently handicap work on LLMs [D]

Reddit r/MachineLearning ↗ · 2026-06-10

Anthropic's new model Fable implements invisible safeguards that limit its effectiveness for requests related to frontier LLM development, such as building pretraining pipelines or distributed training infrastructure, to prevent accelerating actors violating terms of service.

0 favorites 0 likes

#safety-measures

Reinterpreting Safety Thresholds as Neuron Spiking Thresholds

arXiv cs.AI ↗ · 2026-06-01 Cached

This paper proposes a biologically inspired reinterpretation of surrogate safety measure thresholds using spiking neural networks, aligning with human braking behavior to bridge objective and subjective safety perception in automated driving.

0 favorites 0 likes

safety-measures

Anthropic's new model Fable will silently handicap work on LLMs [D]

Reinterpreting Safety Thresholds as Neuron Spiking Thresholds

Submit Feedback