risk-mitigation

Our commitment to community safety

OpenAI Blog · 2026-04-28

OpenAI outlines its commitment to community safety, detailing how ChatGPT is trained to detect and mitigate risks of violence and harm through refined safeguards and expert input.

Preparing for future AI risks in biology

OpenAI Blog · 2025-06-18

OpenAI publishes a comprehensive approach to managing dual-use risks from advanced AI models in biology, outlining strategies for enabling beneficial scientific discovery while preventing misuse for bioweapons development through expert collaboration, model training, detection systems, and security controls.

Updating the Frontier Safety Framework

Google DeepMind Blog · 2025-02-04

DeepMind has published an updated Frontier Safety Framework (v2.0) with stronger security protocols for frontier AI models, including new Critical Capability Level (CCL) security recommendations and enhanced approaches to deceptive alignment risks. The framework aims to prevent unauthorized model weight exfiltration and manage risks as AI systems become more powerful.

OpenAI’s Approach to Frontier Risk

OpenAI Blog · 2023-10-26

OpenAI publishes details on its approach to frontier AI risks and announces progress on the voluntary safety commitments it made in July 2023, including the release of the DALL-E 3 system card and the development of a new Preparedness Framework to manage catastrophic risks from advanced AI systems.
