Tag
OpenAI publishes details on its Model Spec, a formal framework defining how its AI models should behave across diverse use cases, emphasizing transparency, fairness, and safety as core principles for democratized AI development.
OpenAI announces its approach to AI localization through the OpenAI for Countries initiative, enabling governments to build sovereign AI systems adapted to local contexts while maintaining global frontier-level models. The company publishes detailed Model Spec guidelines with red-line principles ensuring human safety, rights, and factual accuracy across all deployments.
OpenAI has updated its Model Spec with new Under-18 Principles to guide ChatGPT's behavior for teen users aged 13-17, focusing on safety, age-appropriate interactions, and stronger guardrails around high-risk topics like self-harm and explicit content. The update was developed with input from the American Psychological Association and is grounded in developmental science.
OpenAI launches a collective alignment initiative to gather public input on AI model behavior, collecting feedback from over 1,000 people globally to inform updates to their Model Spec. The company is also releasing their public inputs dataset on HuggingFace to enable further AI alignment research.
OpenAI publishes a blog post outlining its commitment to intellectual freedom in ChatGPT design, emphasizing objectivity by default, user controls, and transparent principles through its Model Spec framework. The company highlights new personalization settings and ongoing efforts to evaluate and reduce political bias through stakeholder feedback.
OpenAI has released a major update to its Model Spec, a document defining desired AI model behavior, now publicly available under CC0 license. The update emphasizes customizability, transparency, and intellectual freedom while maintaining safety guardrails through a clear chain-of-command framework.
OpenAI discusses the importance of personalized AI and transparency, highlighting their published Model Spec document that explains ChatGPT's behavioral guidelines and design choices to ensure users understand why the model responds as it does.