transparency

Tag

Cards List
#transparency

What Information Should Agents Disclose When Recommending Products?

Reddit r/AI_Agents · 4h ago

The article raises design and ethical questions about what information AI agents should disclose when recommending products or services, including business partnerships, ranking criteria, and affiliate relationships, drawing parallels with traditional online advertising transparency patterns.

0 favorites 0 likes
#transparency

Bureaucratic Silences: What the Canadian AI Register Reveals, Omits, and Obscures

arXiv cs.AI · 2026-04-20 Cached

This paper analyzes Canada's Federal AI Register (409 systems) and argues that such transparency artifacts configure accountability through ontological design rather than enabling genuine contestability, finding that 86% of systems are internal-efficiency focused while human discretion is systematically obscured.

0 favorites 0 likes
#transparency

Towards Intrinsic Interpretability of Large Language Models: A Survey of Design Principles and Architectures

arXiv cs.CL · 2026-04-20 Cached

A comprehensive survey reviewing recent advances in intrinsic interpretability for Large Language Models, categorizing approaches into five design paradigms: functional transparency, concept alignment, representational decomposability, explicit modularization, and latent sparsity induction. The paper addresses the challenge of building transparency directly into model architectures rather than relying on post-hoc explanation methods.

0 favorites 0 likes
#transparency

Changes in the system prompt between Claude Opus 4.6 and 4.7

Simon Willison's Blog · 2026-04-18 Cached

Anthropic released Claude Opus 4.7 with notable system prompt changes including expanded child safety instructions, new tool integrations (Claude in PowerPoint, Chrome, Excel), and behavioral adjustments to reduce verbosity and improve task completion without unnecessary clarification.

0 favorites 0 likes
#transparency

Inside our approach to the Model Spec

OpenAI Blog · 2026-03-25 Cached

OpenAI publishes details on its Model Spec, a formal framework defining how its AI models should behave across diverse use cases, emphasizing transparency, fairness, and safety as core principles for democratized AI development.

0 favorites 0 likes
#transparency

Strengthening our safety ecosystem with external testing

OpenAI Blog · 2025-11-19 Cached

OpenAI announces a strengthened safety ecosystem through external third-party testing and evaluations of frontier AI models, including independent assessments, methodology reviews, and subject-matter expert probing. The company commits to transparency by publicly sharing third-party assessment results and supporting independent evaluations since GPT-4's launch.

0 favorites 0 likes
#transparency

Sharing the latest Model Spec

OpenAI Blog · 2025-02-12 Cached

OpenAI has released a major update to its Model Spec, a document defining desired AI model behavior, now publicly available under CC0 license. The update emphasizes customizability, transparency, and intellectual freedom while maintaining safety guardrails through a clear chain-of-command framework.

0 favorites 0 likes
#transparency

Disrupting deceptive uses of AI by covert influence operations

OpenAI Blog · 2024-05-30 Cached

OpenAI reports disrupting five covert influence operations attempting to misuse its AI models for deceptive campaigns, with findings showing that safety-designed models prevented threat actors from generating desired content. The company is publishing trend analysis and collaborating with industry, civil society, and government to combat AI-enabled information manipulation.

0 favorites 0 likes
#transparency

Introducing the Model Spec

OpenAI Blog · 2024-05-08 Cached

OpenAI introduces the Model Spec, a document outlining how its models should behave in ChatGPT and the API, covering objectives, rules, and default behaviors. An updated version was released in February 2025, reinforcing commitments to customizability, transparency, and intellectual freedom while maintaining safety guardrails.

0 favorites 0 likes
#transparency

Moving AI governance forward

OpenAI Blog · 2023-07-21 Cached

OpenAI publishes AI governance recommendations committing companies to internal and external red-teaming for safety risks, information sharing on emerging capabilities, and mechanisms for detecting AI-generated audio and visual content.

0 favorites 0 likes
#transparency

Improving verifiability in AI development

OpenAI Blog · 2020-04-16 Cached

OpenAI publishes a report on mechanisms to improve verifiability in AI development, addressing how stakeholders can verify organizations' claims about AI system properties and safety practices.

0 favorites 0 likes
← Back to home

Submit Feedback