Tag
Awesome Legal Skills is an open-source collection of over 139 AI agent instructions and workflows for the legal industry, covering privacy compliance, contract review, litigation analysis, and more. It is compatible with mainstream AI tools like Claude Code and Codex, helping legal professionals turn their expertise into reusable AI workflows.
LangChain's LangSmith enables developers to use tracing as compliance evidence for the EU AI Act, with customizable evaluators for bias, hallucination, toxicity, accuracy, and adversarial inputs.
The author built a fully offline AI agent using local embedding models, Llama via Ollama, and VectorAI DB to address the risks of cloud-dependent AI. The agent runs on an 8GB MacBook, processes sensitive documents, and maintains memory across sessions.
The EU AI Act mandates that from August onwards, all AI-generated text, images, audio, and video must be watermarked and metadata-tagged, with two layers of machine-detectable identification. This applies to any provider accessible to EU citizens, regardless of location, and includes open-source models, facing fines up to €35 million.
Philipp Comans shared at the Interrupt conference how Chime balances product velocity with compliance by having legal and compliance teams co-write evaluation systems, transforming AI assistant development from an 'oops-driven' approach to a continuous alignment flywheel.
The article highlights the risk of AI agents performing destructive actions like deleting databases and proposes a Runtime Policy Gateway that uses Policy-as-Code to intercept and block non-compliant agent actions in real time, asking if users would adopt such a security tool.
Anthropic is rolling out identity verification on Claude for certain use cases to prevent abuse, enforce usage policies, and comply with legal obligations, using Persona Identities as their verification partner.
The article discusses the upcoming full enforcement of the EU Cyber Resilience Act in 2027, its requirements for software products with digital elements, and argues that it does not spell the end of open source software but rather demands better engineering practices.
Analysis of LLM usage in legal and compliance tasks reveals that models often produce confident but unverifiable citations, raising questions about reliable legal grounding for AI outputs.
The EU AI Act's compliance deadlines begin in 47 days, requiring AI agent builders to adhere to specific transparency, risk management, and documentation rules.
This paper introduces LegalHalluLens, a framework for auditing hallucinations in legal AI, providing typed hallucination profiles and a Risk Direction Index to improve trustworthy deployment.
This paper introduces a bidirectional diagnostic, Compliance Asymmetry, and finds that LLMs exhibit 'directional blindness' in moral judgments: they comply equally to helpful and harmful social nudges, unlike in factual domains where they selectively follow helpful corrections. The phenomenon persists across models and nudge types, highlighting a distinct failure mode in current LLM alignment.
Anthropic suspends its Fable/Mythos projects, with plans to share details in 24 hours. The article questions whether this aggressive global shutdown is proportionate compared to historical export control enforcement precedents.
Recommends two GitHub repos that provide US stock investment guides for Chinese investors, covering basics like account opening, taxes, compliance, and fund transfers, suitable for beginners.
Research reveals that YouTube is hosting and profiting from ads on channels linked to US-sanctioned Iranian entities, including the Islamic Revolutionary Guard Corps, potentially violating sanctions.
The article discusses deploying AI agents in finance while ensuring compliance with NIS2/DORA regulations, focusing on transparency, guardrails, and accountability for potential data breaches.
The article discusses the challenges enterprises face in managing 'shadow AI' — the unauthorized use of AI tools embedded in approved software and browser extensions — and the difficulty of drawing boundaries between sanctioned and unsanctioned AI use.
Trace2Policy extracts human-readable decision rules from expert behavior traces and iteratively refines them via error-driven skill refinement, outperforming pure LLM baselines on compliance-sensitive tasks in logistics.
UK regulators ASA, CAP, and UKGC are deploying an AI system to proactively scan social media for gambling ads targeting under-18s, moving from complaint-based enforcement to near-real-time automated monitoring.
This paper introduces MAC-Bench, a dynamic adversarial benchmark for evaluating procedural compliance in multi-agent systems. It proposes the SERV pipeline to generate contamination-free scenarios and new metrics like Compliance-Weighted Success Rate (CSR) and Machiavellian Gap (MG).