vulnerability-detection

Tag

Cards List
#vulnerability-detection

@FinanceYF5: 2/ First, look at the numbers Codex Security delivered in three months: 【Scanned 30,000 code repositories】 【Scanned over 30 million commits】 【Over 500,000 vulnerabilities fixed】 This is since the research preview went online in March. At this speed, manual work simply cannot keep up.

X AI KOLs Following · 2d ago Cached

Codex Security has scanned 30,000 code repositories, over 30 million commits, and fixed over 500,000 vulnerabilities in three months, demonstrating the efficiency of AI automation.

0 favorites 0 likes
#vulnerability-detection

@FinanceYF5: 1/AI finding vulnerabilities is no longer the bottleneck. The bottleneck now is: found, but no one fixes. OpenAI today launched the Daybreak expansion plan, aiming to automate patching with AI. A thread to explain clearly

X AI KOLs Following · 2d ago Cached

OpenAI launches Daybreak expansion plan, aiming to automate vulnerability fixes with AI, addressing the current bottleneck in security where vulnerabilities are found but no one fixes them.

0 favorites 0 likes
#vulnerability-detection

Anthropic's open-source framework for AI-powered vulnerability discovery

Hacker News Top · 2026-06-04 Cached

Anthropic has released an open-source reference implementation for autonomous vulnerability discovery and remediation using Claude, featuring a full pipeline (recon → find → verify → report → patch) with sandboxing support. It accompanies Claude Security, a hosted product for managing vulnerabilities across codebases.

0 favorites 0 likes
#vulnerability-detection

@_mattata: Anthropic released a pretty clean code auditing harness for identifying bugs with potential security implications. It’s…

X AI KOLs Timeline · 2026-06-04 Cached

Anthropic released an open-source code auditing reference harness for autonomous vulnerability discovery and remediation using Claude, covering a recon→find→triage→report→patch pipeline, primarily targeting C/C++ memory vulnerabilities. It is a template/reference implementation rather than a production-ready product, with a managed hosted option called Claude Security also available.

0 favorites 0 likes
#vulnerability-detection

Anthropic expands Mythos to 150 additional organizations in more than 15 countries

Reddit r/artificial · 2026-06-02 Cached

Anthropic expands access to its Mythos AI cybersecurity model to 150 additional organizations across more than 15 countries under Project Glasswing, including critical infrastructure sectors like power, water, healthcare, and communications.

0 favorites 0 likes
#vulnerability-detection

Astra Autonomous Pentest

Product Hunt · 2026-06-01

Astra Security launches an autonomous pentest product that uses AI agents to find, validate, and fix vulnerabilities automatically.

0 favorites 0 likes
#vulnerability-detection

@CyKorKU: Korea’s #1-ranked hacker on HackerOne is back with a follow-up post! Hyunseo Shin (KU, 4th year) previously shared how …

X AI KOLs Timeline · 2026-06-01 Cached

Hyunseo Shin, Korea's #1 HackerOne hacker, shares a follow-up post detailing his AI-based vulnerability detection workflow using LLM agents to uncover open-source 0-days.

0 favorites 0 likes
#vulnerability-detection

@dani_avila7: NVIDIA built exactly what I needed to secure agent skills https://github.com/nvidia/skillspector… Adding it as a GitHub…

X AI KOLs Timeline · 2026-05-31 Cached

NVIDIA released SkillSpector, an open-source security scanner for AI agent skills that detects vulnerabilities like prompt injection and data exfiltration before installation.

0 favorites 0 likes
#vulnerability-detection

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

arXiv cs.LG · 2026-05-26 Cached

PromptAudit is a controlled evaluation framework that isolates the effects of prompt formulations on LLM-based vulnerability detection, finding that chain-of-thought prompting achieves the best overall performance while prompt sensitivity must be treated as a first-class system property.

0 favorites 0 likes
#vulnerability-detection

@levie: Here’s a key line in this mythos update. This is precisely an example of why engineers don’t go away, ever. We’ve made …

X AI KOLs Following · 2026-05-23 Cached

A commentary highlights that AI's ability to find more security vulnerabilities will increase the need for human engineers to triage and fix them, predicting a security engineer boom.

0 favorites 0 likes
#vulnerability-detection

@AnthropicAI: Patching these vulnerabilities will make us safer. But the software industry will need to adapt to the volume of vulner…

X AI KOLs · 2026-05-22 Cached

Anthropic's Project Glasswing has used Claude Mythos Preview to find over 10,000 high or critical severity vulnerabilities in critical software, with partners like Cloudflare reporting a tenfold increase in bug finding rates, highlighting the shift from discovery to patching as the bottleneck.

0 favorites 0 likes
#vulnerability-detection

Cloudflare just published what they found after running Anthropic's Mythos Preview against 50+ of their own repos and the results are worth reading

Reddit r/artificial · 2026-05-18

Cloudflare shares their experience with Anthropic's Mythos Preview model, which autonomously discovered high-severity vulnerabilities across major OS and web browsers. The model demonstrates senior-level reasoning in chaining exploit primitives but has inconsistent guardrails, highlighting the need for hardened safeguards before public release.

0 favorites 0 likes
#vulnerability-detection

Depthfirst claims that their AI has discovered critical vulnerabilities that Anthropic's Mythos system missed, at just one-tenth the cost of Anthropic's Mythos model.

Reddit r/singularity · 2026-05-16

Cybersecurity startup Depthfirst claims its AI model discovered critical vulnerabilities missed by Anthropic's Mythos system, achieving the same results at one-tenth the cost.

0 favorites 0 likes
#vulnerability-detection

Microsoft's multi-agent AI system tops Anthropic's Mythos on cybersecurity benchmark (3 minute read)

TLDR AI · 2026-05-14

Microsoft's MDASH multi-agent AI system, using over 100 specialized agents, surpasses Anthropic's Mythos on the CyberGym cybersecurity benchmark by effectively finding and confirming real-world software vulnerabilities.

0 favorites 0 likes
#vulnerability-detection

@DailyDoseOfDS_: OpenAI paid $500k for this! > A Kaggle contest to find LLM vulnerabilities DeepTeam does it for free. It implements 20+…

X AI KOLs Timeline · 2026-05-09 Cached

DeepTeam is a free, open-source tool that implements 20+ state-of-the-art attacks to detect over 50 LLM vulnerabilities, including bias and PII leakage, running locally without a dataset.

0 favorites 0 likes
#vulnerability-detection

Behind the Scenes Hardening Firefox with Claude Mythos Preview

Simon Willison's Blog · 2026-05-07 Cached

Mozilla used the Claude Mythos preview to systematically find and fix hundreds of security vulnerabilities in Firefox, dramatically increasing their bug-fix rate from around 20-30 per month to 423 in April 2026.

0 favorites 0 likes
#vulnerability-detection

Hardening Firefox with Claude Mythos Preview

Hacker News Top · 2026-05-07 Cached

Mozilla details how they used Claude Mythos Preview and other AI models to identify and fix a significant number of latent security bugs in Firefox, demonstrating a shift in the efficacy of AI for code hardening.

0 favorites 0 likes
#vulnerability-detection

Quoting Bobby Holley

Simon Willison's Blog · 2026-04-22 Cached

Firefox 150 shipped with 271 security fixes found by Anthropic’s Claude Mythos Preview, marking a major AI-driven win for defensive security.

0 favorites 0 likes
#vulnerability-detection

The zero-days are numbered

Lobsters Hottest · 2026-04-21 Cached

Mozilla used Anthropic's Claude Mythos Preview AI to find and fix 271 zero-day vulnerabilities in Firefox 150, marking a major shift in cybersecurity where AI enables defenders to decisively outpace attackers.

0 favorites 0 likes
#vulnerability-detection

Cybersecurity Looks Like Proof of Work Now

Simon Willison's Blog · 2026-04-14 Cached

The UK's AI Safety Institute's evaluation of Claude Mythos shows that AI-driven security vulnerability detection creates a new economic model where cybersecurity becomes a token-spending competition, incentivizing continuous investment in security reviews and making open-source libraries more valuable as shared security infrastructure.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback