Anthropic and OpenAI have unveiled free reasoning-based vulnerability scanners, Claude Code Security and Codex Security, challenging traditional static application security testing (SAST) tools. These tools, utilizing large language models (LLMs), have exposed entire classes of vulnerabilities that older pattern-matching methods missed. This development signifies a shift in the enterprise security landscape, with both companies pushing the boundaries of what vulnerability detection can achieve.
### The Companies and Their Products
Anthropic released Claude Code Security as part of its Claude Opus 4.6 on February 20, following its zero-day research findings on February 5. The tool identified over 500 high-severity vulnerabilities in open-source codebases, including a heap buffer overflow in the CGIF library. These vulnerabilities had previously eluded detection despite extensive expert review and fuzzing.
OpenAI launched Codex Security on March 6, evolving from its internal tool, Aardvark. During its beta phase, Codex Security scanned over 1.2 million commits, uncovering 792 critical findings and 10,561 high-severity vulnerabilities across various repositories. The tool’s false positive rates significantly decreased during testing, highlighting its effectiveness.
### Context and Competition
The simultaneous release of these tools by Anthropic and OpenAI, valued at over $1.1 trillion combined, underscores the competitive pressure in the market. Both companies aim to enhance detection capabilities beyond what any single vendor can achieve. Despite their advancements, neither tool replaces existing security stacks but instead complements them, altering procurement strategies.
Checkmarx Zero researchers noted limitations in Claude Code Security’s detection capabilities, indicating that moderately complex vulnerabilities might evade its scans. Both Anthropic and OpenAI have yet to submit their detection claims for independent third-party audits, urging caution in interpreting their results.
### Market Implications
The introduction of these free tools is reshaping the application security market. Merritt Baer, CSO at Enkrypt AI, emphasizes the need for security teams to prioritize patches based on exploitability rather than CVSS scores. The rapid advancements in vulnerability detection compress the window between discovery and exploitation, challenging traditional vulnerability management practices.
Snyk, a developer security platform, acknowledges the technical breakthrough but highlights the ongoing challenge of fixing vulnerabilities at scale. The rise of AI-generated code, which is more prone to security flaws, adds another layer of complexity. Cycode CTO Ronen Slavin stresses that AI models, while innovative, require consistent and reproducible results, which traditional SAST platforms provide.
As Anthropic and OpenAI head toward potential IPOs, the race to improve vulnerability detection continues. Both companies are expected to update their models monthly, keeping the competitive cycle dynamic. This rapid development pace means that enterprises must adapt quickly to leverage these tools effectively.
The unveiling of Claude Code Security and Codex Security marks a pivotal moment in application security, pushing the boundaries of vulnerability detection. As these tools evolve, the industry must remain vigilant, adjusting strategies to mitigate risks in an ever-changing security landscape.




















