Factory's Droid Now Catches Real CVE-Level Security Bugs in Every PR

EDITORIAL LEADERBOARD

Factory

6H AGO

2 min read

6 hrs ago

2 min read

Security bugs are sneaky. They rarely look alarming in a diff -- a missing authorization check here, a user-controlled string crossing a trust boundary there. By the time a human reviewer notices, the code is already merged. Factory's Droid is now trying to close that gap with Automated Security Review: a dedicated security pass that runs on every non-draft pull request, automatically, alongside the standard code review.

The feature is live today and available on all plans. No configuration required to get started -- it activates the moment a PR is opened.

Not just another linter

The core of this system is a structured threat-modeling framework called STRIDE -- short for Spoofing, Tampering, Repudiation, Information Disclosure, Denial of Service, and Elevation of Privilege. Rather than pattern-matching for known bad strings the way a traditional static analyzer would, Droid runs a security-focused review using STRIDE methodology along with OWASP Top 10 and OWASP LLM Top 10 checks -- the latter covering AI-specific attack surfaces like prompt injection and insecure model output handling.

What makes this more than a fancy grep is the two-pass validation pipeline. It enables a two-pass security workflow that traces untrusted input across trust boundaries, validates exploitability, and reports only findings with a realistic path to impact. Candidate vulnerabilities are re-checked for reachability and existing controls before anything gets posted, which is how the system avoids drowning engineers in false positives.

GitHub PR comment from factory-droid bot showing a P0 command injection finding via execSync with user-controlled filenames

When something is caught, each finding includes a severity level, CWE reference (where applicable), an explanation, and a suggested fix posted as inline review comments. Severity is tagged P0 through P3:

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

Takeaways

Not just another linter

Don't miss what's next in AI