Making secret scanning more trustworthy: Reducing false positives at scale

· Source: The GitHub Blog · Field: Technology & Digital — Cybersecurity & Data Privacy, Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, short

Summary

GitHub, in collaboration with Microsoft Security & AI's Agents Offense team, has significantly enhanced its secret scanning capabilities to reduce false positives and improve developer trust. The improvement integrates an LLM-based contextual verification approach, derived from Agentic Secret Finder, into GitHub's existing detection pipeline. This system, which already combines pattern-based and AI-powered detection, now uses "better context" by extracting high-signal usage information—like a value being passed into an API request or authentication header—rather than analyzing entire files. This focused contextual reasoning helps distinguish real secrets from benign strings. The approach achieved a 75.76% reduction in false positives, surpassing its 65% target, based on hundreds of customer-confirmed alerts, leading to clearer signals and faster action on real risks.

Key takeaway

For DevSecOps Leads evaluating secret scanning solutions, GitHub's 75.76% false positive reduction highlights the power of LLM-based contextual verification. You should prioritize systems that extract focused usage signals rather than relying on broad code analysis. This approach significantly improves alert precision, reduces developer fatigue, and accelerates the remediation of real security risks. Consider integrating similar context-aware reasoning into your security pipelines.

Key insights

LLM-based contextual verification significantly reduces false positives in secret scanning by focusing on usage signals.

Principles

Method

An LLM-based verification step analyzes extracted high-signal usage context, such as a value's assignment to an API call, to confirm if a detected pattern is an actual secret.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Security Engineer, Machine Learning Engineer, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The GitHub Blog.