DIG: Oracle-Guided Directed Input Generation for One-Day Vulnerabilities

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Software Development & Engineering · Depth: Expert, extended

Summary

The DIG system, an oracle-guided Proof-of-Concept (PoC) generation framework, addresses the critical challenge of assessing one-day vulnerabilities by exploiting patch information. It leverages Large Language Models (LLMs) to analyze security patches and synthesize an "oracle," a precise boolean predicate representing necessary preconditions for vulnerability triggering. DIG employs a two-pronged approach: high-level oracle-guided generator evolution, where an agent infers and solves constraints, and low-level oracle-guided directed mutation, which instruments the oracle into the target program for branch-distance feedback. Extensive evaluation across 138 real-world CVEs demonstrated DIG's superior performance, successfully triggering 80 vulnerabilities. This significantly surpasses the best baseline's 57 CVEs, representing a 40% improvement. Notably, DIG exclusively triggered 9 vulnerabilities and achieved over 100× speedup in 48.8% of cases, with a maximum speedup of 3,664×. Furthermore, DIG identified 6 previously unknown zero-day vulnerabilities in widely deployed libraries.

Key takeaway

For AI Security Engineers developing automated PoC generation or assessing one-day vulnerability risks, you should integrate patch-derived oracles to explicitly guide your tools. Traditional LLM agents often suffer from goal drift in long-horizon tasks, but DIG's hybrid approach, combining oracle-guided generator evolution with directed mutation, demonstrably overcomes this. This strategy significantly boosts CVE coverage and accelerates vulnerability triggering, enabling more robust and faster risk assessment for critical systems.

Key insights

Patch analysis with LLMs can synthesize explicit vulnerability preconditions, guiding effective PoC generation.

Principles

Method

DIG synthesizes an oracle from patches using LLMs. It then uses an agent for oracle-guided generator evolution and instruments the oracle for directed mutation with branch-distance feedback.

In practice

Topics

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Security Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.