AI Era Vulnerability Benchmark Revamp: 3,632 CVE Insights & VulnGym Release
Analyzing 3,632 high‑severity GitHub Advisory reports from 2025‑2026, the authors reveal a sharp rise in business‑logic flaws—especially in high‑star projects—prompting a redesign of vulnerability‑detection benchmarks, and introduce VulnGym, a real‑project, white‑box dataset with 400+ paths and detailed entry‑point, trace, and critical‑operation annotations.
