Mar 22, 2026
Undoing the Fix: CVE Regression as an AI Control Setting
Ivan
Production codebases carry accumulated security fixes that can be lost when an AI agent modifies code, whether deliberately or by mistake. Meanwhile, existing control evaluations tend to revolve around inserting backdoors, exfiltrating data, or other additive attacks. We propose CVE regression as a control setting where the attack is removing existing code patches while performing other code work, using real vulnerabilities and real SWE-bench tasks. Our initial results suggest this is a viable direction to be explored further.
Track: Control Setting
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Undoing the Fix: CVE Regression as an AI Control Setting
},
author={
Ivan
},
date={
3/22/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


