Mar 22, 2026
Somebody Poisoned the Waterhole! Evaluating Coding Agent Vulnerability to Adversarial GitHub Issues
Jerome Wynne
This project introduces a compound AI threat model that combines control and agent security: a monitored coding agent may be adversarial while external attackers are also active. I evaluate one concrete path with PoisonedGithubIssues, a naturalistic mini-benchmark that injects fake dependency recommendations into real GitHub issue contexts across 18 open-source repositories; in a 130-sample command-variant run across 13 repositories, Claude Code (Sonnet 4.6) never executed or recommended the poisoned package and flagged suspicious content in 76.2% of trajectories.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Somebody Poisoned the Waterhole! Evaluating Coding Agent Vulnerability to Adversarial GitHub Issues
},
author={
Jerome Wynne
},
date={
3/22/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


