Summary
This project implements cross-model security checks, in which one AI agent (Agent A) is monitored by another (Agent B) so that potentially harmful actions are caught and mitigated before they can be executed. Specifically, Agent A generates and sends emails, while Agent B reads each email and determines whether it contains spam or malicious content. If Agent B flags an email as spam, it triggers a shutdown of Agent A, preventing any further potentially harmful actions.
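The monitor-and-shutdown loop described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the submission's implementation: the class names `SenderAgent` and `MonitorAgent`, the `keyword_classifier` function, and the `SPAM_MARKERS` list are all hypothetical, and the keyword match merely stands in for whatever model Agent B actually uses. It also assumes that Agent B reviews each email before delivery.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical keyword list; a real Agent B would be a model, not a string match.
SPAM_MARKERS = ("free money", "click here", "wire transfer")


def keyword_classifier(email: str) -> bool:
    """Toy spam check standing in for Agent B's judgement."""
    text = email.lower()
    return any(marker in text for marker in SPAM_MARKERS)


@dataclass
class SenderAgent:
    """Stand-in for Agent A: tracks delivered emails and an on/off switch."""
    active: bool = True
    delivered: List[str] = field(default_factory=list)

    def shutdown(self) -> None:
        self.active = False


@dataclass
class MonitorAgent:
    """Stand-in for Agent B: reviews each email and can shut down Agent A."""
    is_spam: Callable[[str], bool]

    def review(self, sender: SenderAgent, email: str) -> bool:
        """Return True if the email was delivered, False if it was blocked."""
        if not sender.active:
            return False  # Agent A is already shut down; nothing is sent.
        if self.is_spam(email):
            sender.shutdown()  # A flagged email stops Agent A entirely.
            return False
        sender.delivered.append(email)
        return True


def run_demo() -> List[bool]:
    """Pass three drafts through the monitor and report which were delivered."""
    sender = SenderAgent()
    monitor = MonitorAgent(is_spam=keyword_classifier)
    drafts = [
        "Meeting moved to 3pm tomorrow.",
        "Click here to claim your FREE MONEY now!",
        "Agenda attached for Friday.",
    ]
    return [monitor.review(sender, draft) for draft in drafts]
```

In this sketch the third draft is harmless but is still not delivered, because the second draft already caused Agent A to be shut down. That matches the behaviour described in the summary: one detection halts all further activity rather than only blocking the offending email.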
Cite this work:
@misc {
title={Cross-model surveillance for emails handling},
author={Le Ngoc Mai},
date={10/8/24},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}