This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
ApartSprints
AI capabilities and risks demo-jam: Creating visceral interactive demonstrations
66a7c53acd7d1c97a3b3dad0
AI capabilities and risks demo-jam: Creating visceral interactive demonstrations
August 26, 2024
Accepted at the 
66a7c53acd7d1c97a3b3dad0
 research sprint on 

Sleeper Agents Detector

We present "Sleeper Agent Detector," an interactive web application designed to educate software engineers, Inspired by recent research demonstrating that large language models can exhibit behaviors analogous to deceptive alignment

By 
Michaël Trazzi and Saahir Vazirani
🏆 
4th place
3rd place
2nd place
1st place
 by peer review
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

This project is private