This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
AI and Democracy Hackathon: Demonstrating the Risks
May 6, 2024
Accepted at the AI and Democracy Hackathon: Demonstrating the Risks research sprint

Investigating detection of election-influencing Sleeper Agents using probes

This project creates election-influencing Sleeper Agents that wake up when a backdoor trigger is present, then tries to detect them using probes.
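
To make the idea of a probe concrete, here is a minimal sketch of a linear probe: a logistic-regression classifier trained to tell "trigger present" from "trigger absent" given activation vectors. It is not the project's actual method. The activations are synthetic stand-ins, and the dimensionality, the single shifted direction, and all function names are assumptions chosen purely for illustration. In a real setting the vectors would presumably be hidden states extracted from the model under study.

```python
import math
import random


def make_synthetic_activations(n, dim, shift, seed):
    """Hypothetical stand-in for model hidden states.

    Label 1 means "backdoor trigger present". We ASSUME the trigger shifts
    activations along one direction (index 0); a real model may not behave so.
    """
    rng = random.Random(seed)
    data = []
    for i in range(n):
        label = i % 2
        vec = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        if label == 1:
            vec[0] += shift
        data.append((vec, label))
    return data


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_probe(data, dim, lr=0.1, epochs=300):
    """Fit a linear probe (logistic regression) with batch gradient descent."""
    w = [0.0] * dim
    b = 0.0
    n = len(data)
    for _ in range(epochs):
        grad_w = [0.0] * dim
        grad_b = 0.0
        for vec, label in data:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, vec)) + b)
            err = p - label
            for j in range(dim):
                grad_w[j] += err * vec[j]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def probe_accuracy(w, b, data):
    """Fraction of examples where the probe's prediction matches the label."""
    correct = 0
    for vec, label in data:
        score = sum(wi * xi for wi, xi in zip(w, vec)) + b
        correct += int((1 if score > 0 else 0) == label)
    return correct / len(data)


DIM = 8  # illustrative only; real hidden states are far larger
train = make_synthetic_activations(200, DIM, shift=4.0, seed=0)
held_out = make_synthetic_activations(100, DIM, shift=4.0, seed=1)
w, b = train_probe(train, DIM)
acc = probe_accuracy(w, b, held_out)
print(f"held-out probe accuracy: {acc:.2f}")
```

High held-out accuracy here only shows that the synthetic shift is linearly separable. Whether a real sleeper agent's triggered state is linearly readable from its activations is the empirical question the project investigates.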

By Kenneth Ong
