Sep 15, 2024
-
Sep 15, 2024
ARENA 4.0 Interpretability Hackathon
Hackathon for working on interpretability projects during ARENA v4. These could be training or interpreting SAEs, finding circuits in GPT2-Small, training & interpreting toy transformer models on algorithmic tasks, building visualization software / infra for interpretability research, or anything else you can think of!
Hackathon for working on interpretability projects during ARENA v4. These could be training or interpreting SAEs, finding circuits in GPT2-Small, training & interpreting toy transformer models on algorithmic tasks, building visualization software / infra for interpretability research, or anything else you can think of!
This event is ongoing.
This event has concluded.
Our Other Sprints
May 30, 2025
-
Jun 1, 2025
Apart x Martian Mechanistic Interpretability Hackathon
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible
Apr 25, 2025
-
Apr 27, 2025
Economics of Transformative AI
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible