Sep 15, 2024
-
Sep 15, 2024
ARENA 4.0 Interpretability Hackathon
Hackathon for working on interpretability projects during ARENA v4. These could be training or interpreting SAEs, finding circuits in GPT2-Small, training & interpreting toy transformer models on algorithmic tasks, building visualization software / infra for interpretability research, or anything else you can think of!
This event is ongoing.
This event has concluded.
Organized by ARENA
Hackathon for working on interpretability projects during ARENA v4. These could be training or interpreting SAEs, finding circuits in GPT2-Small, training & interpreting toy transformer models on algorithmic tasks, building visualization software / infra for interpretability research, or anything else you can think of!
This event ran on the 15th of September 2024
Entries
Our Other Sprints
Apr 25, 2025
-
Apr 27, 2025
Economics of Transformative AI: Research Sprint
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible
Apr 25, 2025
-
Apr 26, 2025
Berkeley AI Policy Hackathon
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible