Apr 14, 2023
-
Apr 17, 2023
Online & In-Person
The Interpretability Hackathon 2.0




We focus on interpreting the innards of neural networks using methods developed from mechanistic interpretability
This event has concluded.
32
Sign Ups
Overview

Hosted by Zaki, Apart Research, Esben Kran, StefanHex, calcan, Neel Nanda · #alignmentjam
Join this AI safety hackathon to find new perspectives on the "brains" of AI!
48 hours of intense research in interpretability, the modern neuroscience of AI.
We focus on interpreting the innards of neural networks using methods from mechanistic interpretability! With the starter Colab code, you should be able to get technical insight into the process quickly.

We provide you with the best starter templates that you can work from so you can focus on creating interesting research instead of browsing Stack Overflow. You're also very welcome to check out some of the ideas already posted!
How to participate
Create an account on the itch.io (this) website and click participate. We will assume that you are going to participate, so please cancel if you won't be part of the hackathon.
Instructions
You will work on research ideas you generate during the hackathon and you can find more inspiration below.
Submission
Everyone will help rate the submissions together on a set of criteria that we ask everyone to follow. This will happen over a 2-week period and will be open to everyone!
Speakers & Collaborators
Neel Nanda
Speaker & Judge
Team lead for the mechanistic interpretability team at Google DeepMind and a prolific advocate for open-source interpretability research.
Registered Jam Sites
Register A Location
Besides remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in person and connect with others in your area.
The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community.
Our Other Sprints
May 30, 2025
-
Jun 1, 2025
Research
Apart x Martian Mechanistic Interpretability Hackathon
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible
Sign Up
Apr 25, 2025
-
Apr 27, 2025
Research
Economics of Transformative AI
This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible
Sign Up

Sign up to stay updated on the
latest news, research, and events