Join us for a groundbreaking exploration of AI model internals, where we'll dive deep into mechanistic interpretability and feature manipulation. In partnership with Goodfire, we're bringing you unprecedented access to state-of-the-art tools for understanding and steering AI behavior.
Whether you're an AI researcher, a curious developer, or passionate about making AI systems more transparent and controllable, this hackathon is for you. As a participant, you will:
Register now and be part of the movement towards more transparent, reliable, and beneficial AI systems. We provide access to Goodfire's SDK/API and research preview playground, enabling participation regardless of prior experience with AI observability.
As AI models become more powerful and widespread, understanding their internal mechanisms isn't just academic curiosity—it's crucial for building reliable, controllable AI systems. Mechanistic interpretability gives us the tools to peek inside these "black boxes" and understand how they actually work, neuron by neuron and feature by feature.
While participants are welcome to use their existing setups, Goodfire's API brings exceptional value to this hackathon as a primary option for participants.
Goodfire provides:
The hackathon serves as a unique opportunity for Goodfire to gather valuable feedback from the developer community on their API/SDK. To ensure all participants can pursue ambitious research projects without constraints, Goodfire is providing free compute credits to every team.
"I learned so much about AI Safety and Computational Mechanics. It is a field I have never heard of, and it combines two of my interests - AI and Physics. Through the hackathons, I gained valuable connections and learned a lot from researchers with extensive experience." - Doroteya Stoyanova, Computer Vision Intern
To ensure you're well-equipped for the Reprogramming AI Models Hackathon, we've compiled a set of resources to support your participation:
Here is the schedule for the Hackathon:
We start with an introductory talk and end the event during the following week with an awards ceremony. Join the public ICal here. You will also find Explorer events, such as collaborative brainstorming and team match-making before the hackathon begins on Discord and in the calendar.
Join us for the Hackathon in Hereplein 4, 9711GA, Groningen!
Join us for the Hackathon in Hereplein 4, 9711GA, Groningen!
This is a collaboration between Warwick AI and Warwick Effective Altruism. We will be hosting groups that wish to participate in the hackathon for the weekend.
This is a collaboration between Warwick AI and Warwick Effective Altruism. We will be hosting groups that wish to participate in the hackathon for the weekend.
Cambridge hub for hosting the Reprogramming AI hackathon. Office available with monitors and snacks!
Cambridge hub for hosting the Reprogramming AI hackathon. Office available with monitors and snacks!
A local hub for the hackathon on the EPFL campus (luma coming soon). We will provide a room, snacks and drinks for the participants.
A local hub for the hackathon on the EPFL campus (luma coming soon). We will provide a room, snacks and drinks for the participants.
Each team should submit a research paper that includes:
Additionally, teams should provide:
Submissions will be judged based on the following criteria:
A: The hackathon runs from Friday, 22nd November, through Monday, 25th November(3 AM UTC). There will be two brainstorming sessions (Tuesday and Thursday), with API access beig granted starting Thursday morning.
A: Teams should be 2-4 people, with a maximum of 5. While individual submissions are allowed, team participation is encouraged for better project outcomes.
A:
A: The SDK provides:
A:
A:
A: The SDK provides several methods:
A: Yes, you can perform feature interventions to modify model behavior, including conditional interventions.
A: Yes, Goodfire's SAE features are proprietary but may be open-sourced in the future.
A:
A:
A: No, features are pre-discovered, but you can:
A:
A:
A:
A:
A:
A:
A: You can:
A: Submit a request through the provided form with justification, and the team will review it case by case.
A: Yes, Goodfire provides several example notebooks, including:
A: While immediate changes during the hackathon may not be possible, Goodfire welcomes all feedback and feature requests for future improvements.
A:
A:
Here are entries for the hackathon