The hackathon is happening right now! Join by signing up below and be a part of our community server.
Apart > Sprints

Reprogramming AI Models Hackathon

--
No items found.
Signups
--
Entries
November 22, 2024 4:00 PM
 to
November 25, 2024 3:00 AM
 (UTC)
Hackathon starts in
--
Days
--
Hours
--
Minutes
--
Seconds
Sign upSign up
This event is finished. It occurred between 
November 22, 2024
 and 
November 25, 2024

Project ideas:

  • Feature phenomenology
    • Discovering useful feature interventions
    • Measuring feature intervention quality (when do feature interventions work well vs. fail)
    • Understand how feature weight plays a role in intervention behavior
  • Novel research
    • Investigating auto-interp improvements
    • Exploring jail breaking and elicitation techniques
  • Interface
    • Create interesting visualizations built off Goodfire SAE’s (e.g., feature map visualizations)

Resources

Hackathon participants will have access to:

  • Goodfire’s SDK/ API with hosted inference
  • Goodfire’s research preview playground
  • TO DISCUSS: resources for general compute / openai or anthropic api calls (how much?)

Speakers & Collaborators

Tom McGrath

Chief Scientist at Goodfire, previously Senior Research Scientist at Google DeepMind, where he co-founded the interpretability team
Organiser

Dan Balsam

CTO at Goodfire, previously Founding Engineer and Head of AI at RippleMatch. Goodfire makes Interpretability products for safe and reliable generative AI models.
Organiser

Archana Vaidheeswaran

Archana is responsible for organizing the Apart Sprints, research hackathons to solve the most important questions in AI safety.
Organizer

📍 Registered jam sites

Beside the remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in-person and connect with others in your area.
Register the first event below!

🏠 Register a location

The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community. Read more about organizing.
Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
Thank you! Your submission has been received! Your event will show up on this page.
Oops! Something went wrong while submitting the form.

📣 Social media images and text snippets

No media added yet
No text snippets added yet

Submission Requirements

Each team should submit a research paper that includes:

  1. Project title and team members
  2. Executive summary (max 250 words)
  3. Introduction and problem statement
  4. Methodology and approach
  5. Results and analysis
  6. Discussion of implications for AI interpretability
  7. Conclusion and future work
  8. References

Additionally, teams should provide:

  • A link to their code repository (e.g., GitHub)
  • Any demo materials or visualizations (if applicable)

Evaluation Criteria

Submissions will be judged based on the following criteria:

  1. Interpretability Advancement
    • Does the project contribute to the field of AI interpretability?
    • Does it provide new insights into understanding or steering AI model behavior?
    • How well does it align with the hackathon's focus on reprogramming AI models?
  2. Research Quality
    • How original and innovative is the approach?
    • Does it present novel ideas or combine existing techniques in unique ways?
    • Do we expect the results to generalize beyond the specific case(s) presented in the submission?
  3. Technical Implementation
    • How well is the project executed from a technical standpoint?
    • Is the code well-structured, documented, and reproducible?
    • How effectively does it utilize Goodfire's SDK/API and other provided resources?
  4. Presentation and Communication
    • How clearly and effectively is the research presented in the paper?
    • Quality of visualizations and demos (if applicable)
    • Clarity of methodology explanation and results interpretation

Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
Uploading...
fileuploaded.jpg
Upload failed. Max size for files is 10 MB.
You have successfully submitted! You should receive an email and your project should appear here. If not, contact operations@apartresearch.com.
Oops! Something went wrong while submitting the form.
No projects submitted yet! Add your project information in the form. We usually see projects submitted quite close to the deadline.