Sep 29, 2022

-

Oct 1, 2022

Online & In-Person

Language Model Hackathon

Alignment Jam #1

00:00:00:00

00:00:00:00

00:00:00:00

00:00:00:00

This event is ongoing.

This event has concluded.

Overview

Overview

Arrow
Arrow
Arrow

Join this AI safety hackathon to compete in uncovering novel aspects of how language models work! This follows the "black box interpretability" agenda of Buck Shlegeris: 

Interpretability research is sometimes described as neuroscience for ML models. Neuroscience is one approach to understanding how human brains work. But empirical psychology research is another approach. I think more people should engage in the analogous activity for language models: trying to figure out how they work just by looking at their behavior, rather than trying to understand their internals. Read more.

Resources

Resources

Arrow
Arrow
Arrow

InspirationSee a full list of inspiration, code, and data for the weekend here.AI Safety Ideas project ideasLanguage models are few-shot learners (NeurIPS)TruthfulQA - Stephanie Lin, Jacob Hilton, Owain Evans (ArXiv)Chain of thought prompting (ArXiv) Apart Research's cognitive bias testing for the inverse scaling prize - Esben Kran, Jonathan RystrømEpistemic biases in LLMs - Siméon CamposThe inverse scaling Google Colab - see the left side file view to experiment with datasets (Inverse Scaling prize)


Schedule

Schedule

Arrow
Arrow
Arrow

The schedule makes space for 46 hours of research jamming. You can decide your commitment level during the jam with your teammates but we encourage you to remember to sleep and eat.

Entries

Check back later to see entries to this event

Registered Jam Sites

Register A Location

Beside the remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in-person and connect with others in your area.

The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community.

We haven't announced jam sites yet

Check back later

Registered Jam Sites

Register A Location

Beside the remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in-person and connect with others in your area.

The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community.

We haven't announced jam sites yet

Check back later