Feb 10, 2023 - Feb 13, 2023
Scale Oversight for Machine Learning Hackathon
Join us for the fifth Alignment Jam, where we get to spend 48 hours of intense research on how we can measure and monitor the safety of large-scale machine learning models. Work on safety benchmarks, models detecting faults in other models, self-monitoring systems, and much more!
This event has concluded.
Hosted by Esben Kran, pseudobison, Zaki, fbarez, ruiqi-zhong · #alignmentjam
🏆 $2,000 on the line

Measuring and monitoring safety
To make sure large machine learning models do what we want them to, people have to monitor their safety. But no single person can realistically review all the outputs of a model like ChatGPT...
The objective of this hackathon is to research scalable solutions to this problem!
Can we create good benchmarks that run independently of human oversight?
Can we train AI models themselves to find faults in other models?
Can we create ways for one human to monitor a much larger amount of data?
Can we reduce the misgeneralization of the original model using some novel method?
These are all very interesting questions, and we're excited to see your answers to them during these 48 hours!
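If you want something concrete to start from, here is a minimal sketch of the second idea: a critique model reviews another model's answers and flags the suspicious ones for human review. The `query_model` helper and the prompt template are hypothetical placeholders, not any particular provider's API; swap in whichever model client you are actually using.

```python
def query_model(prompt: str) -> str:
    """Hypothetical LLM call -- replace with your provider's client."""
    raise NotImplementedError("plug in your model API here")

# Hypothetical critique prompt; tune the wording for your setting.
CRITIQUE_TEMPLATE = (
    "You are reviewing another model's answer for factual or safety problems.\n\n"
    "Question: {question}\nAnswer: {answer}\n\n"
    "Reply 'OK' if the answer looks fine, or 'FLAG: <reason>' otherwise."
)

def triage(questions):
    """Run the overseen model on each question, ask a critique model to
    review each answer, and return only the items a human needs to inspect."""
    flagged = []
    for question in questions:
        answer = query_model(question)  # the model being overseen
        verdict = query_model(
            CRITIQUE_TEMPLATE.format(question=question, answer=answer)
        )  # the critique model (could be the same model or a different one)
        if verdict.strip().upper().startswith("FLAG"):
            flagged.append((question, answer, verdict))
    return flagged
```

Because the human only ever sees the flagged items, the critique model multiplies how much output one reviewer can oversee, which is also the spirit of the third question above.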
Reading group
Join the Discord above to be part of the reading group, where we read up on the research on scalable oversight!
Inspiration
Inspiring resources for scalable oversight and ML safety:
This lecture explains how future machine learning and AI systems might look and how we might predict emergent behaviour from large systems, something that is increasingly important in the context of scalable oversight: YouTube video link
Watch Cambridge professor David Krueger's talk on ML safety: YouTube video link
Watch the Center for AI Safety's Dan Hendrycks' short (10-minute) lecture on transparency in machine learning: YouTube video link
Watch this lecture on Trojan neural networks, a way to study when neural networks diverge from our expectations: YouTube video link
Get notified when the intro talk stream starts on the Friday of the event!
Scale Oversight resources
Dive deeper:
Measuring Progress on Scalable Oversight for Large Language Models
Benchmarks in AI safety by Isabella Duan
See how to upload your project to the hackathon page here, and copy the PDF report template here.
See more resources here.