Join our groundbreaking hackathon to create tests that identify when AI can copy itself - a critical safety risk. Your innovative tasks could shape global AI standards. Make your mark, potentially collaborate with a prominent AI safety lab, and win big. Sign up now for an impactful weekend!
Imagine an AI that can match human intelligence and perform tasks across the board. But there's a catch: if such an AI gets into the wrong hands, it could spell disaster, especially if it can replicate itself and defy control measures. That's where we step in.
In this hackathon, we develop the tests that will be used to evaluate today's best LLMs for autonomous replication (the ability to copy itself). During the 54 hours the hackathon runs (Friday to Sunday), we will develop tasks to test AI systems on that are:
If you're ready to test the limits of AI, sign up above and read on!
Welcome! During this exciting community weekend, you'll use the great starter resources to develop a new task implementation from your own or existing ideas. Using the process described in the "More Resources" tab, we'll be creating high-quality, error-free, and reliable tasks for agents by following these steps:
We also want to make sure data does not go into the training set for models such as ChatGPT, Claude, and Gemini. Therefore:
As you go through the guide, we highly recommend you read the resources presented in the "More Resources" tab! METR's task standard resources are high quality and provide good context for your evaluation work.
The important part of this hackathon is the implementation, and to ensure this goes well, it's super helpful to have your ideas ready before kickoff! Check out the existing ideas, which you are welcome to work on directly, on this live Airtable, or write your own ideas and submit them here.
On Friday the 22nd of March, Beth Barnes, CEO & Lead Researcher at METR, will take the stage to introduce you to the weekend's task and give an introduction to the field. This talk will be livestreamed and recorded so you can join over the weekend from anywhere in the world! If you have a location with some friends, go to "Hackathon sites" and register your location.
Under the "More Resources" tab, you will find all the resources you need to get set up. This will include a full starter package of code and instructions to get you started!
Remember that you can get feedback directly from the team that evaluates your work during the weekend's office hours and talk Q&As. See more under the "Schedule" tab.
Go to the "Submission" tab and send in your task files!
We invite established professionals, AI researchers, cybersecurity professionals, students, writers, cognitive scientists, and many more to join! Task quality will only increase as you get to converse with a diverse crowd of professionals.
The prizes are given to any task that fulfills the core criteria. The prize will be:
For this hackathon, the criteria are even more important than usual. These tasks might be used by frontier AI labs and government agencies!
Core criteria:
Bonus Criteria:
If you are part of a local machine learning or AI safety group, we invite you to set up a local in-person site for this hackathon! We will have several across the world, and you can easily sign up under the “Hackathon sites” tab above, where you will also find templates to share with your friends on social media.
Copy and follow the Hackbook to make your submission! It contains all the information and steps you need. You can follow the walkthrough below to get a sense of which steps are involved!
Thanks go to Bart Bussmann for helping out with the Hackbook development.
This guide provides an in-depth look into how you can develop a new task for the task standard. We recommend going through it!
A list of task ideas we would be excited to see implemented. Crucially, you are very welcome to simply take these ideas, implement them, and collect your bounty! There's no need to be super original in this hackathon, but if you decide to make your own task idea, go through this list anyway to get inspired and understand what types of tasks are relevant (or have already been proposed).
METR's official resources for autonomy evaluations. The resources include their interesting initial research report and the example protocol for evaluating autonomy-related risks.
Here you can find some of METR's many great resources about the task standard:
The schedule is written in UTC. To get the schedule corresponding to your own time zone, please subscribe to the calendar.
We're going to be hacking over the weekend in the EA Denmark office in Copenhagen!
Join us in central Lausanne; register on the lu.ma page to see the full address.
An AI Safety hub in Toronto, organized to supplement the local AI Safety meetups.
Koperníkova 6, Prague
Join our local jam site in the LEAH Coworking Space located near Farringdon.
In collaboration with Apart Research and METR, the initiative for AI safety Amsterdam is going to host a hackathon on March 22-24 at Science Park.
Join us in Vietnam for a local version of the Code Red Hackathon.
Join our Hackathon Site for the "Code Red: Autonomous AI Threat Hackathon" this weekend! Dive into AI innovation, where we'll challenge, create, and test tasks designed to push the limits of Large Language Models (LLMs).
🚀 Why Join?
📅 When & Where:
💡 Participation Highlights:
🔗 Join Us: Whether you're an AI guru or an enthusiastic beginner, this hackathon site is the perfect place to contribute to the future of safe AI. Sign up, bring your laptop, and let's innovate together!
🏆 Prizes: Your efforts could earn you bounties for contributing to safer AI development.
Ready to make a difference? Mark your calendar and invite your network to join us at [Your Hackathon Site Name]. Let’s shape the future of AI, together!
The task ideas and specifications need to be submitted to the form below.
If the above form does not work, go to the direct link here.
The final task implementation will be a .zip file with your full implementation uploaded in the following form.
If the above form does not work, go to the direct link here.
We are excited to welcome you in 3 hours when Beth Barnes, CEO & Research Lead at METR, joins us to inspire you with the latest research from their team. METR is one of the foremost independent labs figuring out whether cutting-edge AI systems might pose catastrophic risks to society and we're certain her talk will be insightful! Esben will also go over the logistics for this weekend.
Together with everyone joining us online, we're also happy to welcome all of our local hackers in Prague, Toronto, London, Copenhagen, Switzerland, Vietnam, and Amsterdam at their hackathon locations. Welcome!
While you are getting ready, let's get an overview of the weekend:
Project schedule
Events
Today we'll have the keynote, while on Saturday and Sunday you'll have the unique opportunity to discuss your task projects with METR and Apart researchers during office hours. On Thursday, we have project presentations from participants.
We really look forward to welcoming you and helping you craft a successful project.
Good luck, research hackers!
We're excited to welcome you for the Code Red Hackathon this coming weekend! This email is a short overview of the latest resources and what you need to do to get ready for the weekend.
First off, the home page has received a makeover and will link you in all the right directions! The gold, however, is to be found in the “More Resources” tab! Specifically:
To make your weekend productive and exciting, we recommend that you take the following three steps before we kick off:
For more information about the prizes you can expect this weekend, jump to the overview section. TL;DR:
You will undoubtedly have questions that you need answered. Remember that the #questions channel is always available and that the organizers will be available there.
We really look forward to this weekend and we're excited to welcome you with Beth Barnes on Friday on Discord and YouTube!
Remember that there are no dumb questions and that we're all here to help each other succeed in making a positive difference for AI safety, so don't hesitate to reach out.
See you there, research hackers!