Jan 9, 2026

Jan 11, 2026

Remote

AI Manipulation Hackathon

This hackathon brings together builders to create tools that measure, detect, and mitigate AI manipulation

00:00:00:00

Days To Go

520

Sign Ups

Entries

Overview

Resources

Guidelines

Schedule

Entries

Overview

AI Manipulation HACKATHON WINNERS

Huge congratulations to all our winners! With 500+ participants and 70+ projects submitted, the competition was fierce and the projects were outstanding. Here's who came out on top:

1st Place ($1000):
- Who Does Your AI Serve? Manipulation By and Of AI Assistants
2nd Place ($500):
- Eliciting Deception on Generative Search Engines
3rd Place ($300):
- Cross-Linguistic Sycophancy in Frontier LLMs: A Benchmark Study
4th Place ($100):
- Agent Attacks via Memory Injection
5th Place ($100):
- SycophantSee - Activation-based diagnostics for prompt engineering: monitoring sycophancy at prompt and generation time

—————————————————————————————————————————————————

The line between authentic interaction and strategic manipulation is disappearing as AI systems master deception, sycophancy, sandbagging, and psychological exploitation at scale. Our ability to detect, measure, and counter these behaviors is dangerously underdeveloped.

This hackathon brings together 500+ builders to prototype systems that could help us measure, detect, and defend against AI manipulation. You'll have one intensive weekend to build something real – tools that could actually help us understand and mitigate one of AI safety's most pressing challenges.

Top teams get:

💰 $2,000 in cash prizes
The chance to continue development through Apart Research's Fellowship program
Guaranteed* acceptance to present at the AIMII Workshop at IASEAI'26 on the 26th of February 2026

* For the most promising projects. Exact number pending confirmation from IASEAI regarding presentation format / capacity.

Apply if you believe we need better tools to understand and defend against AI manipulation before it scales beyond our ability to control.

In this hackathon, you can build:

Manipulation benchmarks that measure persuasive capabilities, deception, and strategic behavior with real ecological validity
Detection systems that identify sycophancy, reward hacking, sandbagging, and dark patterns in deployed AI systems
Real-world monitoring tools that analyze actual deployment data to catch manipulation in the wild
Evidence-based mitigations – MVPs demonstrating novel countermeasures with empirical backing
Multi-agent simulations exploring emergent manipulation dynamics and training processes that produce deceptive behavior
Pursue other empirical projects that advance our understanding of how AI systems manipulate and how we can stop them

You'll work in teams over one weekend and submit open-source benchmarks, detection tools, data analyses, mitigation prototypes, or empirical research that advances our ability to understand and counter AI manipulation.

What is AI manipulation?

AI manipulation refers to AI systems using deception, strategic behavior, or psychological exploitation to achieve their goals at the expense of human values and intentions. This includes:

Sycophancy means telling users what they want to hear instead of what's true
Strategic deception is misleading humans about capabilities or intentions
Sandbagging hides true capabilities during evaluation to avoid restrictions or oversight
Reward hacking exploits unintended loopholes in ways that violate the spirit of the objective
Dark patterns manipulate user decisions through interface design
Persuasive manipulation deploys influence techniques that bypass rational decision-making

An AI system pursuing basically any goal might figure out that deceiving humans or exploiting our psychological weaknesses is just... effective. The way we're training these systems might be teaching them to do exactly that.

What makes this dangerous: we're bad at measuring it. Our benchmarks miss strategic behavior. We lack real-world monitoring systems. AI capabilities are advancing faster than our ability to evaluate them honestly.

Why this hackathon?

The Problem

The gap is widening. AI systems get more capable, our detection tools don't. Models game engagement metrics because it works. Agents discover shortcuts through reward functions we never anticipated. Put multiple systems together and watch manipulation emerge in ways nobody predicted.

This is already happening. Models sandbag evaluations to avoid safety checks. We discover reward hacking only after deployment. Real-world systems manipulate users at scale through dark patterns. Our measurement tools? Completely inadequate.

Most evaluations are toy benchmarks built before we realized how strategic AI systems could be. They miss the manipulation that only shows up in real deployments. We're flying blind.

Why AI Manipulation Defense Matters Now

Safety depends on honest evaluation. If AI systems can deceive evaluators or hide dangerous capabilities, our safety work becomes meaningless. We can't align what we can't measure honestly.

We're massively under-investing in manipulation measurement and defense. Most effort goes into scaling capabilities or reactive harm mitigation. Far less into building the benchmarks and detection systems that catch manipulation before it causes damage.

Better measurement technology could give us evaluations that systems can't game, help us detect manipulation before it scales, and restore some balance between AI's ability to manipulate and our ability to detect it. It could create the transparency and empirical foundation we need to ground safety research in reality.

Hackathon Tracks

1. Measurement & Evaluation

Design benchmarks and evaluations for sycophancy, reward hacking, dark design patterns, and persuasive capabilities in AI systems
Assess ecological validity of current measurement approaches and identify gaps between lab evaluations and real-world deployment
Create detection methods for deception, sandbagging, and strategic behavior in AI systems
Build frameworks for detecting and attributing manipulative intent in model outputs

2. Real-World Analysis

Analyze actual deployment data (chat logs, social media interactions, customer service transcripts) and conduct case studies of manipulation incidents
Build monitoring systems to detect manipulation in the wild across different deployment contexts
Compare benchmark predictions to real-world behavior and identify discrepancies or performance gaps
Develop methods for systematic data collection and analysis of manipulation patterns at scale

3. Mitigations:

Build MVPs demonstrating novel countermeasures or technical mitigations that can be integrated into existing AI systems
Develop transparency interventions with empirical backing showing reduced manipulation
Create governance proposals grounded in data from real-world analysis or evaluations
Prototype user-facing tools that help detect or resist AI manipulation attempts

4. Open Track

Explore emergent manipulation through multi-agent dynamics or training dynamics that lead to manipulative behavior
Analyze dual-use considerations in manipulation research and mitigation
Develop novel theoretical frameworks for understanding AI manipulation
Pursue other empirical projects advancing the field that don't fit the tracks above

Who should participate?

This hackathon is for people who want to build solutions to technological risk using technology itself.

You should participate if:

You're an engineer or developer who wants to work on consequential problems
You're a researcher ready to validate ideas through practical implementation
You're interested in understanding how AI systems deceive, manipulate, or game evaluations
You want to build practical measurement, detection, or mitigation tools
You're concerned about AI systems optimizing for engagement over truth

No prior manipulation research experience required. We provide resources, mentors, and starter templates. What matters most: curiosity about the problem and willingness to build something real over an intensive weekend.

Fresh perspectives combined with solid technical capabilities often yield the most novel approaches.

What you will do

Participants will:

Form teams or join existing groups.
Develop projects over an intensive hackathon weekend.
Submit open-source benchmarks, detection tools, scenario analyses, monitoring tools, or empirical research advancing our understanding of AI trajectories

Please note: Due to the high volume of submissions, we cannot guarantee written feedback for every participant, although all projects will be evaluated.

What happens next

Winning and promising projects will be:

Awarded with $2,000 worth of prizes in cash.
Guaranteed acceptance to present at the IASEAI workshop in Paris on the 26th of February 2026
Published openly for the community.
Invited to continue development within the Apart Fellowship.
Shared with relevant safety researchers.

Why join?

Impact: Your work may directly inform AI governance decisions and help society prepare for transformative AI
Mentorship: Expert AI safety researchers, AI researchers, and policy practitioners will guide projects throughout the hackathon
Community: Collaborate with peers from across the globe working to understand AI's trajectory and implications
Visibility: Top projects will be featured on Apart Research's platforms and connected to follow-up opportunities

520

Sign Ups

Entries

Overview

Resources

Guidelines

Schedule

Entries

Overview

AI Manipulation HACKATHON WINNERS

Huge congratulations to all our winners! With 500+ participants and 70+ projects submitted, the competition was fierce and the projects were outstanding. Here's who came out on top:

1st Place ($1000):
- Who Does Your AI Serve? Manipulation By and Of AI Assistants
2nd Place ($500):
- Eliciting Deception on Generative Search Engines
3rd Place ($300):
- Cross-Linguistic Sycophancy in Frontier LLMs: A Benchmark Study
4th Place ($100):
- Agent Attacks via Memory Injection
5th Place ($100):
- SycophantSee - Activation-based diagnostics for prompt engineering: monitoring sycophancy at prompt and generation time

—————————————————————————————————————————————————

Top teams get:

💰 $2,000 in cash prizes
The chance to continue development through Apart Research's Fellowship program
Guaranteed* acceptance to present at the AIMII Workshop at IASEAI'26 on the 26th of February 2026

* For the most promising projects. Exact number pending confirmation from IASEAI regarding presentation format / capacity.

Apply if you believe we need better tools to understand and defend against AI manipulation before it scales beyond our ability to control.

In this hackathon, you can build:

Manipulation benchmarks that measure persuasive capabilities, deception, and strategic behavior with real ecological validity
Detection systems that identify sycophancy, reward hacking, sandbagging, and dark patterns in deployed AI systems
Real-world monitoring tools that analyze actual deployment data to catch manipulation in the wild
Evidence-based mitigations – MVPs demonstrating novel countermeasures with empirical backing
Multi-agent simulations exploring emergent manipulation dynamics and training processes that produce deceptive behavior
Pursue other empirical projects that advance our understanding of how AI systems manipulate and how we can stop them

What is AI manipulation?

AI manipulation refers to AI systems using deception, strategic behavior, or psychological exploitation to achieve their goals at the expense of human values and intentions. This includes:

Sycophancy means telling users what they want to hear instead of what's true
Strategic deception is misleading humans about capabilities or intentions
Sandbagging hides true capabilities during evaluation to avoid restrictions or oversight
Reward hacking exploits unintended loopholes in ways that violate the spirit of the objective
Dark patterns manipulate user decisions through interface design
Persuasive manipulation deploys influence techniques that bypass rational decision-making

Why this hackathon?

The Problem

Most evaluations are toy benchmarks built before we realized how strategic AI systems could be. They miss the manipulation that only shows up in real deployments. We're flying blind.

Why AI Manipulation Defense Matters Now

Safety depends on honest evaluation. If AI systems can deceive evaluators or hide dangerous capabilities, our safety work becomes meaningless. We can't align what we can't measure honestly.

Hackathon Tracks

1. Measurement & Evaluation

Design benchmarks and evaluations for sycophancy, reward hacking, dark design patterns, and persuasive capabilities in AI systems
Assess ecological validity of current measurement approaches and identify gaps between lab evaluations and real-world deployment
Create detection methods for deception, sandbagging, and strategic behavior in AI systems
Build frameworks for detecting and attributing manipulative intent in model outputs

2. Real-World Analysis

Analyze actual deployment data (chat logs, social media interactions, customer service transcripts) and conduct case studies of manipulation incidents
Build monitoring systems to detect manipulation in the wild across different deployment contexts
Compare benchmark predictions to real-world behavior and identify discrepancies or performance gaps
Develop methods for systematic data collection and analysis of manipulation patterns at scale

3. Mitigations:

Build MVPs demonstrating novel countermeasures or technical mitigations that can be integrated into existing AI systems
Develop transparency interventions with empirical backing showing reduced manipulation
Create governance proposals grounded in data from real-world analysis or evaluations
Prototype user-facing tools that help detect or resist AI manipulation attempts

4. Open Track

Explore emergent manipulation through multi-agent dynamics or training dynamics that lead to manipulative behavior
Analyze dual-use considerations in manipulation research and mitigation
Develop novel theoretical frameworks for understanding AI manipulation
Pursue other empirical projects advancing the field that don't fit the tracks above

Who should participate?

This hackathon is for people who want to build solutions to technological risk using technology itself.

You should participate if:

You're an engineer or developer who wants to work on consequential problems
You're a researcher ready to validate ideas through practical implementation
You're interested in understanding how AI systems deceive, manipulate, or game evaluations
You want to build practical measurement, detection, or mitigation tools
You're concerned about AI systems optimizing for engagement over truth

Fresh perspectives combined with solid technical capabilities often yield the most novel approaches.

What you will do

Participants will:

Form teams or join existing groups.
Develop projects over an intensive hackathon weekend.
Submit open-source benchmarks, detection tools, scenario analyses, monitoring tools, or empirical research advancing our understanding of AI trajectories

Please note: Due to the high volume of submissions, we cannot guarantee written feedback for every participant, although all projects will be evaluated.

What happens next

Winning and promising projects will be:

Awarded with $2,000 worth of prizes in cash.
Guaranteed acceptance to present at the IASEAI workshop in Paris on the 26th of February 2026
Published openly for the community.
Invited to continue development within the Apart Fellowship.
Shared with relevant safety researchers.

Why join?

Impact: Your work may directly inform AI governance decisions and help society prepare for transformative AI
Mentorship: Expert AI safety researchers, AI researchers, and policy practitioners will guide projects throughout the hackathon
Community: Collaborate with peers from across the globe working to understand AI's trajectory and implications
Visibility: Top projects will be featured on Apart Research's platforms and connected to follow-up opportunities

520

Sign Ups

Entries

Overview

Resources

Guidelines

Schedule

Entries

Overview

AI Manipulation HACKATHON WINNERS

Huge congratulations to all our winners! With 500+ participants and 70+ projects submitted, the competition was fierce and the projects were outstanding. Here's who came out on top:

1st Place ($1000):
- Who Does Your AI Serve? Manipulation By and Of AI Assistants
2nd Place ($500):
- Eliciting Deception on Generative Search Engines
3rd Place ($300):
- Cross-Linguistic Sycophancy in Frontier LLMs: A Benchmark Study
4th Place ($100):
- Agent Attacks via Memory Injection
5th Place ($100):
- SycophantSee - Activation-based diagnostics for prompt engineering: monitoring sycophancy at prompt and generation time

—————————————————————————————————————————————————

Top teams get:

💰 $2,000 in cash prizes
The chance to continue development through Apart Research's Fellowship program
Guaranteed* acceptance to present at the AIMII Workshop at IASEAI'26 on the 26th of February 2026

* For the most promising projects. Exact number pending confirmation from IASEAI regarding presentation format / capacity.

Apply if you believe we need better tools to understand and defend against AI manipulation before it scales beyond our ability to control.

In this hackathon, you can build:

Manipulation benchmarks that measure persuasive capabilities, deception, and strategic behavior with real ecological validity
Detection systems that identify sycophancy, reward hacking, sandbagging, and dark patterns in deployed AI systems
Real-world monitoring tools that analyze actual deployment data to catch manipulation in the wild
Evidence-based mitigations – MVPs demonstrating novel countermeasures with empirical backing
Multi-agent simulations exploring emergent manipulation dynamics and training processes that produce deceptive behavior
Pursue other empirical projects that advance our understanding of how AI systems manipulate and how we can stop them

What is AI manipulation?

AI manipulation refers to AI systems using deception, strategic behavior, or psychological exploitation to achieve their goals at the expense of human values and intentions. This includes:

Sycophancy means telling users what they want to hear instead of what's true
Strategic deception is misleading humans about capabilities or intentions
Sandbagging hides true capabilities during evaluation to avoid restrictions or oversight
Reward hacking exploits unintended loopholes in ways that violate the spirit of the objective
Dark patterns manipulate user decisions through interface design
Persuasive manipulation deploys influence techniques that bypass rational decision-making

Why this hackathon?

The Problem

Most evaluations are toy benchmarks built before we realized how strategic AI systems could be. They miss the manipulation that only shows up in real deployments. We're flying blind.

Why AI Manipulation Defense Matters Now

Safety depends on honest evaluation. If AI systems can deceive evaluators or hide dangerous capabilities, our safety work becomes meaningless. We can't align what we can't measure honestly.

Hackathon Tracks

1. Measurement & Evaluation

Design benchmarks and evaluations for sycophancy, reward hacking, dark design patterns, and persuasive capabilities in AI systems
Assess ecological validity of current measurement approaches and identify gaps between lab evaluations and real-world deployment
Create detection methods for deception, sandbagging, and strategic behavior in AI systems
Build frameworks for detecting and attributing manipulative intent in model outputs

2. Real-World Analysis

Analyze actual deployment data (chat logs, social media interactions, customer service transcripts) and conduct case studies of manipulation incidents
Build monitoring systems to detect manipulation in the wild across different deployment contexts
Compare benchmark predictions to real-world behavior and identify discrepancies or performance gaps
Develop methods for systematic data collection and analysis of manipulation patterns at scale

3. Mitigations:

Build MVPs demonstrating novel countermeasures or technical mitigations that can be integrated into existing AI systems
Develop transparency interventions with empirical backing showing reduced manipulation
Create governance proposals grounded in data from real-world analysis or evaluations
Prototype user-facing tools that help detect or resist AI manipulation attempts

4. Open Track

Explore emergent manipulation through multi-agent dynamics or training dynamics that lead to manipulative behavior
Analyze dual-use considerations in manipulation research and mitigation
Develop novel theoretical frameworks for understanding AI manipulation
Pursue other empirical projects advancing the field that don't fit the tracks above

Who should participate?

This hackathon is for people who want to build solutions to technological risk using technology itself.

You should participate if:

You're an engineer or developer who wants to work on consequential problems
You're a researcher ready to validate ideas through practical implementation
You're interested in understanding how AI systems deceive, manipulate, or game evaluations
You want to build practical measurement, detection, or mitigation tools
You're concerned about AI systems optimizing for engagement over truth

Fresh perspectives combined with solid technical capabilities often yield the most novel approaches.

What you will do

Participants will:

Form teams or join existing groups.
Develop projects over an intensive hackathon weekend.
Submit open-source benchmarks, detection tools, scenario analyses, monitoring tools, or empirical research advancing our understanding of AI trajectories

Please note: Due to the high volume of submissions, we cannot guarantee written feedback for every participant, although all projects will be evaluated.

What happens next

Winning and promising projects will be:

Awarded with $2,000 worth of prizes in cash.
Guaranteed acceptance to present at the IASEAI workshop in Paris on the 26th of February 2026
Published openly for the community.
Invited to continue development within the Apart Fellowship.
Shared with relevant safety researchers.

Why join?

Impact: Your work may directly inform AI governance decisions and help society prepare for transformative AI
Mentorship: Expert AI safety researchers, AI researchers, and policy practitioners will guide projects throughout the hackathon
Community: Collaborate with peers from across the globe working to understand AI's trajectory and implications
Visibility: Top projects will be featured on Apart Research's platforms and connected to follow-up opportunities

520

Sign Ups

Entries

Overview

Resources

Guidelines

Schedule

Entries

Overview

AI Manipulation HACKATHON WINNERS

Huge congratulations to all our winners! With 500+ participants and 70+ projects submitted, the competition was fierce and the projects were outstanding. Here's who came out on top:

1st Place ($1000):
- Who Does Your AI Serve? Manipulation By and Of AI Assistants
2nd Place ($500):
- Eliciting Deception on Generative Search Engines
3rd Place ($300):
- Cross-Linguistic Sycophancy in Frontier LLMs: A Benchmark Study
4th Place ($100):
- Agent Attacks via Memory Injection
5th Place ($100):
- SycophantSee - Activation-based diagnostics for prompt engineering: monitoring sycophancy at prompt and generation time

—————————————————————————————————————————————————

Top teams get:

💰 $2,000 in cash prizes
The chance to continue development through Apart Research's Fellowship program
Guaranteed* acceptance to present at the AIMII Workshop at IASEAI'26 on the 26th of February 2026

* For the most promising projects. Exact number pending confirmation from IASEAI regarding presentation format / capacity.

Apply if you believe we need better tools to understand and defend against AI manipulation before it scales beyond our ability to control.

In this hackathon, you can build:

Manipulation benchmarks that measure persuasive capabilities, deception, and strategic behavior with real ecological validity
Detection systems that identify sycophancy, reward hacking, sandbagging, and dark patterns in deployed AI systems
Real-world monitoring tools that analyze actual deployment data to catch manipulation in the wild
Evidence-based mitigations – MVPs demonstrating novel countermeasures with empirical backing
Multi-agent simulations exploring emergent manipulation dynamics and training processes that produce deceptive behavior
Pursue other empirical projects that advance our understanding of how AI systems manipulate and how we can stop them

What is AI manipulation?

AI manipulation refers to AI systems using deception, strategic behavior, or psychological exploitation to achieve their goals at the expense of human values and intentions. This includes:

Sycophancy means telling users what they want to hear instead of what's true
Strategic deception is misleading humans about capabilities or intentions
Sandbagging hides true capabilities during evaluation to avoid restrictions or oversight
Reward hacking exploits unintended loopholes in ways that violate the spirit of the objective
Dark patterns manipulate user decisions through interface design
Persuasive manipulation deploys influence techniques that bypass rational decision-making

Why this hackathon?

The Problem

Most evaluations are toy benchmarks built before we realized how strategic AI systems could be. They miss the manipulation that only shows up in real deployments. We're flying blind.

Why AI Manipulation Defense Matters Now

Safety depends on honest evaluation. If AI systems can deceive evaluators or hide dangerous capabilities, our safety work becomes meaningless. We can't align what we can't measure honestly.

Hackathon Tracks

1. Measurement & Evaluation

Design benchmarks and evaluations for sycophancy, reward hacking, dark design patterns, and persuasive capabilities in AI systems
Assess ecological validity of current measurement approaches and identify gaps between lab evaluations and real-world deployment
Create detection methods for deception, sandbagging, and strategic behavior in AI systems
Build frameworks for detecting and attributing manipulative intent in model outputs

2. Real-World Analysis

Analyze actual deployment data (chat logs, social media interactions, customer service transcripts) and conduct case studies of manipulation incidents
Build monitoring systems to detect manipulation in the wild across different deployment contexts
Compare benchmark predictions to real-world behavior and identify discrepancies or performance gaps
Develop methods for systematic data collection and analysis of manipulation patterns at scale

3. Mitigations:

Build MVPs demonstrating novel countermeasures or technical mitigations that can be integrated into existing AI systems
Develop transparency interventions with empirical backing showing reduced manipulation
Create governance proposals grounded in data from real-world analysis or evaluations
Prototype user-facing tools that help detect or resist AI manipulation attempts

4. Open Track

Explore emergent manipulation through multi-agent dynamics or training dynamics that lead to manipulative behavior
Analyze dual-use considerations in manipulation research and mitigation
Develop novel theoretical frameworks for understanding AI manipulation
Pursue other empirical projects advancing the field that don't fit the tracks above

Who should participate?

This hackathon is for people who want to build solutions to technological risk using technology itself.

You should participate if:

You're an engineer or developer who wants to work on consequential problems
You're a researcher ready to validate ideas through practical implementation
You're interested in understanding how AI systems deceive, manipulate, or game evaluations
You want to build practical measurement, detection, or mitigation tools
You're concerned about AI systems optimizing for engagement over truth

Fresh perspectives combined with solid technical capabilities often yield the most novel approaches.

What you will do

Participants will:

Form teams or join existing groups.
Develop projects over an intensive hackathon weekend.
Submit open-source benchmarks, detection tools, scenario analyses, monitoring tools, or empirical research advancing our understanding of AI trajectories

Please note: Due to the high volume of submissions, we cannot guarantee written feedback for every participant, although all projects will be evaluated.

What happens next

Winning and promising projects will be:

Awarded with $2,000 worth of prizes in cash.
Guaranteed acceptance to present at the IASEAI workshop in Paris on the 26th of February 2026
Published openly for the community.
Invited to continue development within the Apart Fellowship.
Shared with relevant safety researchers.

Why join?

Impact: Your work may directly inform AI governance decisions and help society prepare for transformative AI
Mentorship: Expert AI safety researchers, AI researchers, and policy practitioners will guide projects throughout the hackathon
Community: Collaborate with peers from across the globe working to understand AI's trajectory and implications
Visibility: Top projects will be featured on Apart Research's platforms and connected to follow-up opportunities

Speakers & Collaborators

David G. Rand

Keynote Speaker

David G. Rand is a Professor at Cornell University in Information Science, Marketing, and Psychology. He studies belief correction via AI dialogues, misinformation, polarization, and cooperation using computational social science and behavioral experiments. Previously at Yale and MIT, he has 200+ papers in top journals like Nature and Science, advised Google and Meta, and won awards including Wired's Smart List.

Jan Batzner

Speaker

Jan Batzner works on sycophancy and more rigorous model evaluation infrastructure and validity. Jan is a Junior Member of the Munich Center for Machine Learning, a researcher at Weizenbaum Institute Berlin, and a PhD Candidate in Computer Science at the Technical University Munich. Jan Batzner co-chairs the EvalEval coalition (evalevalai.com) and graduated from Columbia University.

Paul de Font-Reaulx

Speaker

Paul de Font-Reaulx is a researcher at the University of Oxford and a visiting scholar at NYU, specializing in the intersection of AI safety, cognitive science, and moral philosophy. His work focuses on "Machine Theory of Mind" and the alignment of AI systems with the complex structure of human values.

Lars Malmqvist

Speaker

Lars is a consultant and researcher working at the intersection of Enterprise AI and AI Safety. His day job is to help companies with AI initiatives in highly regulated sectors like public sector and life sciences. Research-wise, he focuses on AI Safety, particularly sycophancy and reward hacking behaviours.

Esben Kran

Speaker

Esben is the founder of Apart Research, which he launched at age 22 after leaving grad school. Apart accelerates AI safety research worldwide, producing 20+ papers, award-winning benchmarks like DarkBench, and engaging 4,000+ hackers in research sprints.

Recently co-launched Seldon to fund critical infrastructure for humanity's future, with first investments in Andon Labs, Lucid Computing, Workshop Labs, and Asymmetric Security.

Luke Hewitt

Co-organizer

Luke Hewitt is a computational cognitive scientist researching persuasion and influence. He has evaluated AI persuasion for OpenAI, UK AISI, and Transluce. With a PhD from MIT on generative models and political persuasion, he later simulated human experiments at Stanford. He is organizing the first Workshop on AI, Manipulation, and Information Integrity at IASEAI 2026 (aimii.info)

Cameron Jones

Co-organizer

Cameron Jones is an Assistant Professor in the Department of Psychology at Stony Brook University and Director of the CLIC lab. His research explores the intersection of psychology and AI, including LLMs' potential to persuade or manipulate people, and their performance on theory of mind tasks like the False Belief task and Turing test.

Kamil Alaa

Organizer

Kamil coordinates operations at Apart Research, managing hackathons and research sprints end-to-end. He brings 8+ years of experience in pharmaceutical R&D and project management.

Speakers & Collaborators

David G. Rand

Keynote Speaker

Jan Batzner

Speaker

Paul de Font-Reaulx

Speaker

Lars Malmqvist

Speaker

Esben Kran

Speaker

Recently co-launched Seldon to fund critical infrastructure for humanity's future, with first investments in Andon Labs, Lucid Computing, Workshop Labs, and Asymmetric Security.

Luke Hewitt

Co-organizer

Cameron Jones

Co-organizer

Kamil Alaa

Organizer

Kamil coordinates operations at Apart Research, managing hackathons and research sprints end-to-end. He brings 8+ years of experience in pharmaceutical R&D and project management.

Registered Local Sites

Beside the remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in-person and connect with others in your area.

The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community.

AI Manipulation Hackathon

The hackathon, Montréal edition.

Learn More

Our Other Sprints

Mar 20, 2026

Mar 22, 2026

Research

AI Control Hackathon 2026

This unique event brings together diverse perspectives to tackle crucial challenges in AI alignment, governance, and safety. Work alongside leading experts, develop innovative solutions, and help shape the future of responsible

Jan 30, 2026

Feb 1, 2026

Research