This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
Accepted at the AI Safety Entrepreneurship Hackathon research sprint on January 20, 2025

CoTEP: A Multi-Modal Chain of Thought Evaluation Platform for the Next Generation of SOTA AI Models

As state-of-the-art models such as OpenAI's o1 series, the upcoming o3 family, Gemini 2.0 Flash Thinking, and DeepSeek display increasingly sophisticated chain-of-thought (CoT) capabilities, safety evaluations have not yet caught up. We propose building a platform for gathering systematic evaluations of AI reasoning processes and using them to create comprehensive safety benchmarks. Our Chain of Thought Evaluation Platform (CoTEP) will help establish standards for assessing AI reasoning and, through industry and government collaboration, help ensure the development of more robust, trustworthy AI systems.

By Alyssia J, Martin CL
