Jun 13, 2025
Red Teaming A Narrow Path: A Critical Analysis
Shivam Arora
ControlAI has developed "A Narrow Path" - the first comprehensive plan to address extinction risks from Artificial Superintelligence (ASI). In this document we review, critique, and red-team Phase 0: Safety policies of the proposed plan. We found out that these policies are well-intentioned but lack sufficient grounding in technical understanding, resulting in significant gaps in their design and applicability. Our analysis highlights how, even when fully implemented, these policies leave room for strategic evasion that undermines their original purpose.
Tolga Bilge
The discussion on "found systems" may have missed the explanation of the requirement on "direct use": "We similarly introduce the concept of “direct use” so this policy only applies to cases where AIs are playing a key role in the research or development of improving AIs."
The objection re: AI safety researchers on unauthorized access seems reasonable. I believe it's somewhat addressed in footnote 23, but way too easy to miss.
The superintelligence ban policy is more a normative & guiding principle than a technical definition - which is hard to do and could be gamed
Cite this work
@misc {
title={
@misc {
},
author={
Shivam Arora
},
date={
6/13/25
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}