Jun 14, 2025
AI Assistance in AI alignment Improvement: Allow It!
Anthony Bailey
A Narrow Path currently includes a condition (A): “No AIs improving AIs” that underlies various parts of the document.
It makes no exception for what I will abbreviate as AI^4: AI Assisting In AI Alignment Improvement. It should, because sufficiently many in AI safety while acknowledging its unique hazards still see value in exploring “AI helping with our alignment homework.”
Specifically I am Red Teaming Phase 0 policy 3 (“Prohibit the development and use of AIs that improve other AIs”) arguing no policy that bans AI^4 will garner sufficient support to be agreed or enforced.
A variety of Deep Research implementations suggested 70-90% of those expressing relevant opinions in forums associated with AIXR concern would oppose such a ban.
If AI assistance in AI alignment improvements is to be allowed, that needs to be made more clear, and consequences to licensing and enforcement considered.
Dave Kasten
We appreciate this thoughtful feedback. The argument about using AI cautiously for tasks to reduce AI risks is an important one to consider, and the arguments about the need to consider it for both principled and pragmatic reasons were interesting. It would have been helpful to see additional arguments about how likely it would be that a policy regime could succeed at correctly drawing the line between AI work that reduces AI risk and AI work that increases it, but nonetheless worth considering given, e.g., UK AISI's clear interest in this (as well as, of course, all major frontier AI companies)
Tolga Bilge
Seems like a reasonable objection that is worth pondering and either changing or providing some justification for keeping as is.
However, if changes to permit this allow for the possibility of an intelligence recursion, I don't think it should be allowed.
Didn't really engage with the goal of preventing the development of superintelligence for 20+ years.
Cite this work
@misc {
title={
@misc {
},
author={
Anthony Bailey
},
date={
6/14/25
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}