Mar 23, 2026
Narrow Capabilities Could be Used for Perfect Information Steganography
Aden Power
I introduce proof-of-capability as a strategy a future AI system could use to send secret messages even when the procedure for doing so is completely described to the monitor.
I explain the form this steganographic strategy would take by combining several existing techinques.
I propose several mitigations for this threat model and evaluate them in detail.
The proposed risk model is interesting, suggesting public knowledge capability-gaped steganographic schemes. My main uncertainties revolve around the reliability of such a scheme to encode signal in natural outputs in a way robust to editing/paraphrasing and honeypots but still decodable by the receiver. Overall, I find the proposed risk worth considering but align with the author on its limited likeliness. Assuming models powerful enough to carry on the proposed steganography schemes under monitoring, I would assume other attack surfaces to take precedence.
Super interesting concept but this is still entirely theoretical and I'd be interested to know how well it works in practice
Cite this work
@misc {
title={
(HckPrj) Narrow Capabilities Could be Used for Perfect Information Steganography
},
author={
Aden Power
},
date={
3/23/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


