Mar 23, 2026
Hidden in Plain Sight: Representational Adversarial Steganography in Colluding LLMs
Vainateya Rangaraju, Srujana Medicherla, Dennis Lim, Guillaume Zahnd, Anna Konovalenko, Igor Pereverzev
We investigate whether Large Language Models (LLMs) can develop representational steganography: the ability to encode and transmit hidden signals within their internal representations in a way that oversight mechanisms cannot readily detect. We study this capability under the threat models of deception and collusion in multi-agent settings. We show that it emerges under adversarial pressure, discuss the robustness of our experiments, and consider what the results may imply for existing machine oversight regimes.
Cite this work
@misc{
  title={(HckPrj) Hidden in Plain Sight: Representational Adversarial Steganography in Colluding LLMs},
  author={Vainateya Rangaraju and Srujana Medicherla and Dennis Lim and Guillaume Zahnd and Anna Konovalenko and Igor Pereverzev},
  date={2026-03-23},
  organization={Apart Research},
  note={Research submission to the research sprint hosted by Apart.},
  howpublished={https://apartresearch.com}
}


