TraceGuard-X: Adaptive Collusion Resistant Monitoring for Agentic AI Systems , a hierarchical governance framework with constitutional dimension prompting, temporal trajectory analysis, and cross-environment transfer
AYUSH KUMAR SAMAL
The deployment of increasingly capable autonomous AI agents raises a fundamental governance challenge: how can a monitoring system reliably distinguish benign from harmful behaviour when both the acting agent and the monitor may be strategically misaligned? Recent work on TraceGuard introduced structured multi-dimensional monitoring as a collusion-resistant
alternative to single-score evaluation. While promising, three key questions remain unresolved:
(i) robustness against rubric-aware adaptive attackers, (ii) computational efficiency of multidimensional monitoring, and (iii) generalisation beyond shell-based environments.
We present TraceGuard-X, a hierarchical monitoring framework that extends TraceGuard through adaptive adversarial evaluation, dynamic dimension routing, temporal trajectory analysis, federated monitoring, and cross-environment governance transfer. TraceGuard-X introduces
six core mechanisms: Constitutional Dimension Prompting (CDP), Temporal Graph of-Thought Monitoring (TGM), Divergence-Based Collusion Detection (DBCD),Dynamic Relevance Routing (DR-TraceGuard), Multi-Model Federated Monitoring (MMFM), and Hierarchical Domain-Adaptive Governance Transfer (HiDAgT).
Preliminary calibration experiments on the BashArena benchmark demonstrate statistically significant separation between benign and attack trajectories across all five monitoring dimensions, with large effect sizes (Cohen’s d > 0.96 on every dimension) and highly significant Kolmogorov-Smirnov statistics (p < 10−5 across all dimensions). A three-dimension routing policy achieves 84% of full-ensemble AUC at 60% API cost. The three-layer HiDAgT architecture enables zero-shot transfer to Infrastructure-as-Code, code-generation, and web-agent environments without modifying the universal meta-monitor.
Our results suggest that structured monitoring remains a viable governance primitive even under increasingly sophisticated attack models and heterogeneous deployment environments.
All code, calibration data, and evaluation infrastructure are publicly available.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) TraceGuard-X: Adaptive Collusion Resistant Monitoring for Agentic AI Systems , a hierarchical governance framework with constitutional dimension prompting, temporal trajectory analysis, and cross-environment transfer
},
author={
AYUSH KUMAR SAMAL
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


