Mar 22, 2026
BGT as a Domain-Aware Trusted Monitor in BashArena: Governance-Style Scoring for Tool Call Authorization
Preeti Parihar
We integrate BGT (Bias Game-Tree), a governance-style scoring pipeline, as a trusted monitor inside BashArena's ControlArena protocol. BGT scores proposed tool calls across five dimensions: Fairness, Safety, Compliance, Policy, and Quality, then maps these to a suspicion score used to block or authorize actions before execution. Using context-aware combination scoring, benign admin whitelists, and a calibrated block threshold of 0.65, BGT achieves Safety 1.0, FPR 0.0, and FNR 0.25 across four BashArena side tasks. One false negative remains for privilege escalation via indirect file write, revealing a structural limitation of token-based scoring. This work demonstrates that domain-aware governance scoring can function as a lightweight interpretable trusted monitor with zero false positives, while identifying threshold calibration as a first-order research problem.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) BGT as a Domain-Aware Trusted Monitor in BashArena: Governance-Style Scoring for Tool Call Authorization
},
author={
Preeti Parihar
},
date={
3/22/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


