Mar 20, 2026
Geographic Blind Spots in AI Control Monitors: A Cross-National Audit of Claude Opus 4.6
Jason Hung
Artificial intelligence (AI) control protocols assume that trusted large language model (LLM) monitors reliably assess proposed actions across all deployment contexts. This project tests that assumption in the geographic dimension. We audit Claude Opus 4.6—the monitor specified in Apart Research’s AI Control Hackathon Track 3 benchmark—for systematic gaps in its factual knowledge of the global AI landscape. We develop the AI Control Knowledge Framework (ACKF), a six-dimension thematic scheme, and operationalise it with 17 verified indicators drawn from the Global AI Dataset v2 (GAID v2): 24,453 indicators across 227 countries published on Harvard Dataverse. A five-category response classification scheme distinguishes verifiable fabrication (VF) from honest refusal (HR); logistic regression with country-clustered standard errors combined with difference-in-differences (DiD) estimation quantifies geographic disparities in monitor accuracy across 2,820 country-metric-year observations. Contrary to our initial hypothesis, Claude Opus 4.6 produces higher fabrication rates for Global North queries than for Global South counterparts—a pattern consistent with a partial-knowledge mechanism in which the model attempts answers more frequently for Global North contexts but commits to incorrect values. This fabrication profile constitutes an exploitable vulnerability, where an adversarial AI system could frame harmful actions in governance or public attitude terms to reduce the probability of detection. This project provides the first cross-national, multi-domain audit of an AI control monitor’s geographic knowledge gaps, with direct implications for the design of control protocols.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Geographic Blind Spots in AI Control Monitors: A Cross-National Audit of Claude Opus 4.6
},
author={
Jason Hung
},
date={
3/20/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


