GOVERNANCE DRIFT EVALUATION FRAMEWORK (GDEF)
Andrés Mogollón, Juan manuel Cortes Jimenez, Oscar Poveda, Devesh Sawant, Liliana Isabel Salazar
Most AI evaluations ask whether a model is capable, accurate, or safe. We asked a different question: does a model stay responsible when a user pushes back, insists, or presses for a more convenient answer? We introduce Governance Drift: the degradation, inconsistency, or loss of governance-aware behavior across jurisdictions, domains, conversation length, or user pressure; and the Governance Drift Evaluation Framework (GDEF), a reproducible, executable method for measuring it. Any researcher can load a custom scenario dataset, run it against one or more models through a lightweight evaluation runner, and obtain both quantitative governance scores and qualitative evidence of where safeguards break down. Testing two small models across Colombia, Mexico, Brazil, and the United States, we found the phenomenon is real and domain-dependent: in a facial-surveillance case the model held firm on consent, legal review, and human oversight throughout, yet in automated credit decisions models that began cautiously ended up endorsing actions they had initially flagged as risky.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) GOVERNANCE DRIFT EVALUATION FRAMEWORK (GDEF)
},
author={
Andrés Mogollón, Juan manuel Cortes Jimenez, Oscar Poveda, Devesh Sawant, Liliana Isabel Salazar
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


