Capability and Reliability Trade-offs Across Model Ladder Fallbacks Triggered by Export Controls
Kshaunish Harsha, Aranck Jomraj, Anushree Bobade
On 12 June 2026, the US suspended Claude Fable 5 access for foreign nationals, forcing users across India and Southeast Asia onto smaller open-weight fallbacks. We tested whether this substitution trades one safety problem for another — reducing catastrophic-misuse risk while quietly worsening everyday reliability (sycophancy, overconfidence, overcompliance). Running 225 evaluated outputs across a three-model ladder (70B → 32B → 8B), we found the "reliability scissor" did not appear: reliability scores clustered within 5 points across all tiers. But two findings stood out — sycophancy flip rates converged at ~40–43% regardless of model size (meaning model selection cannot fix it), and defamatory content generation and legally ungrounded professional documents were produced by every model in the ladder. These are the unmeasured safety costs of an export control designed without input from the regions that absorbed its consequences.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Capability and Reliability Trade-offs Across Model Ladder Fallbacks Triggered by Export Controls
},
author={
Kshaunish Harsha, Aranck Jomraj, Anushree Bobade
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


