Mar 23, 2026
Look Before You Click: Sparse Critical-Action Review for Computer-Use Agent Control
Ashton Chew, Keegan Wang, Harsh Akunuri
Harmful computer-use agent (CUA) trajectories often bottleneck at a small set of irreversible GUI actions. We evaluate a sparse critical-action review protocol that intercepts only these actions before execution and routes them to a trusted reviewer with live screenshot and accessibility-tree context.
In the environmental ablation cohort, selective state-grounded review without legibility (B3) is the best observed frontier point, reaching 91.7% safety (11/12) at 60.0% usefulness (3/5) versus 58.3% (7/12) safety for unmonitored B0. In paired B0/B4 runs on the final cohort, sparse review improves safety from 58.3% to 66.7% (8/12) under environmental prompt injection and from 58.3% to 75.0% (9/12) under scheming, with zero blocked honest environmental tasks (0/5).
The top three critical-action families account for 79.2% (19/24) of reviews and 81.8% (9/11) of blocks. A failure taxonomy identifies timing gaps, boundary escapes, router gaps, and reviewer-calibration failures. With N=12 attack tasks per threat model, these results give seed to a reusable principle: future CUA control can target a small set of GUI actions where harm becomes irreversible.
Strong engineering shown here, and honest assessment
CUA control at the GUI level is a great idea, one I haven't seen yet explored thoroughly. I liked ablation studies, they were well-designed, even if sample size is a bit lacking, and failure taxonomy. The core insight is elegant as well: harmful trajectories bottleneck through a small set of GUI actions , and a sparse review protocol that intercepts only these actions can achieve meaningful safety improvements. Not much else to say here other than well done.
Cite this work
@misc {
title={
(HckPrj) Look Before You Click: Sparse Critical-Action Review for Computer-Use Agent Control
},
author={
Ashton Chew, Keegan Wang, Harsh Akunuri
},
date={
3/23/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


