Apr 26, 2026
FuncScreen: Contrastive PLM Embeddings for Evasion-Resistant Biosecurity Screening
Aheli Poddar
Current DNA synthesis screening relies on sequence homology, which AI protein design tools like ProteinMPNN evade by generating functional threat variants with as low as 7% sequence identity to known threats. We introduce FuncScreen, a contrastive learning framework over frozen ESM-2 embeddings that screens by predicted biological function rather than sequence similarity. Trained with supervised contrastive loss, hard-negative mining, and embedding-space Mixup augmentation on 985 curated pore-forming toxin and benign homolog sequences, FuncScreen achieves 1.000 AUROC [1.000, 1.000] on standard and hard-negative splits. On 4,100 ProteinMPNN-designed adversarial variants, FuncScreen maintains 0.991 AUROC [0.988, 0.993] where homology drops to 0.952 [0.944, 0.959]. We provide a preliminary certified robustness analysis under biologically structured mutations (1,000 Monte Carlo samples, 100 sequences), finding an empirical-certified gap of at most 1%. We validate generalization on a second threat family (ribosome-inactivating proteins, AUROC 0.962) and report out-of-distribution false positive rates.
This is an important contribution to the screening literature. I appreciated the use of two different threat families and the use of multiple evaluation splits. I thought the inclusion of the Leave-One-Subcategory-Out Cross-Validation was particularly interesting. I particularly enjoyed your framing of function-based approaches as a compliment to more traditional sequence based approaches. This struck be as constructive, rather than adversarial. It would also be interesting to see this tested with more threat families.
Seems like a very valuable contribution to a future problem and I'd like to see this work taken further.
Cite this work
@misc {
title={
(HckPrj) FuncScreen: Contrastive PLM Embeddings for Evasion-Resistant Biosecurity Screening
},
author={
Aheli Poddar
},
date={
4/26/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


