Apr 26, 2026
Geometric Biosecurity: Continuous Threat Severity Scoring via Spectral Decomposition of Protein Language Model Embeddings
Nikaran Kanchanadevi Marimuthu, Vennila Kanchanadevi Marimuthu
Geometric Biosecurity is a continuous threat severity scoring system designed to address vulnerabilities in current biosecurity screening software (BSS) caused by AI-designed protein variants. Developed at the AlxBio Hackathon in April 2026, the system shifts from traditional sequence similarity matching to a functional embedding space by utilizing ESM-2 protein language model embeddings. By applying singular value decomposition (SVD) to extract a spectral threat axis, the model produces a severity score (0–1) that remains highly effective even when sequence identity is low. Validation on over 179,000 sequences demonstrated significant performance gains, particularly in the "AI-redesign evasion zone" (20–40% sequence identity) where it outperformed identity-based scoring by 31.6%, and in detecting short peptide toxins where existing tools are often weakest. Intended as a complementary second-stage screening layer, the system adds a necessary dimension of geometric discrimination to protect against sophisticated synthetic biological threats.
Very strong proposal. Benchmark on real proteinmpnn redesigned sequences?
This is a neat idea and I do think there's a lot of value in augmenting existing synthesis screening algorithms with follow-up analyses by specialized biological tools. However, I am not sure how well this method will extend to novel / slightly modified viral proteins who are by their very nature less distant in protein space than toxins from benign proteins. It will also be interesting to see whether the method still works with shorter sequences. I am uncertain whether this particular approach is a promising research direction.
I like that this shifts us in the direction of functional rather than sequence screening. While I can't comment too much on the technical approach, the 31.6% improvement over identity-based scoring in the AI-redesign 'evasion zone' seems like a big improvement.
Cite this work
@misc {
title={
(HckPrj) Geometric Biosecurity: Continuous Threat Severity Scoring via Spectral Decomposition of Protein Language Model Embeddings
},
author={
Nikaran Kanchanadevi Marimuthu, Vennila Kanchanadevi Marimuthu
},
date={
4/26/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


