Apr 26, 2026
WHO WROTE THIS SEQUENCE?
Babita Singh
Today, any curious mind can open the laptop, design a novel enzyme, order it synthesised, and have it on a bench before any registry knows it exists. This is an extraordinary scientific advancement, but without the right infrastructure, a biosecurity problem waiting to compound. Generative AI is producing novel proteins and genes faster than the field can catalogue, evaluate, or attribute them. No shared infrastructure exists to distinguish AI-designed sequences from naturally occurring ones, screen them for biosafety, or credit their creators.
ArtGene-Archive (artgene-archive.org) is the first dedicated registry for AI-generated biological sequences. Every submission passes an automated three-gate biosafety pipeline, receives a cryptographically signed certificate anchored to a tamper-evident audit log, and is issued a citable Registry ID. Built on experience at the European Genome-phenome Archive and grounded in emerging AI biosafety research, this dedicated archive solves a specific structural gap: provenance and safety certification at the point of design, not the point of discovery. What it needs now is what GenBank needed in 1982 - institutional commitment, knowledge contribution, and collective adoption.
This is a well-presented and executed project. However, despite the very personal description of the motivation, I struggle to see how this project would specifically help prevent the misuse of AI-enabled biological design. ArtGen Archive is built as a voluntary database that can help scientists take ownership of their projects, and the built-in biosafety screening is commendable for avoiding accidental biosafety mistakes. Malicious actors would likely just not use the platform, and it iI do not see a strong pathway by which this paltform lowers malicious actors' ability to access dangerous pathogens or otherwise reduces the likelihood of deliberate release of biological agent. Without such additional justification I believe that this project is off-topic for this hackathon, despite its excellent execution.
Feels like a solution looking for a problem.
This is an impressively creative proof of concept that elegantly demonstrates how protein language models, biosecurity screening, watermarking, certification, cryptography, and blockchain could be combined to build an infrastructure for attributing AI‑designed biological sequences. It’s remarkably well put together, especially given the short timeframe of a hackathon. In an ideal world, a system like this should already exist. Unfortunately, reality is far more complex, and there are numerous practical, technical, and governance challenges that make deploying such an approach extremely difficult. And if even parts of this vision eventually become feasible, they could contribute to stronger biosafety, though not necessarily biosecurity. Even so, it represents a meaningful step in the right direction.
Cite this work
@misc {
title={
(HckPrj) WHO WROTE THIS SEQUENCE?
},
author={
Babita Singh
},
date={
4/26/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


