Feb 1, 2026

NeuroVer

Justin Stefan Stoica Tica, David Ghiberdic, Vladimir Necula

Zero-Knowledge verification that LoRA fine-tuning followed safety constraints — without revealing training data or model weights.

Review Project

View Related Sprint

Reviewer's Comments

I like the idea of applying ZK proofs to LoRA fine-tuning compliance. The focus on LoRA, where proofs circuits are tractable, shows good feasibility judgment. The implementation appears to be conventional Python norm-checking rather than actual ZK-SNARK circuits, though--room for development here.

Training verification is an important problem and exploring zero-knowledge proofs for this purpose is interesting. But most most important question remains unaddressed imo: Who ensures the weights committed to the proof are the actual training weights? I would suggest zooming in on that problem.

Some other comments: The weight norm bound needs justification as a safety constraint. Why does ‖ΔW‖_F ≤ C imply safe fine-tuning? Is there literature on that? If so it would be good to reference it. Also the differential privacy invariant would need much more explanation.

Generally, I suggest to zoom in on exactly one problem (e.g. verifying which base model was used XOR verifying model weight updates stay below a specific weight bound) but then going much more in depth on that one: What are all the steps where trust can break down? How can you address all of them? Once there is an argument that a ZK-proof actually fills a gap in a credible trust chain, you can go into full technical depth. But the context is crucial to make sure you're building sth that actually contributes to a solution.

Cite this work

@misc {

title={

(HckPrj) NeuroVer

author={

Justin Stefan Stoica Tica, David Ghiberdic, Vladimir Necula

date={

2/1/26

organization={Apart Research},

note={Research submission to the research sprint hosted by Apart.},

howpublished={https://apartresearch.com}

}

Recent Projects

View All

Feb 2, 2026

Markov Chain Lock Watermarking: Provably Secure Authentication for LLM Outputs

We present Markov Chain Lock (MCL) watermarking, a cryptographically secure framework for authenticating LLM outputs. MCL constrains token generation to follow a secret Markov chain over SHA-256 vocabulary partitions. Using doubly stochastic transition matrices, we prove four theoretical guarantees: (1) exponentially decaying false positive rates via Hoeffding bounds, (2) graceful degradation under adversarial modification with closed-form expected scores, (3) information-theoretic security without key access, and (4) bounded quality loss via KL divergence. Experiments on 173 Wikipedia prompts using Llama-3.2-3B demonstrate that the optimal 7-state soft cycle configuration achieves 100\% detection, 0\% FPR, and perplexity 4.20. Robustness testing confirms detection above 96\% even with 30\% word replacement. The framework enables $O(n)$ model-free detection, addressing EU AI Act Article 50 requirements. Code available at \url{https://github.com/ChenghengLi/MCLW}

Feb 2, 2026

Prototyping an Embedded Off-Switch for AI Compute

This project prototypes an embedded off-switch for AI accelerators. The security block requires periodic cryptographic authorization to operate: the chip generates a nonce, an external authority signs it, and the chip verifies the signature before granting time-limited permission. Without valid authorization, outputs are gated to zero. The design was implemented in HardCaml and validated in simulation.

Feb 2, 2026

Fingerprinting All AI Cluster I/O Without Mutually Trusted Processors

We design and simulate a "border patrol" device for generating cryptographic evidence of data traffic entering and leaving an AI cluster, while eliminating the specific analog and steganographic side-channels that post-hoc verification can not close. The device eliminates the need for any mutually trusted logic, while still meeting the security needs of the prover and verifier.

Feb 2, 2026