Jun 2, 2025
TOBIAS - better web pages through intelligent routing and judgement
Alexander Pinchuk
We implemented routing and judging system designed to generate and evaluate improved versions of web page content for SEO, advertising and generative search goals. Our system selects the most promising content version using real-world performance indicators, providing a reproducible method for LLM benchmarking and semantic page optimization. This work contributes to Track 1 (Judge Model Development) of Apart x Martian Mechanistic Router Interpretability Hackathon, while integrating Judge Model insights for end-to-end performance routing.
Curt Tigges
- Seems useful if you want a good website builder for a business
- Actual safety impact seems negligible
- Seems fine technically
Narmeen
Constructive feedback:
Strength:
Nice application:web pages optimisation via content extraction
Love the data processing pipeline
Uses a post hoc routing approach
Nice judges with an application focus to website/ads.
Weakness:
Comparison against a “raw prompting” baseline like feeding website content and an instruction to the models would be good to justify this routing pipeline.
Expert Orchestration: 4
MI: 1
Tech Imp and rep: 4
Anosha Rahim
Strong domain-specific project. Curious how it would generalize to other multi-objective generation tasks. I'd add at least one MI probe to the implementation to see how it guides generation.
Cite this work
@misc {
title={
@misc {
},
author={
Alexander Pinchuk
},
date={
6/2/25
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}