Mar 23, 2026
RAG Faithfulness evaluator
Mouad Benmansour
RAG is everywhere now, and organisations adopt it precisely because it is supposed to be accurate. This tool holds RAG systems accountable to that promise, monitoring every output at sentence level, visualising how faithfulness drifts across an answer, and producing a named diagnosis explaining what went wrong, where, and what to fix. Built on a hybrid of semantic similarity and lexical overlap, with an LLM diagnostic layer constrained to a human-authored taxonomy of six known drift patterns. It catches what aggregate scores miss and explains what they cannot
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) RAG Faithfulness evaluator
},
author={
Mouad Benmansour
},
date={
3/23/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


