Aug 26, 2024
Unsolved AI Safety Concepts Explorer
Tewodros Mesfin
An interactive demonstration that showcases some unsolved fundamental AI safety concepts.
I think the explanatory side is very weak: it feels very much like just “AI → bad” and doesn’t give intuitions for how things work.
Nice presentation. I think this demo lacked a way to convey a gears-level understanding of the misspecifications and misalignment it set out to illustrate. For each failure mode presented, it was not clear what actually happened or why.
Cite this work
@misc {
title={Unsolved AI Safety Concepts Explorer},
author={Tewodros Mesfin},
date={8/26/24},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


