Aug 26, 2024
Unsolved AI Safety Concepts Explorer
Tewodros Mesfin
n interactive demonstration that showcases some unsolved fundamental AI safety concepts.
Nice presentation.I think this demo lacked a way to convey gear level understanding in the mispecifications and misalignment it wanted to. It was not clear in each failure mode presented what actually happened or why.
I think explaining side is very weak: it feels very much just like “AI → bad” and doesn’t give intuitions for how things work.
Cite this work
@misc {
title={
Unsolved AI Safety Concepts Explorer
},
author={
Tewodros Mesfin
},
date={
8/26/24
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


