What Does It Mean to Align AI with Human Values?
December 14, 2022
(Quanta) – The DWIM command was a microcosm of the more modern problem of “AI alignment”: We humans are prone to giving machines ambiguous or mistaken instructions, and we want them to do what we mean, not necessarily what we say. Computers frequently misconstrue what we want them to do, with unexpected and often amusing results. (Read More)