What Does It Mean to Align AI With Human Values?

Many years ago, I learned to program on an old Symbolics Lisp Machine. The operating system had a built-in command spelled “DWIM,” short for “Do What I Mean.” If I typed a command and got an error, I could type “DWIM,” and the machine would try to figure out what I meant to do. A surprising fraction of the time, it actually worked. The DWIM command was a microcosm of the more modern problem of “AI…
