Discussion about this post

Jisper Plomp

Beautiful Tech Tales, reminded me of Exurb1a; seems like something he would write.

Also, if we figure out safety alignment to prevent AI catastrophe, it will be very interesting to see what "philosophy of life" a positive alignment program will come up with.

The effects of optimizing AI models for preference satisfaction (quick answers over actual understanding, sycophancy) are already worrying in current models, let alone in much more intelligent ones. Curious to see how we will deal with that.

Roso

Very interesting, thank you.
