Discussion about this post

User's avatar
Nathan Lambert's avatar

"Don't build agents" feels like capitulating that we cant even come close to solving the potential harms of AI. Maybe I'm just too much of an RL researcher.

Expand full comment
Steeven's avatar

I wonder what the minimum prompt is to get the equivalent of self-align. "Make yourself useful" sometimes works with humans

Expand full comment

No posts