2 Comments

"Don't build agents" feels like capitulating that we cant even come close to solving the potential harms of AI. Maybe I'm just too much of an RL researcher.

Expand full comment

I wonder what the minimum prompt is to get the equivalent of self-align. "Make yourself useful" sometimes works with humans

Expand full comment