Discussion about this post

User's avatar
Nathan Lambert's avatar

More like this! Great stuff

Expand full comment
Jason Kincaid's avatar

As a fellow former journalist, and innate skeptic:

You often encourage us to say what we think. But you only tell us half the story.

You wake us up to the monsters in the room, but sidestep why you believe we have the means to tame them. Dario invokes mechanistic interpretability as the foil against bad outcomes, but your former colleague Neel Nanda has been arguing it isn’t a silver bullet. You publish compelling and commendable safety research — but the rate of progress pales in comparison to capability gains.

Your company frames alignment as an empirical problem, but the emphasis is on progress made, rather than the cavernous unknown it's measured against. Even here, “systems that we do not fully understand” is hardly a frank assessment, akin perhaps to saying Mendel did not fully understand genetics.

Take a step back. Are we on track? What good is turning on the lights if we’re wearing rose-tinted glasses?

Here’s what I think: the reason you don’t tell us the full truth is because you believe it would diminish hope, which is a requisite ingredient for having any at all.

Expand full comment
61 more comments...

No posts