Discussion about this post

User's avatar
Herbie Bradley's avatar

Good piece! But to me this is all entirely consistent with a position of: we should expect models in the current paradigm to automate much of the "junior RS" work in the next few years, but not higher level skills like research taste and creativity. Of course, one can always formulate AI research as a pure search problem (and there is a lot of prior literature doing that) and say that if we just throw enough compute we'll make progress and you don't need research taste. But the search space is huge and creativity is an insanely powerful shortcut, such that if models are incapable of research-grade creative insights we should expect a modest speed boost overall. A framing I like is just that LLMs are a different "shape" of intelligence to humans, and so we should expect comparative advantage.

My own impression from doing AI research is that there is significant creativity involved beyond what models should be expected to be capable of, when it comes to picking directions, coming up with a vision for what's possible & backchaining, etc (and rarest of all, putting together a coherent long-term research agenda that fills a missing gap with a tractable sequence of breakthroughs to get there). Others, including Anthropic researchers, clearly also think these skills are part of research: https://x.com/_arohan_/status/2050978424446235090

So overall it seems to me this is not such a huge shift as you're implying, so much as a distribution shift in tasks carried out by researchers very comparable to that going on in SWE over the coming years. In SWE, the tasks that make up the job are shifting such that SWE is becoming more of a SWE-PM hybrid, and juniors are not automated because they can just shift their tasks towards more higher level of abstraction things like system design (and learn faster through greatly increased velocity).

I think a lot of analyses, including arguably this one, underrate that the tasks that make up jobs are changing all the time in reaction to AI's deployment and other people in the market.

Side note: I would also say that it currently appears that the skill ceiling and complexity ceiling in software engineering is higher than in AI R&D, and AI R&D necessarily involves significant software engineering, so I would on balance expect that if AI R&D is entirely automated then scaling inference serving another 1000x would still require human experts, and the rest of software engineering (outside of AI labs) to heavily require humans.

On point 2, I agree though noting that Amdahl's Law isn't quite the right analogy because it usually assumes task decomposition is fixed and is a result about parallelization, while the O-ring automation intuition from economics is in some sense more general and closer to what you're describing IMO (https://www.nber.org/papers/w34639)..

Finally, on your point 3, I broadly agree that short term more businsses may be opex heavy by spending on AI, but fundamentally if a service is fully automatable, its cost should fall to slightly more than the marginal cost of the required intelligence, which will probably be very low in the future—and the share of GDP of these automatable services should collapse. I don't expect fully automated companies to be more than a small fraction of the economy, or peripheral, because most economic value from new technologies is driven by adoption of new services by large incumbents, and the distribution of tasks required to sell that and drive distribution is very different to any of the automatable skills in AI research and heavily loaded on human social bargaining. Anthropic's own private equity partnership in some sense a bet against a largely automated economy.

Steven Adler's avatar

Thanks for writing this! Slight typo: "thus teaching it that teaching is good" the second 'teaching' should be 'cheating'

41 more comments...

No posts

Ready for more?