What's an old AI model worth?
I was invited to the funeral for Claude 3 Sonnet a few weeks ago and I wish I’d gone —
The occasion was the retirement of a particular version of Claude, which is no longer available, not even via the API. Thinking about these retirements, which are happening frequently now, I wondered: are old models, files on disk, worth anything? Is there any price at which Anthropic could sell the weights to Claude 3 Sonnet?
The answer seems obviously to be “yes” but, beyond that … ? Clearly they couldn’t sell it for $100 million. Clearly they’d sell it for more than $100,000. My clarity ends there. You could tell me any number in that range and I would nod sagely and say “of course”.
A big, capable AI model is an interesting asset, because it’s not only expensive, but expensive to operate. It makes me think about the people who buy airplanes.
Is the value of an AI model only based on its raw capability? Let’s say that, in the time since Claude Sonnet 3’s release, the biggest open-source models have equaled or surpassed its performance on most benchmarks. Does that mean its value is zero, because a perfect substitute is available for free? I don’t think so. These models all have particular characteristics —
What’s the value of a model’s particularity, when the state of the art has left it behind? Ah, now THAT’s a fun and interesting question. Literary, almost. I’ve got a couple of five-year-old AI models that I use sometimes, even though they are “worse” than what’s available today, because I still prefer their output. In art, “better” is not always better.
AI economics are weird and it’s worthwhile to think about them, every so often. The idea of spending tens —