The architecture of a pause
I believe (?) this is the crispest statement of this kind that Anthropic has yet made …
We believe it would be good for the world to have the option to slow or temporarily pause frontier AI development to enable societal structures and alignment research to keep up with the advance of the technology. The Anthropic Institute will conduct research —
in collaboration with many others — and take actions to help build the systems that a credible slowdown or pause would require. These systems would enable frontier AI developers to verify that others globally have actually stopped or slowed, and that a bad actor could not use the auspices of a coordinated slowdown to jump ahead in secret. If such systems existed, we expect that we would slow down or temporarily pause, if other developers at or near the frontier also did so in a verifiable manner.
… and it’s extremely welcome news. It seems to me self-evident that a slowdown and/or pause would be a wise thing for humanity —