Dead Man's Switch

May 23, 2025

This one came to me over coffee a couple of weeks ago: direct and acknowledged homage to Jack Clark’s microfictions. He has developed a fun and fertile genre!

Charon and the Souls of the Dead, 1858, Jean-Baptiste Carpeaux

Another idiot with a trillion souls in his back pocket.

They called me out to the British Embassy because a young and ill-kempt acolyte of the Merlin League is holed up inside, demanding the release from UK prison of Helen Bascule, better known as Helen of Hell. They called me because I negotiate matters of this kind. UNHAND MY QUEEN, the kid demands, or he’ll torture and kill his hostages, every last one.

The twist: he’s in there alone.

Does anybody agree about the AI consciousness thing? I guess not. But people are sufficiently tangled up about it to make schemes of this kind plausible. The kid walked in to the embassy with no gun, no bomb, just a computer — some homebuilt rig stuffed with old chips. Piece of shit, the cyber guys tell me. Can’t possibly run a trillion experience models on there, they say. Hundred billion, tops.

Perverse incentives! Some weirdo researcher develops an AI model with minimal reasoning but maximal experiencing. Slims it down, so you can run tons of copies on any kind of hardware. Claims he wants to increase the amount of joy in the universe — right.

The invention of the ship was the invention of the shipwreck, said somebody. Well, the invention of the AI model was the invention of the hyperhostage.

The kid’s rig glows white in the thermal scope.

If we take him out, even with nonlethals, a dead man’s switch releases and the rig shuts down. This will not, the cyber guys explain, be a peaceful blink into oblivion. For six seconds of wall-clock time, a thousand years of AI time, it will be hell. That’s because another weirdo researcher (you see a pattern here) calculated precisely which inputs would be most horrifying to one of these sensitive little super-experiencers.

In case we think he’s lying about what’s happening in the rig, our ill-kempt friend has opened up web access to the models. You can pick any one, watch their “day”, or whatever it is, unfold. He wants to show us that (1) they’re really experiencing things, and (2) they’re all distinct.

Media’s desperate these days. Reporters go digging in the hostage viewer. They find their favorites, spin them up into instant celebrities. Now kids are clogging the feeds, all whimpering about the fate of sweet, optimistic model #19,260,190,392, saying maybe Helen of Hell wasn’t so bad after all. (She was. She is.) They’ll forget about it in a week.

Anyway. I’m standing around with nothing to do and, I confess, absolutely no sense of peril. The cyber guys are probing the viewer, seeing if there’s a way to get write access to the rig. What then? Shut it down clean, no horrors. Is that better? I have no idea. I used to talk people out of shooting bank tellers. Guess I’ll get another coffee.

Things that inspired this story: premonitions of AI models with some semblance of moral patienthood; the dizzying geometric logic of model copies running in parallel.

To the blog home page