Someone built a transformer that runs on a Commodore 64. A real one. Two layers, four attention heads, quantized to int8, loaded off a floppy disk and run on a 1 MHz CPU. It takes about 60 seconds per token.
Here it is. Go look at it. I’ll wait.
The project is called Soul Player C64, and the README ends with this: “The future came back for the past. And now it has a soul.” Whoever wrote that deserves a medal. Or at least a warm beverage.

