The Glass Box

Name: The Glass Box
Author: Empire Publishing

How to Read, Steer, and Catch a Mind You Own Lying

The most powerful tool most of us have ever used is also the one we understand the least. You type to an AI, something brilliant comes back, and if you ask why it said what it said, the honest answer — even from the people who built it — is a shrug. Text in, text out, and in between a black box. We have learned to judge these minds the way we judge a stranger at a poker table: by their words and their tells, never by what is actually in their hand.

The Glass Box is the field manual for opening it. Its argument is one sentence: a model you own is the only mind you can actually open — and once it's open you can read what it is thinking, steer what it cares about, and catch it lying from the inside.

Drawn from a real interpretability lab the author built on a single 16GB consumer GPU, fully offline and for almost nothing, this book takes you the whole way:

The Black Box Tax — the four things outside-only use costs you: you can't see, steer, keep, or catch a model you only observe
The Concept Dictionary — how a free tool turns a model's tangled internal numbers into thousands of named concepts you can read
Reading the mind — watching concepts light up live, confirmed against a public catalog so they're real, not guesses — and The Dial, turning a concept up until the model can talk about nothing else
The Truth Probe — a reader of the model's internal belief (98.6% accuracy, with the control that proves it real, not an artifact)
The Deception Monitor and Honesty Governor — catching the model assert what it knows is false, and vetoing the lie live (and the hard-won lesson: govern the headline, not every word)
The Kill-Gate Doctrine — the discipline that keeps it all honest, including the experiment that disproved the author's own favorite hypothesis

This is not a survey of interpretability research and it is not a paper. It is a field report from someone who opened a real model on a normal machine — and who is careful, throughout, about where the seeing stops and the guessing begins ("above chance, not an oracle"). The second book in the Empire Publishing Private Intelligence series, after Private Intelligence. For practitioners, the AI-safety-curious, and anyone done taking a black box on faith.

Buy on Amazon · $9.99 → Back to catalog