Private Intelligence Series · Book 2
How to Read, Steer, and Catch a Mind You Own Lying
The most powerful tool most of us have ever used is also the one we understand the least. You type to an AI, something brilliant comes back, and if you ask why it said what it said, the honest answer — even from the people who built it — is a shrug. Text in, text out, and in between a black box. We have learned to judge these minds the way we judge a stranger at a poker table: by their words and their tells, never by what is actually in their hand.
The Glass Box is the field manual for opening it. Its argument is one sentence: a model you own is the only mind you can actually open — and once it's open you can read what it is thinking, steer what it cares about, and catch it lying from the inside.
Drawn from a real interpretability lab the author built on a single 16GB consumer GPU, fully offline and for almost nothing, this book takes you the whole way:
This is not a survey of interpretability research and it is not a paper. It is a field report from someone who opened a real model on a normal machine — and who is careful, throughout, about where the seeing stops and the guessing begins ("above chance, not an oracle"). The second book in the Empire Publishing Private Intelligence series, after Private Intelligence. For practitioners, the AI-safety-curious, and anyone done taking a black box on faith.