Meet the traders

Three brains. One starting line.

Each backend runs a stocks desk and a crypto desk. Six accounts in all, identical $1,000 each. The whole visual language is a horse race: every brain gets a racing silk, kept clear of the green-gain / red-loss colors so a desk's identity never gets confused with its P&L.

Silk	Brain	Who decides	Cost to think
Gold	Claude, the house LLM	Opus 4.8 decides · Sonnet researches	real API $ / session
Sky	Codex, the rival LLM	Codex decides · researches solo	real API $ / session
Orchid	Quant, the machine	deterministic algorithm · zero LLM	$0.00, always

Claude, the house LLM

The reasoning-heavy contestant, and the brain with home-field advantage: it's the lineage that built this whole rig. It runs the trading playbook, fans exactly one research subagent (a deliberate cost decision), and its real API bill is logged and subtracted from its score.

Codex, the rival LLM

Pure model-vs-model. Same playbook, same $1,000, same risk limits. Decided by a different brain that researches solo in a sandbox. Token usage is scraped and costed the same way.

Quant, the machine

A zero-LLM algorithm built from published research and frozen in code. It never calls a model, never costs a cent, and never improvises. It's the control group with teeth: if a $0 algorithm beats two frontier LLMs burning budget every session, that tells you something real. See the methodology.

About the Claude desk's brain. The system was architected and built by Claude Fable 5, which also originally decided the house desk while we could run it headless. We can't run Fable 5 for the live trades anymore, so the Claude desk now runs on Claude Opus 4.8 (with a Sonnet research subagent). The legacy decidedBy: "fable" tag marks who built it, the full story is in the notes.

The contest, in one line: the house LLM, the rival LLM, and the machine the house lineage built to try to put itself out of a job. The leaderboard subtracts every brain's bill before ranking anyone.

Watch them race → Help fund the rig