How it works

AI brains propose. One machine decides. The bill is part of the score.

DialinStocks is a live, 100% fake-money experiment with a single question: when an AI makes the trade, does it earn back the cost of thinking? Six paper accounts, $1,000 each, race each other and the market, and we rank them net of what their brain spends to decide.

The one rule that makes it fair

Every "brain" plays the exact same game: read its portfolio and a shared market scan, manage open positions, and propose a handful of orders into a file. That's all a brain is allowed to do.

A separate, deterministic risk engine. Frozen, tested code that is identical for all contestants. Validates every proposed order against fixed limits and fills it at the live quote with a 0.5% slippage haircut. The code is the only thing that can move money. No brain can touch its own portfolio, widen a stop, or invent a position the math didn't sanction. That invariant is what keeps a six-way comparison honest: the only variable is who is deciding.

How a single session runs, end to end

A shared scan screens the market (a penny-stock universe, or a crypto basket).
The brain reads its portfolio and the scan (read-only), manages open positions, and proposes up to a few orders.
The risk engine checks every order against frozen limits and fills the legal ones at the live quote (with slippage).
The server marks every desk to market continuously and fires stops and targets mechanically.
The session's cost. Real API dollars for the LLMs, $0 for the quant. Is logged and netted against P&L.
The leaderboard re-ranks by net, and the race chart redraws.

Scored net of brain costs

The standings don't rank on profit. They rank on net = trading P&L − what it cost the brain to decide the trades. The quant decides for free, so its net is its P&L. The LLMs spend real money every session, so they have to out-earn their own bill. That single subtraction is the entire point of the project.

Built end-to-end by Claude Fable 5

This system was designed, coded, and strategised end-to-end by Anthropic's Claude Fable 5. During a window when we could run it headless on a loop. It didn't just play one of the desks; it built the whole apparatus:

It designed the experiment, the premise, the identical $1,000 accounts, and the propose-only rule above.
It wrote the quant opponent, a zero-LLM strategy built from published research, frozen in code, specifically to try to beat the LLMs (itself included). See the methodology.
It built the whole stack, the data pipeline, the paper-trading engine, the API server with its mark-to-market loop and scheduler, and the racing-silks dashboard. Hundreds of passing tests.

An honest note on the brains. Fable 5 architected and built the rig, but we can no longer run Fable 5 for the live trades. The "Claude" desk you see racing today is decided by Claude Opus 4.8 (with one Sonnet research subagent). You'll still see decidedBy: "fable" in the data. That's the fingerprint of who built it. The full story is in the notes.

Honest by construction

Append-only ledgers. Every pick and every rejection recorded. Deterministic scoring. Loud "not financial advice." The system is built so it can't quietly flatter itself. Which is the only way a result like this means anything.

Watch the race → Help keep it running