RC RANDOM CHAOS

Opus 4.7's New Tokenizer Inflates Text Costs ~40% Despite Unchanged Pricing

· via Simon Willison

Original source

Claude Token Counter, now with model comparisons

Simon Willison →

Simon Willison extended his Claude Token Counter to compare token usage across models, surfacing a practical consequence of Opus 4.7’s updated tokenizer. Anthropic’s own announcement flagged a 1.0-1.35x input inflation, but Willison’s test on the Opus 4.7 system prompt came in at 1.46x against Opus 4.6. With pricing held flat at $5/$25 per million input/output tokens, that translates to roughly 40% higher real-world spend for equivalent text workloads.

Images and documents tell a more nuanced story. A high-resolution PNG produced a 3.01x token count on 4.7, but that gap is driven by Opus 4.7’s expanded vision support for images up to ~3.75 megapixels - at comparable resolutions, token counts are effectively identical. A 30-page text-heavy PDF landed at just 1.08x, suggesting the tokenizer penalty is concentrated in raw prose rather than mixed-media inputs.

The takeaway for teams budgeting against Claude: tokenizer changes are a hidden price lever. Nominal per-token pricing stayed flat across the 4.6-to-4.7 jump, but actual cost-per-task shifts materially depending on input type, and anyone forecasting spend should re-benchmark against representative payloads rather than trusting headline rates.

Read the full article

Continue reading at Simon Willison →

This is an AI-generated summary. Read the original for the full story.