Glossary
Quick, plain-English definitions for LLM pricing terms.
- Token: The basic unit of text an LLM processes. Roughly 0.75 words in English.
- Context window: The maximum number of input tokens a model can process in one request.
- Input token: A token you send to the model. Usually 3-10x cheaper than output.
- Output token: A token the model generates. The expensive half of the bill.
- Prompt cache: Reusing a prompt prefix across calls for a large input-price discount.
- Reasoning token: A hidden token used for the model's internal thinking. Billed but not shown.
- Tool call: When the model requests that your code run a function with specific arguments.
- Batch API: Submit a large job offline at a 50% discount. Completes within 24 hours.
- Rate limit: The maximum tokens or requests per minute you're allowed, based on your usage tier.
- Cached input: Input tokens already stored in the provider's prompt cache.
- Mixture of Experts (MoE): An architecture where only a subset of model weights activates per token.
- Frontier model: A top-tier flagship from each major lab, e.g. Claude Opus, GPT-5, Gemini Pro.
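
To show how these terms combine on a bill, here is a minimal sketch of a per-request cost calculation. All prices and the 90% cache discount are hypothetical placeholders, not any provider's real rates; only the 50% batch discount comes from the definition above.

```python
# Hedged sketch: combining input, cached-input, output, and batch pricing.
# All dollar figures below are assumed placeholders, not real provider rates.

PRICE_PER_MTOK = {
    "input": 3.00,         # $ per 1M uncached input tokens (assumption)
    "cached_input": 0.30,  # $ per 1M cached input tokens (assumed 90% discount)
    "output": 15.00,       # $ per 1M output tokens (assumption)
}

def request_cost(input_tokens, cached_tokens, output_tokens, batch=False):
    """Cost in dollars; cached_tokens is the cached portion of input_tokens."""
    uncached = input_tokens - cached_tokens
    cost = (
        uncached * PRICE_PER_MTOK["input"]
        + cached_tokens * PRICE_PER_MTOK["cached_input"]
        + output_tokens * PRICE_PER_MTOK["output"]
    ) / 1_000_000
    if batch:
        cost *= 0.5  # Batch API: 50% discount, per the glossary entry
    return cost

# 100k-token prompt with 80k cached, 2k tokens of output:
print(request_cost(100_000, 80_000, 2_000))  # → 0.114
```

Note how the 2,000 output tokens ($0.030) cost half as much as the 20,000 uncached input tokens ($0.060): the "expensive half of the bill" at a fraction of the volume.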