What AI actually costs: tokens, credits & messages, explained.
Every AI company prices differently — tokens, credits, "messages," per-seat — and the names are confusing on purpose. Once you understand the three or four models underneath, you can answer the only question that matters: what is this actually going to cost me? Here's the plain-English version, plus what a single prompt really runs you on each tool.
01 The 4 ways AI gets priced
Strip away the branding and almost every AI uses one of four pricing models:
- Per-token (pay-per-use / API). You pay for the text in and out, measured in "tokens." This is how the APIs charge — you pay only for what you use.
- Flat subscription with usage caps. The $20/month apps (ChatGPT Plus, Claude Pro, etc.). You pay a fixed price and get generous-but-limited usage — measured in "messages," not dollars.
- Credits. A prepaid bucket you spend down — one action costs N credits. Common for image/video generation, and now for agentic work: Microsoft's Copilot Cowork bills $0.01 per credit, where a single multi-step task runs anywhere from ~70 to 1,500+ credits.
- Per-seat. Business pricing — a flat monthly fee per employee (e.g., Microsoft 365 Copilot). The company pays per person, not per use.
Different units make tools hard to compare. "$20/month" feels concrete; "$3 per million tokens" feels abstract; "1,000 credits" feels generous. Same money, different framing. Once you can convert between them, the marketing stops working on you.
02 What's a "token," really?
A token is just a chunk of text — roughly ¾ of a word in English. "Learning" might be one token; "antidisestablishmentarianism" is several. A rule of thumb: ~750 words ≈ 1,000 tokens.
Every prompt has two token counts that both cost money:
- Input tokens — everything you send (your prompt + any document or context).
- Output tokens — everything the AI writes back. Output is almost always priced higher than input — often 4–5× — because generating is the expensive part.
So a "cost per prompt" = (your input tokens × input price) + (the reply's output tokens × output price). That's the whole formula. Everything below is just plugging in numbers.
03 What each tool charges (June 2026)
Here's the honest, side-by-side. "Pay-per-use" is the API price per million tokens (input / output). "Subscription" is the standard individual plan. The last column is roughly what one medium prompt costs on the API (about 800 tokens in, 1,200 out).
| Tool | Pay-per-use (per 1M, in / out) | Subscription | ~1 medium prompt |
|---|---|---|---|
| ChatGPT (GPT-5.2) | $1.75 / $14 | Plus $20/mo · Pro $200/mo | ~$0.018 |
| Claude (Sonnet 4.6) | $3 / $15 | Pro $20/mo · Max $100–200/mo | ~$0.020 |
| Gemini (3.5 Flash) | $1.50 / $9 | Google AI Pro $19.99/mo | ~$0.012 |
| Grok (4.1 Fast) | $0.20 / $0.50 | SuperGrok $30/mo | ~$0.0008 |
| Perplexity | Sonar API (per request + tokens) | Pro $20/mo | query-based* |
| Copilot (M365 chat) | n/a — per seat | $18–30 / seat / mo | included |
| Copilot Cowork (agentic) | $0.01 / credit · ~70–1,500+ credits per task | usage-based (from Jul 1, 2026) | ~$1–$15+/task |
| OpenClaw | free software + the model's API you plug in | self-hosted | = your model's price |
*Perplexity prices around "searches," not raw tokens — Pro gives you a large daily allowance; the Sonar API charges per request plus tokens. Copilot is per-seat, so individual prompts have no marginal cost. Premium reasoning models (GPT-5.4, Claude Opus) cost more than the figures above. Prices as of June 2026 — always check current rates.
04 What a real prompt costs (worked examples)
Numbers feel abstract until you map them to real work. Using mid-range pricing:
- A quick question + short answer (≈250 in / 350 out): a fraction of a cent — well under $0.01 on any model.
- Summarize a 2-page document (≈800 in / 1,200 out): about $0.01–$0.02 on most models; under a tenth of a cent on Grok 4.1 Fast.
- Analyze a long report (≈2,500 in / 3,500 out): roughly $0.05–$0.07 on a mid model; more on premium reasoning models.
Individual prompts are cheap — pennies. The cost only adds up at volume (hundreds of prompts a day) or with premium models on long documents. For normal daily use, a flat $20 subscription is often more than your usage would cost on the API — you're paying for convenience and unlimited peace of mind, not because the usage is expensive.
The pennies-per-prompt math above is for chat. When AI runs a whole multi-step job for you — an agentic task — it costs far more, because one task is dozens of model calls under the hood. Microsoft Copilot Cowork prices this with credits: $0.01 each, and one task runs ~70 credits (light, under $1) to 1,500+ (heavy, $15+). Usage billing starts July 1, 2026. So "AI is basically free per prompt" is true for chatting — and very much not true for agents doing real work. Always know which one you're paying for.
05 Subscription or pay-per-use — which is cheaper for you?
The honest decision rule:
- Light-to-moderate user (a few to a few dozen prompts a day)? Your API usage would likely cost less than $20/month — but the subscription buys you the polished app, no metering anxiety, and features like voice and image gen. Most people happily pay for that.
- Heavy daily user (hundreds of prompts, or long documents all day)? A flat subscription is the better deal — unlimited beats per-token once you cross a few dollars of daily usage.
- Builder / automating things? The API (pay-per-use) is the right tool — you only pay for what runs, and you can pick a cheap model (Grok 4.1 Fast, Gemini Flash) for high-volume jobs.
Don't guess — plug your actual usage into our free calculator and see the real monthly cost across every tool, subscription vs pay-per-use, side by side.
Open the AI Cost Calculator →Final challenge: price your own AI habit
Estimate your real usage: how many prompts a day, and how long are they (a quick question, a doc summary, a long analysis)? Run it through the calculator. You'll get a clear answer to whether your subscription is worth it — and which tool is cheapest for the way you actually work.
What you can do now
- Recognize the 4 pricing models: per-token, flat subscription, credits, per-seat
- Read a token price: per million, input vs output (output costs more)
- Estimate a prompt's cost — input tokens + output tokens × their prices
- Know that individual prompts are pennies; cost lives in volume and premium models
- Decide subscription vs pay-per-use based on your real usage, not vibes