Question 1

How is LLM / token cost calculated?

Accepted Answer

Almost every API bills per token, with separate prices for input (your prompt) and output (the model’s reply). Cost per request = (input tokens ÷ 1,000,000 × input price) + (output tokens ÷ 1,000,000 × output price). Multiply by requests per month for the monthly bill. Output tokens usually cost several times more than input, so long replies dominate.

Question 2

Why is my OpenAI / Claude / Gemini bill higher than this estimate?

Accepted Answer

Real bills add things a ballpark can’t: system prompts and context resent on every call, retries, tool-call round trips, image and audio tokens, and reasoning tokens on thinking models. Long shared context that you resend each turn is the usual surprise. Treat this as a floor, and enable prompt caching to bring resent context down.

Question 3

How do I reduce LLM API costs?

Accepted Answer

In order of impact: route easy requests to a smaller model and keep the frontier model for the hard ones; cache the stable prompt prefix so resent context is read cheaply; trim output tokens (they are the expensive side); and move non-interactive work to the batch API for roughly half price. Together these often cut a bill by 50% or more.

Question 4

Which LLM provider is cheapest?

Accepted Answer

It depends on the model tier and your input/output mix, not the provider name. Small models from any provider are cheap; frontier models are far pricier and differ in how much output they generate for the same task. Because output is the expensive side, a model that answers concisely can beat a nominally cheaper one that rambles. Switch providers and models in the calculator above to compare your own workload.

Question 5

Is this calculator accurate?

Accepted Answer

It’s a good directional estimate, not a quote. Claude prices are current; the other providers use representative list prices for the common tiers, refreshed periodically. For an exact number tied to your prompts, traffic and caching, the fastest path is a short call.

LLM cost calculator

Breakdown

3 quick wins to cut this

Want the real number, and a plan to cut it?

Frequently asked questions