Question 1

What is a token in AI?

Accepted Answer

A token is roughly 4 characters or 0.75 words in English. The word 'hamburger' is 3 tokens. A typical paragraph is about 100 tokens. Most models have context windows of 128K-200K tokens. Both your prompt (input) and the AI's response (output) count toward your cost.

Question 2

Which AI model is cheapest?

Accepted Answer

For basic tasks, GPT-4o Mini and Claude Haiku offer the lowest costs at under $1 per million tokens. Gemini Flash is also very affordable. For complex reasoning, Claude Sonnet and GPT-4o offer the best value. Opus and frontier models are most expensive but highest quality.

Question 3

Why are output tokens more expensive than input tokens?

Accepted Answer

Output token generation requires more compute than processing input. Each output token requires a full forward pass through the model, while input tokens can be processed in parallel. Output pricing is typically 3-5x higher than input pricing, so shorter prompts for longer outputs cost relatively more.

Question 4

How can I reduce AI API costs?

Accepted Answer

Key strategies: use the smallest model that meets quality needs, cache frequently requested responses, keep prompts concise, set max_tokens to limit output length, use streaming to detect early when a response is off-track, batch API calls when possible, and consider fine-tuning a smaller model for specific tasks.

Question 5

How accurate are token estimates from character count?

Accepted Answer

The 4-characters-per-token estimate is roughly accurate for English text. Code tends to use more tokens per character. Non-English languages (especially CJK) may use 1-2 characters per token. For precise counts, use the model provider's tokenizer (e.g., OpenAI's tiktoken or Anthropic's token counter).

Question 6

How much does it cost to process a full document with AI?

Accepted Answer

A typical page of text is about 250-300 words (~400 tokens). A 10-page document is ~4,000 tokens. Processing it with Claude Sonnet would cost about $0.012 in input tokens. A 100-page report (~40,000 tokens) would cost about $0.12 input + output costs. Books and large datasets can run $1-$10+.

Model	Input $/1M	Output $/1M	Context Window	Best For
GPT-4o	$2.50	$10.00	128K	General purpose
GPT-4o Mini	$0.15	$0.60	128K	Simple tasks
Claude Opus 4	$15.00	$75.00	200K	Complex reasoning
Claude Sonnet 4	$3.00	$15.00	200K	Best value
Claude Haiku 3.5	$0.80	$4.00	200K	Fast & cheap
Gemini 2.5 Pro	$1.25	$10.00	1M	Long context
Gemini 2.5 Flash	$0.15	$0.60	1M	Budget option
Llama 4 (hosted)	$0.20	$0.80	128K	Open source

AI Prompt Cost Calculator