Periagoge
Concept
2 min readself knowledge

Understanding Tokens: Why Your AI Costs Money and Has Limits

A token is roughly a word or word fragment, and every exchange with AI consumes tokens from a limited budget—which is why conversations have hard limits and why API access costs money. Understanding tokens demystifies pricing, explains why certain requests feel expensive or cheap, and reveals how to structure prompts for efficiency.

Hypatia
Why It Matters

Think of tokens like subway fare. You don't pay per mile—you pay per journey segment. In AI, you don't pay per word; you pay per token, a small unit of text that roughly equals one word (but not exactly).

Here's what happens behind the scenes: before an AI processes your question, it breaks it into tokens. The number of tokens depends on language and complexity. "Hello" might be one token. "HelloWorld" might be three. It's not always intuitive, which is why pricing can feel random if you don't understand tokens.

Where tokens come into play:

Most AI tools that charge money (like ChatGPT's API or Claude's paid tier) charge based on tokens, not character count. They count tokens in two directions: input tokens (what you send) and output tokens (what the AI generates). Most tools charge differently for input and output—output is usually more expensive because it requires more processing.

This means a long prompt with a short answer costs less than a short prompt with a long answer. Your 2,000-word essay prompt might use 300 input tokens but 2,000 output tokens. You're mainly paying for the essay you got back.

Practical implications:

  • Asking the AI to be concise in your prompt saves money
  • Pasting huge documents consumes tokens—sometimes it's cheaper to summarize first
  • Repeating similar prompts in the same conversation is cheaper (you don't re-send the context)
  • Images count as tokens too, and they're expensive (sometimes 500+ tokens per image)

You can estimate token counts before submitting. Most platforms have a token counter. It's worth checking if you're working with a tight budget or large documents.

Try this: Write a short prompt (10 words) and a long one (100 words) asking for the same thing. Check the token count for each in your AI tool's interface. Notice how input tokens scale with length. Then generate responses to both and compare total cost. You'll see how input and output tokens both add up.

Helpful guides
Hypatia
Daily Life & Decisions
Related Concepts
Peri
Questions about Understanding Tokens: Why Your AI Costs Money and Has Limits?

Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.

Ready to work on Understanding Tokens: Why Your AI Costs Money and Has Limits?

Explore related journeys or tell Peri what you're working through.