Every AI model has a maximum length of input it can process in a single conversation, and once you hit that limit, older parts of the conversation get dropped or summarized to make room for new ones. If your input is very long—huge documents, long conversation history—the AI may not actually see parts of what you're asking it to work with.
Tokens are the small chunks of text that AI models use to read and generate language, and every model has a hard cap on how many tokens it can handle in a single exchange.
Knowing how token limits work helps you avoid truncated responses and lost instructions, so you can split long documents, prioritize key details, and keep your AI interactions running smoothly.
Peri can explain this concept, give practical examples, help you decide whether it applies to your situation, or recommend a journey if appropriate.
Explore related journeys or tell Peri what you're working through.