When you chat with ChatGPT or Claude, the model works within a context window: a fixed-size buffer that holds your conversation plus the system instructions.
## What Are Tokens?
Context is measured in tokens, not words. In English, one token is roughly 4 characters, or about ¾ of a word. Paste text into OpenAI's Tokenizer to see exactly how it splits.
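The 4-characters-per-token rule of thumb can be sketched as a quick estimator. This is only a heuristic; exact counts depend on the model's tokenizer (e.g. OpenAI's tiktoken library), and the function name here is illustrative:

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~4 characters-per-token rule of thumb."""
    return max(1, round(len(text) / 4))

# A 400-character string is roughly 100 tokens.
print(estimate_tokens("a" * 400))  # → 100
```

Use an estimate like this for budgeting; switch to the real tokenizer before trusting counts near the limit.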
## Context Limits by Model
| Model | Context window | Approx. pages |
|---|---|---|
| GPT-3.5 | 16K tokens | ~25 |
| GPT-4 Turbo | 128K tokens | ~200 |
| Claude 3.5 | 200K tokens | ~300 |
| Gemini 1.5 | 1M tokens | ~1,500 |
## Why It Matters
**Forgetting.** When a conversation exceeds the window, the oldest messages are dropped, so the model genuinely cannot see them. Research also suggests models attend less to content in the middle of long contexts, so even material inside the window can be effectively overlooked.
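Dropping the oldest messages is typically done with a sliding window over the chat history. A minimal sketch, assuming messages are role/content dicts and you supply your own token counter (all names here are illustrative):

```python
def trim_to_window(messages, max_tokens, count_tokens):
    """Evict the oldest non-system messages until the history fits the budget.

    messages: list of {"role": ..., "content": ...} dicts, oldest first.
    count_tokens: callable returning a token count for one message's content.
    """
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    while rest and sum(count_tokens(m["content"]) for m in system + rest) > max_tokens:
        rest.pop(0)  # drop the oldest turn first; keep system instructions
    return system + rest
```

Keeping system messages pinned mirrors what chat products do: instructions must survive even when early turns are forgotten.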
## Strategies
- **Summarize periodically.** Ask the model to compress a long discussion into a short summary, then continue from that summary.
- **Front-load key info.** Put the most important context at the start of the prompt.
- **Use RAG.** With retrieval-augmented generation, retrieve only the relevant chunks instead of pasting everything into the window.
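The RAG strategy above hinges on a retrieval step: score each stored chunk against the query and keep only the top few. A minimal sketch using word overlap as a stand-in for real embedding similarity (production systems use vector embeddings; these function names are illustrative):

```python
from collections import Counter

def score(query: str, chunk: str) -> int:
    """Word-overlap score: a toy stand-in for embedding similarity."""
    q, c = Counter(query.lower().split()), Counter(chunk.lower().split())
    return sum((q & c).values())

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most relevant to the query."""
    return sorted(chunks, key=lambda ch: score(query, ch), reverse=True)[:k]

docs = [
    "Tokens are the units models read; one token is about four characters.",
    "The context window is a fixed budget of tokens per request.",
    "Bananas are rich in potassium.",
]
print(retrieve("how big is the context window in tokens", docs, k=2))
```

Only the retrieved chunks are pasted into the prompt, so the context budget is spent on relevant material instead of the whole corpus.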