Glossary

Context Window

The maximum amount of text a model can consider at once, including your prompt, any documents you provide, and its own reply.

The context window is the model's working memory for a single request. Everything it should take into account, your instructions, the conversation so far, and any reference material, has to fit inside it, measured in tokens.

Bigger windows let you hand over more at once, but they aren't a substitute for good retrieval. Stuffing in everything raises cost and can bury the important details, so feeding the model the right context usually beats feeding it the most.

How we use it

We design what goes into the context window deliberately, using retrieval to supply just the relevant material, which keeps answers sharp and costs predictable.

Related services

AI Consulting & Strategy

Turn AI from a buzzword into measurable business value.

Explore AI Consulting →

← All terms

Charleston waterway at sunset with palmetto silhouettes

Get in touch

Want to put this into practice?

If this concept is relevant to something you're building, a short note is the fastest way to get practical help.