What does "long context" actually mean here?
"Long context" is the amount of text (and other tokens) a model can consider in a single request — its **context window**. A bigger window lets you paste an entire contract, a long research paper, or a whole codebase and ask questions across all of it without splitting the work. For a plain-English primer, see What is a context window? and What is a token in AI?.
Two things matter beyond raw window size. First, **effective recall**: a model can technically accept a huge input but still miss a fact buried in the middle, so test on your real documents. Second, **cost**: long inputs mean lots of input tokens, and price scales with them — model your true input/output ratio against the live Gemini pricing and OpenAI pricing pages before committing.