What makes a summary prompt produce a faithful result?
Summarization means compressing a source into a shorter form that preserves its key information. The two things that go wrong are length (the model returns three dense paragraphs when you wanted three bullets) and faithfulness (the summary adds a conclusion, statistic, or nuance that the source never stated). Faithfulness failures are the dangerous ones because they look authoritative.
A faithful summary prompt does three jobs. It tells the model **who the reader is** so the level and vocabulary are right. It **fixes the length precisely** so output is predictable. And it **grounds the model to the source** with an explicit rule: summarize only what is present, attribute nothing that isn't there, and flag the source if it's silent on a point.
Modern models are strong summarizers, but they still pattern-match toward a 'complete-sounding' answer. The provider prompt guides — OpenAI's prompt engineering guide and the Anthropic prompt engineering overview — both emphasize explicit instructions and grounding over hoping the model infers your intent.