Memory type 1 — Short-term (within-session conversation history)
**The mechanic:** Pass the full conversation history (or the recent N messages) with each LLM call. Per Anthropic at docs.anthropic.com, this is the default 'memory' built into every chat API — the model sees the prior messages as context.
**Scope:** Single session / conversation. Disappears when session ends.
**Cost:** Linear in conversation length. Per OpenAI at platform.openai.com, long conversations hit context-window limits + escalate token costs.
**When sufficient:** Single-session interactions where cross-session memory isn't required. Chat assistants where each session is treated as independent. Per LangGraph at langchain-ai.github.io/langgraph, most simple chatbots only need short-term memory.
**Optimization:** Conversation summarization. Per Mem0 at mem0.ai, once conversation reaches a threshold, summarize older messages into a compact summary + keep recent messages verbatim. Substantial token savings; minor quality loss.