What makes a model good at being an AI agent?
An "agent" is a model that doesn't just answer once — it **plans, calls external tools** (search, code execution, APIs, databases), **reads the results, and decides what to do next**, often over dozens of steps. Four durable capabilities separate good agent models from mediocre ones: reliable **function calling / tool use** (calling the right tool with correctly-formatted arguments), **instruction following** under long context, **error recovery** (noticing a failed call and adapting rather than looping), and a **reasoning/thinking mode** for planning hard tasks.
Crucially, the base model is only part of the story. Agent reliability is dominated by **tool design** (clear schemas, good error messages), **prompt structure** (see how to write a system prompt), and guardrails against prompt injection when tools touch untrusted data. A great model on a sloppy harness will still fail; a careful harness lifts both Claude and GPT-5.5.