What Grok-4 actually is (and what makes it different)
Grok-4 is xAI's first model to land on the same benchmark tier as the GPT-5 / Claude Opus 4.7 / Gemini 2.5 Pro frontier. Where Grok 3 lagged the frontier by 6-12 months on most benchmarks, Grok-4 is competitive on reasoning, code, and math — and ahead on a small set of benchmarks that reward real-time data access.
What makes it structurally different: native real-time access to the X (Twitter) firehose via the Live Search tool. Ask Grok-4 'what are people saying about the iPhone 18 launch right now?' and it issues a live X search, processes the recent posts, and synthesizes a response with citations to specific X posts. No other frontier model has this — Perplexity comes closest with web search but doesn't have first-class X data.
Think mode is xAI's name for explicit extended reasoning, comparable to GPT-5's `reasoning_effort: high` or Claude's extended thinking. Enabled via the `reasoning_effort` parameter (`low` or `high` — Grok-4 does not expose `medium`). When enabled, Grok-4 burns reasoning tokens before producing the visible answer, billed at the output rate.
The API is OpenAI-compatible: point the OpenAI Python or Node SDK at `api.x.ai/v1` with an xAI API key, change the model ID to `grok-4`, and existing OpenAI integration code works without further changes. That portability lowers the migration cost from GPT-5 dramatically.