Image generation pricing only tells half the story for content teams in 2026, because most production workflows now mix stills with short video clips. Video models price on a fundamentally different unit — per second of output rather than per image — and the per-second number lands 50-200x higher than the per-frame cost of an equivalent image model. That gap matters once you start building social ads, product explainers, or storyboard previews, because the budget math no longer carries over from image work.
OpenAI Sora 2 sits at the premium end of the API market. Standard 1080p output is priced around $0.30 per second of finished video; the 720p tier drops to roughly $0.10 per second. A 10-second 1080p clip runs $3.00, a 30-second clip runs $9.00, and a one-minute clip lands at $18.00. The 720p tier turns the same one-minute clip into $6.00 — a meaningful saving when the clip is destined for vertical social where the viewer will never see the difference between 720p and 1080p on a phone screen.
Google Veo 3 prices by clip rather than by raw second. A standard-quality 8-second clip costs roughly $0.50, and the high-quality tier (sharper detail, better motion coherence, more accurate physics) is around $1.20 for the same 8-second window. That works out to $0.06 per second standard and $0.15 per second high quality — meaningfully cheaper than Sora 2 at the standard tier, with the trade-off that Veo 3 caps individual generations at 8 seconds and you have to stitch clips for longer outputs. A 30-second ad assembled from four Veo 3 high-quality clips runs about $4.80, versus $9.00 on Sora 2 at 1080p.
Runway Gen-4 sells two ways. The subscription tiers are $15/month (Standard, ~625 credits), $35/month (Pro, ~2,250 credits), and $95/month (Unlimited, generation cap plus Explore mode for unmetered runs). Credits convert roughly 1:1 with seconds of generated video at base quality, so the $35 Pro tier effectively delivers about 37 minutes of video monthly — equivalent to $0.016 per second if you actually consume the full allotment. The Runway API is priced separately at roughly $0.05 per second, putting a 30-second clip at $1.50. The subscription wins by a wide margin for an in-house creative team; the API wins if you are embedding generation into a product where end-user demand is unpredictable.
Pika 2.0 is the price leader on the API side at roughly $0.04 per second, putting a 30-second clip at $1.20 and a 10-second teaser at $0.40. The trade-off versus Sora 2 or Veo 3 is shorter individual clip lengths (usually 5-10 seconds before quality degrades) and weaker physics on complex motion. Kling 2.0 splits the difference at around $0.08 per second — a 30-second piece costs $2.40 — with stronger motion coherence and better handling of human subjects than Pika, but less polish than Sora or Veo on cinematic shots.
Two pricing dynamics carry across every model. First, resolution scales cost roughly 3-4x: 1080p typically costs 3-4 times the per-second rate of 720p across Sora, Runway, and Pika. Veo 3 hides this in its standard-versus-high-quality tier split, but the multiplier is comparable. Duration scales linearly within a single generation, but most models charge a small re-initialization overhead per clip, so generating one 16-second clip is marginally cheaper than two 8-second clips when the model supports the longer single take. Second, the cost-per-second on the cheapest video tier (Pika 2.0 at $0.04) is still about 13x the cost of a single Flux Schnell image ($0.003) — and a single second of video is 24-30 rendered frames. The per-frame video cost is actually competitive; the human-visible second is what makes the bill jump.
Worked example for a 30-second product ad at 1080p: Sora 2 direct = $9.00 per take, and most teams run 3-5 takes for a usable result, so the budget lands at $27-45 for one finished ad. Veo 3 stitched from four 8-second high-quality clips = $4.80 per take, $14-24 for a usable result. Runway Gen-4 Pro subscription at $35/month covers 37 minutes of generated material, so a single 30-second ad amortizes to under $0.50 if you are producing more than one ad in the month. Pika 2.0 across six 5-second clips = $1.20 per take, $4-6 for a usable result, with the caveat that quality will not match Sora 2 or Veo 3 on cinematic compositions. Most startup commercial-content workflows in 2026 follow a consistent three-stage pattern: generate base footage cheaply (Pika or Kling), pay for an upscale and motion-smoothing pass (Topaz Video AI at $0.03 per second, or Runway's Upscale endpoint at $0.04 per second), then manually select the keepers. A blended budget of $0.10-0.15 per delivered second of finished 1080p video is achievable, which puts a 30-second ad at $3-5 finished rather than the $27-45 of running everything on Sora 2 directly.
Commercial licensing is the part that catches teams off guard. OpenAI Sora 2 grants commercial rights to API customers but prohibits generating likenesses of real people without explicit consent, and outputs must include the embedded C2PA provenance signal. Google Veo 3 includes a SynthID watermark on every generation, and the commercial license through Vertex AI is straightforward for enterprise customers but more limited on the consumer Gemini tier. Runway Gen-4 grants full commercial rights on the Pro and Unlimited subscription tiers; the Standard tier restricts commercial use to a single account holder. Pika 2.0 grants commercial rights on its paid tiers but explicitly disallows training derivative models on outputs. Kling 2.0, operated by Kuaishou, grants commercial rights but has had documented enforcement inconsistency outside China — most US-based ad agencies treat Kling output as draft-only and re-shoot the keeper with Sora, Veo, or Runway for the deliverable. Always re-read the active terms before a campaign launches, because every one of these vendors has changed terms at least once in the last 12 months.