The cost formula: per-token embedding cost
Embedding cost for both OpenAI and Cohere follows the same simple formula — per-token input, no output bill:
``` embedding_cost = (total_input_tokens / 1_000_000) × price_per_1M Examples at 100M tokens: OpenAI text-embedding-3-small: 100 × $0.02 = $2.00 OpenAI text-embedding-3-large: 100 × $0.13 = $13.00 Cohere embed-v4.0: 100 × $0.12 = $12.00 Voyage voyage-3-large: 100 × $0.18 = $18.00 ```
Token count estimation: 1 token ≈ 4 characters of English. A 200-word document description is ~267 tokens. A 1M-row product catalog with 200-word descriptions is ~267M tokens.
The critical difference between OpenAI and Cohere that is invisible in this formula: **Cohere's context window is 128,000 tokens; OpenAI's is 8,192 tokens.** If any of your documents exceed 8,192 tokens (~6,000 words) and you need a single embedding for the full document, OpenAI cannot produce it without truncation. Cohere can embed the whole document. This is not a minor specification difference — it changes what you can build.
For the downstream costs after you have your embeddings, see how much RAG queries cost.