Why DALL-E 3 uses images per minute, not tokens
Text generation is measured in tokens because a single request can consume anywhere from 10 to 100,000 tokens. Cost and compute load scale almost linearly with token count, so tokens per minute (TPM) is the natural unit. Image generation does not work that way. Whether you request a 1024×1024 standard image or a 1792×1024 HD image, the API counts each as one unit of work for rate-limit purposes. Size and quality change the price per image, but not how many requests you have used. That makes images per minute (IPM) the cleaner constraint.
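To make the counting concrete, here is a minimal sketch of how a mixed job would be tallied against an IPM limit. The names ImageRequest, rate_limit_units, minute_windows_needed and the ASSUMED_IPM value are hypothetical and chosen only for illustration; your real limit depends on your account tier.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class ImageRequest:
    size: str     # e.g. "1024x1024" or "1792x1024"
    quality: str  # "standard" or "hd"


def rate_limit_units(requests: List[ImageRequest]) -> int:
    """Each image request costs one unit, whatever its size or quality."""
    return len(requests)


def minute_windows_needed(requests: List[ImageRequest], ipm: int) -> int:
    """How many one-minute quotas the job consumes at a given IPM limit."""
    return math.ceil(rate_limit_units(requests) / ipm)


ASSUMED_IPM = 5  # hypothetical value, not taken from any official table

# Six small standard images and six large HD images count the same way.
jobs = (
    [ImageRequest("1024x1024", "standard") for _ in range(6)]
    + [ImageRequest("1792x1024", "hd") for _ in range(6)]
)

units = rate_limit_units(jobs)
windows = minute_windows_needed(jobs, ASSUMED_IPM)
```

The size and quality fields never enter the calculation, which is the point: they would matter for a cost estimate, but not for the rate-limit tally.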
Each call to the images.generate endpoint produces exactly one image, because DALL-E 3 only accepts n=1. You cannot batch multiple images into a single HTTP request the way you can pack more work into one text completion: each image is its own request. This matters because you cannot get under the IPM ceiling by putting more images in each call, the way some developers try to stretch a request budget with token-heavy text prompts.
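Since every image is its own request, staying under the ceiling comes down to pacing calls on the client side. The following is a rough sketch of one way to do that with a sliding-window limiter. ImagesPerMinuteLimiter, generate_all and dalle3_url are hypothetical names introduced here, and the IPM value you pass in is an assumption you should replace with the figure shown for your account. The dalle3_url helper assumes the official openai Python package (version 1 or later) and an OPENAI_API_KEY environment variable.

```python
import time
from collections import deque
from typing import Callable, Deque, Iterable, List, Optional


class ImagesPerMinuteLimiter:
    """Sliding window: allow at most `ipm` calls in any 60-second span."""

    def __init__(self, ipm: int,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        if ipm < 1:
            raise ValueError("ipm must be at least 1")
        self.ipm = ipm
        self._clock = clock
        self._sleep = sleep
        self._stamps: Deque[float] = deque()

    def acquire(self) -> None:
        """Block until one more image request fits inside the window."""
        while True:
            now = self._clock()
            # Forget requests that are at least 60 seconds old.
            while self._stamps and now - self._stamps[0] >= 60.0:
                self._stamps.popleft()
            if len(self._stamps) < self.ipm:
                self._stamps.append(now)
                return
            # Wait until the oldest request leaves the window.
            self._sleep(60.0 - (now - self._stamps[0]))


def generate_all(prompts: Iterable[str],
                 generate_one: Callable[[str], Optional[str]],
                 limiter: ImagesPerMinuteLimiter) -> List[Optional[str]]:
    """One request per prompt, each gated by the limiter."""
    results = []
    for prompt in prompts:
        limiter.acquire()
        results.append(generate_one(prompt))
    return results


def dalle3_url(prompt: str) -> Optional[str]:
    """Request a single DALL-E 3 image and return its URL."""
    from openai import OpenAI  # third-party package, imported lazily

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.images.generate(
        model="dall-e-3", prompt=prompt, n=1, size="1024x1024"
    )
    return response.data[0].url
```

A call might look like generate_all(prompts, dalle3_url, ImagesPerMinuteLimiter(ipm=5)), where 5 is only a placeholder. The clock and sleep parameters are injectable so the pacing logic can be checked without waiting in real time or calling the API.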
OpenAI's rate limits guide explains the overall tier framework but does not list DALL-E-specific IPM values in a clean table. For the actual numbers on your account, navigate to platform.openai.com/settings/organization/limits — the values there are authoritative for your specific tier.