NEW: Real-Time Usage Tracking for AI Agents — track Claude Code, Kimi, Codex & more. Try it free →

CostGoat Logo

CostGoat

LAST UPDATED: MAY 21, 2026

OpenAI ChatGPT API Pricing Calculator - All Text/Chat Models

Calculate OpenAI API costs for current chat and code models per token, per request, and per month. Compare GPT-5.5, GPT-5.4 family, and GPT-5.3 Codex across Batch, Flex, Standard, and Priority tiers.

CalculatorPricing GuideExamplesSave MoneyFAQ

Pricing TLDR

  • $5 free credits for new users (no credit card required)
  • Pay-per-token: GPT-5.4 Nano ($0.20/$1.25) • GPT-5.4 ($2.50/$15) • GPT-5.5 ($5/$30) per million tokens
  • Batch API 50% off • Cached input 90% off on GPT-5.5 and GPT-5.4 families

Official pricing:

OpenAI

Quality Scores: Theozard

OpenAI API Cost Calculator - Monthly Pricing

Pricing Tier

Calculate by

Input Tokens

Output Tokens

API Calls / Month

Quick Examples:

Cost Optimization:

Showing 7 of 8 models

GPT-5.5 (gpt-5.5)

Context

1.05M

Quality

100

Per 1M Tokens

In: $5.00

Out: $30.00

Monthly Cost

$20.00

GPT-5.4 (gpt-5.4)

Context

1M

Quality

94

Per 1M Tokens

In: $2.50

Out: $15.00

Monthly Cost

$10.00

GPT-5.3 Codex (gpt-5.3-codex)

Context

400K

Quality

89

Per 1M Tokens

In: $1.75

Out: $14.00

Monthly Cost

$8.75

GPT-5.4 Mini (gpt-5.4-mini)

Context

1M

Quality

81

Per 1M Tokens

In: $0.75

Out: $4.50

Monthly Cost

$3.00

GPT-5.4 Nano (gpt-5.4-nano)

Context

1M

Quality

73

Per 1M Tokens

In: $0.20

Out: $1.25

Monthly Cost

$0.83

GPT-5.5 Pro (gpt-5.5-pro)

Context

1.05M

Quality

-

Per 1M Tokens

In: $30.00

Out: $180.00

Monthly Cost

$120.00

GPT-5.4 Pro (gpt-5.4-pro)

Context

1M

Quality

-

Per 1M Tokens

In: $30.00

Out: $180.00

Monthly Cost

$120.00

Spinning up GPT-5.5 reasoning runs and watching costs creep?

Track your OpenAI API spend in real-time.

Try free for 7 daysLearn more →

Privacy-first desktop app. No sign-up required.

CostGoat desktop app showing AI agent quotas, usage costs, credit balances, and subscriptions

About OpenAI API

What is OpenAI API?

The OpenAI API provides access to the current chat lineup: GPT-5.5 (latest flagship with built-in reasoning, 1.05M context), GPT-5.4 family (Mini, Nano, Pro variants for cost-optimized production), GPT-5.5 Pro and GPT-5.4 Pro (premium reasoning tiers), and GPT-5.3 Codex (specialized coding model). All models support four pricing tiers (Batch, Flex, Standard, Priority) so you can optimize cost versus latency per request.

  • Current Model Lineup: GPT-5.5: latest flagship with reasoning baked in ($5/$30). GPT-5.4: production-ready at lower cost ($2.50/$15). GPT-5.4 Mini and Nano: cheapest options for high-volume routing and extraction. GPT-5.5 Pro and GPT-5.4 Pro: premium tiers for deep-horizon enterprise reasoning. GPT-5.3 Codex: agentic coding specialist tuned for software engineering tasks.
  • Flexible Pricing Tiers: Batch (50% discount, 24hr processing), Flex (lower cost, variable latency on GPT-5.5 and GPT-5.4 families), Standard (default balanced tier), Priority (2x cost, faster processing). Switch tiers per request to optimize cost vs speed. Pro variants are Standard/Batch only.
  • Advanced Cost Features: Cached input drops 90% on GPT-5.5 and GPT-5.4 families (e.g. GPT-5.5 standard input $5.00 → $0.50 cached). Function calling, structured outputs, and built-in tools (Code Interpreter, File Search, Web Search) work across the lineup. Reasoning tokens are billed as output on reasoning-enabled models. OpenAI is winding down the fine-tuning platform (still available on GPT-4.1 family and o4-mini for existing users).

When to Use OpenAI API

Start with GPT-5.4 Nano or Mini for cost-sensitive routing and extraction, move to GPT-5.4 for balanced production workloads, and reach for GPT-5.5 (or Pro) when you need the highest reasoning quality. Use GPT-5.3 Codex for coding agents. Switch to Batch for non-urgent workloads (50% savings) and Priority for latency-critical user-facing flows.

Ideal for

  • Production chat and agentic apps on GPT-5.5 or GPT-5.4
  • Coding assistants and software engineering agents on GPT-5.3 Codex
  • High-volume classification and extraction with GPT-5.4 Nano or Mini + Batch
  • Deep-horizon enterprise reasoning with GPT-5.5 Pro or GPT-5.4 Pro
  • Non-reasoning ChatGPT-style chat via the chat-latest aliases

Not ideal for

  • Real-time applications requiring <50ms latency (even Priority tier)
  • Simple text parsing where regex or rules-based code is cheaper
  • Workloads needing guaranteed deterministic outputs
  • Reasoning models on trivial tasks (hidden thinking tokens add cost)

OpenAI API Pricing Breakdown

Free Tier

New users receive $5 in free credits with no credit card required. These credits expire after 3 months and are automatically applied to your usage across all models and tiers.

  • Sign up at platform.openai.com - no credit card required
  • Receive $5 free credits instantly upon registration
  • Credits expire after 3 months from grant date
  • Works across all current models (GPT-5.5, GPT-5.4 family, GPT-5.3 Codex)
  • Generate API key and start building immediately

Pricing Tiers Explained

Batch Tier (50% Discount)

Process non-urgent workloads within 24 hours at half price. Example: GPT-5.5 drops to $2.50/$15.00 per M tokens (vs $5/$30 standard). Perfect for data processing, content generation, and analysis tasks that aren't time-sensitive.

Flex Tier (Variable Latency)

Lower cost with variable latency on the GPT-5.5 and GPT-5.4 families (Pro variants excluded). Automatically routes requests during lower-demand periods. Good middle ground between Batch and Standard for non-critical applications.

Standard Tier (Default)

Default tier with balanced cost and performance. Most common choice for production applications. Consistent latency and throughput. All models available. Auto-scales based on usage tier (increases with spend).

Priority Tier (2x Faster)

Higher cost (typically 2x standard) for guaranteed faster processing. Best for user-facing applications requiring quick responses. Available on GPT-5.5, GPT-5.4 (Mini, Nano), and GPT-5.3 Codex (Pro variants excluded).

Cost Optimization Features

Prompt Caching (90% Savings)

Cache frequently used prompts, system messages, or documents. Cached input is billed at 10% of the standard rate on the GPT-5.5 and GPT-5.4 families (e.g. GPT-5.5 input $5.00/M → cached $0.50/M). Cache persists 5-10 minutes. Works for chatbots with consistent system prompts, RAG applications, and repeated context.

Fine-Tuning (Winding Down)

OpenAI is winding down the fine-tuning platform as of May 2026 — it's no longer open to new users. Existing fine-tuning users can still create training jobs on GPT-4.1, GPT-4.1-mini (SFT/DPO) and o4-mini (RFT). Fine-tuned models remain available for inference until their base models are deprecated. The current GPT-5.5 / GPT-5.4 family is not available for fine-tuning.

Understanding Reasoning Tokens

Reasoning-enabled models (GPT-5.5, GPT-5.5 Pro, GPT-5.4 Pro, GPT-5.3 Codex) generate hidden 'thinking' tokens that are billed as output but not returned in the response. These tokens occupy context window space and can significantly increase costs. A 500-token visible response may consume 2000+ total tokens. Use chat-latest aliases or GPT-5.4 Mini/Nano when you don't need extended reasoning.

Prepaid Credits System

Buy credits upfront with no monthly fees. Credits expire after 12 months. Set auto-reload thresholds to never run out. Volume discounts available for enterprise. No setup fees or subscription costs.

Usage Tracking & Limits

Real-time usage monitoring in dashboard. Set spending limits and alerts. Failed requests aren't charged. Rate limits increase automatically with usage tier (based on cumulative spend).

OpenAI API Monthly Cost Estimates

Light Use

$5-40/mo

Personal projects

<1K requests/day

GPT-5.4 Nano or Mini

Medium Use

$40-200/mo

Small apps

1-5K requests/day

GPT-5.4 + Batch tier mix

Heavy Use

$200-1.5K/mo

Production apps

5-20K requests/day

GPT-5.5 with caching enabled

Enterprise

$1.5K+/mo

Large scale

20K+ requests/day

GPT-5.5 Pro on Priority tier

7 OpenAI API Cost Optimization Tips

1

Watch Hidden Reasoning Token Costs

GPT-5.5, GPT-5.5 Pro, GPT-5.4 Pro, and GPT-5.3 Codex emit 'reasoning tokens' for internal thinking that are billed as output but not returned in the response. A response showing 500 output tokens can easily consume 2000+ tokens of actual output billing. Use the chat-latest non-reasoning aliases or GPT-5.4 Mini/Nano when extended reasoning isn't required.

2

Use Prompt Caching for 90% Off Input

Cached input is billed at 10% of the standard rate on the GPT-5.5 and GPT-5.4 families. Example: GPT-5.5 input drops from $5.00/M to $0.50/M. With 80% cache hits on GPT-5.5 at 10K requests/day, savings are around $1,000-$1,500/month. Works best for chatbots with consistent system prompts and RAG applications with repeated context.

3

Leverage Batch Tier for Non-Urgent Tasks

Get a 50% discount by processing non-urgent workloads within 24 hours via the Batch API. Perfect for data processing, content generation, analytics, and background jobs. GPT-5.5 batch: $2.50/$15 per M tokens vs $5/$30 standard. Can save thousands per month on high-volume workloads.

4

Start with the Smallest Model That Works

Default to GPT-5.4 Nano ($0.20/$1.25 per M tokens) for classification, routing, and simple extraction. Move to GPT-5.4 Mini ($0.75/$4.50) for balanced tasks, GPT-5.4 ($2.50/$15) for production chat, and only reach for GPT-5.5 or Pro variants when the task genuinely needs deeper reasoning.

5

Implement Smart Model Routing

Cascade by complexity: GPT-5.4 Nano → GPT-5.4 Mini → GPT-5.4 → GPT-5.5 (or GPT-5.3 Codex for coding). Use a cheap classifier model to score difficulty, then dispatch. Most production workloads can run 60-80% of requests on Nano or Mini and reserve GPT-5.5 for the long tail.

6

Optimize Token Usage

Minimize prompt tokens without sacrificing quality. Use concise system messages, drop redundant few-shot examples once you have a stable schema, set max_tokens limits on outputs, and use function calling instead of verbose JSON in prompts. Each 1,000-token reduction on GPT-5.5 standard saves around $0.005 per request.

7

Monitor OpenAI API Usage in Real-Time

Track OpenAI spending per model with CostGoat's token-level visibility. Get instant alerts when switching from Mini to premium models, when prompt caching savings drop unexpectedly, or when batch processing opportunities are missed. Prevent budget overruns before they happen.

OpenAI Model Selection Guide

Use Case

Customer Support Chat

Recommended Model

GPT-5.4 Mini or Nano

Fast & Affordable

Monthly Cost (Est.)

~$15-80

Why This Model?

Lowest cost for conversational tasks with prompt caching

Use Case

Code Generation & Review

Recommended Model

GPT-5.3 Codex

Coding Specialist

Monthly Cost (Est.)

~$100-500

Why This Model?

Frontier agentic coding model tuned for software engineering tasks

Use Case

Complex Reasoning & STEM

Recommended Model

GPT-5.5

Latest Flagship

Monthly Cost (Est.)

~$200-1500

Why This Model?

Built-in reasoning at the highest quality (note: hidden reasoning tokens)

Use Case

Enterprise Deep Reasoning

Recommended Model

GPT-5.5 Pro

Premium Tier

Monthly Cost (Est.)

~$500-5K

Why This Model?

Deep-horizon reasoning for high-stakes enterprise workflows

Use Case

Content Writing

Recommended Model

GPT-5.4

Balanced

Monthly Cost (Est.)

~$50-300

Why This Model?

Strong quality-to-cost ratio for creative and long-form content

Use Case

Data Extraction & Classification

Recommended Model

GPT-5.4 Nano + Batch

Budget-Friendly

Monthly Cost (Est.)

~$5-60

Why This Model?

Lowest cost with 50% batch discount for structured extraction

Use Case

Non-Reasoning ChatGPT-style Chat

Recommended Model

gpt-5.5-chat-latest

No hidden reasoning

Monthly Cost (Est.)

~$50-400

Why This Model?

GPT-5.5 quality without reasoning-token billing surprises

OpenAI API Rate Limits & Usage Tiers

Usage Tier

Free Tier

Spend Requirement

$0 spent

GPT-5.5 Limits

500 RPM

200K TPM

GPT-5.4 Limits

500 RPM

200K TPM

Usage Tier

Tier 1

Spend Requirement

$100 spent

GPT-5.5 Limits

2K RPM

2M TPM

GPT-5.4 Limits

2K RPM

2M TPM

Usage Tier

Tier 2

Spend Requirement

$1,000 spent

GPT-5.5 Limits

5K RPM

10M TPM

GPT-5.4 Limits

5K RPM

10M TPM

Usage Tier

Tier 3

Spend Requirement

$10,000 spent

GPT-5.5 Limits

10K RPM

30M TPM

GPT-5.4 Limits

10K RPM

30M TPM

Usage Tier

Tier 4+

Spend Requirement

$50,000+ spent

GPT-5.5 Limits

20K+ RPM

100M+ TPM

GPT-5.4 Limits

20K+ RPM

100M+ TPM

RPM: Requests Per Minute | TPM: Tokens Per Minute. Limits vary by model and tier. Contact sales for enterprise-level rate limits.

Catch Reasoning Token Spikes Before They Hit Your Bill

Monitor your OpenAI credit balance from your menubar, with budget alerts for the GPT-5.5, GPT-5.4, and GPT-5.3 Codex tiers. Catch reasoning-token and Priority-tier spend before it hits your bill.

Try Free for 7 DaysLearn more →

Privacy-first desktop app. 7-day free trial, no sign-up required.

CostGoat desktop app showing AI agent quotas, usage costs, credit balances, and subscriptions

OpenAI API Pricing FAQ

Common questions about OpenAI API costs, billing, models, and optimization

AI Pricing

Gemini API PricingClaude API PricingGoogle Veo PricingAI Cost CalculatorsReplicate API PricingOpenRouter API PricingOpenRouter Free Models
DownloadsPricingDashboardContactIssuesAffiliatesTermsPrivacy

© 2026 CostGoat. All rights reserved.

Made by Functioncraft: Redis GUI Client · SSH GUI Client

Affiliate disclosure: Some links earn CostGoat a commission or credit when you sign up — no extra cost to you.