OpenAI ChatGPT API Pricing Calculator - All Text/Chat Models
Calculate OpenAI API costs for current chat and code models per token, per request, and per month. Compare GPT-5.5, GPT-5.4 family, and GPT-5.3 Codex across Batch, Flex, Standard, and Priority tiers.
Pricing TLDR
- $5 free credits for new users (no credit card required)
- Pay-per-token (input/output per million tokens): GPT-5.4 Nano ($0.20/$1.25), GPT-5.4 ($2.50/$15), GPT-5.5 ($5/$30)
- Batch API 50% off; cached input 90% off on GPT-5.5 and GPT-5.4 families
OpenAI API Cost Calculator - Monthly Pricing
Standard-tier prices per 1M tokens:
- GPT-5.5 (gpt-5.5): In $5.00 / Out $30.00
- GPT-5.4 (gpt-5.4): In $2.50 / Out $15.00
- GPT-5.3 Codex (gpt-5.3-codex): In $1.75 / Out $14.00
- GPT-5.4 Mini (gpt-5.4-mini): In $0.75 / Out $4.50
- GPT-5.4 Nano (gpt-5.4-nano): In $0.20 / Out $1.25
- GPT-5.5 Pro (gpt-5.5-pro): In $30.00 / Out $180.00
- GPT-5.4 Pro (gpt-5.4-pro): In $30.00 / Out $180.00
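As a rough illustration of how the per-token prices above turn into per-request and monthly figures, here is a minimal Python sketch. The function names and the example workload (1,000 input tokens, 500 output tokens, 10,000 calls per month) are assumptions made for illustration; only the prices come from the list above, and the sketch covers the Standard tier without caching.

```python
# Standard-tier prices in USD per 1M tokens (input, output), taken from the
# per-model prices listed above.
PRICES = {
    "gpt-5.5": (5.00, 30.00),
    "gpt-5.4": (2.50, 15.00),
    "gpt-5.3-codex": (1.75, 14.00),
    "gpt-5.4-mini": (0.75, 4.50),
    "gpt-5.4-nano": (0.20, 1.25),
    "gpt-5.5-pro": (30.00, 180.00),
    "gpt-5.4-pro": (30.00, 180.00),
}


def request_cost(model, input_tokens, output_tokens):
    """Cost in USD of a single request at Standard-tier prices."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def monthly_cost(model, input_tokens, output_tokens, calls_per_month):
    """Cost in USD of calls_per_month identical requests."""
    return request_cost(model, input_tokens, output_tokens) * calls_per_month


# Hypothetical workload: 1,000 input + 500 output tokens, 10,000 calls/month.
for model in PRICES:
    print(f"{model}: ${monthly_cost(model, 1_000, 500, 10_000):,.2f}/month")
```

For this assumed workload, GPT-5.5 comes to about $200 per month and GPT-5.4 Nano to about $8.25 per month, which shows how much model choice alone moves the bill.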

About OpenAI API
What is OpenAI API?
The OpenAI API provides access to the current chat lineup: GPT-5.5 (latest flagship with built-in reasoning, 1.05M context), the GPT-5.4 family (GPT-5.4 plus Mini and Nano variants for cost-optimized production), GPT-5.5 Pro and GPT-5.4 Pro (premium reasoning tiers), and GPT-5.3 Codex (specialized coding model). Four pricing tiers (Batch, Flex, Standard, Priority) let you trade cost against latency per request, although not every tier is available on every model.
- Current Model Lineup: GPT-5.5: latest flagship with reasoning baked in ($5/$30). GPT-5.4: production-ready at lower cost ($2.50/$15). GPT-5.4 Mini and Nano: cheapest options for high-volume routing and extraction. GPT-5.5 Pro and GPT-5.4 Pro: premium tiers for deep-horizon enterprise reasoning. GPT-5.3 Codex: agentic coding specialist tuned for software engineering tasks.
- Flexible Pricing Tiers: Batch (50% discount, results within 24 hours), Flex (lower cost, variable latency on GPT-5.5 and GPT-5.4 families), Standard (default balanced tier), Priority (typically 2x cost, faster processing). Switch tiers per request to optimize cost vs speed. Pro variants are Standard/Batch only.
- Advanced Cost Features: Cached input drops 90% on GPT-5.5 and GPT-5.4 families (e.g. GPT-5.5 standard input $5.00 → $0.50 cached). Function calling, structured outputs, and built-in tools (Code Interpreter, File Search, Web Search) work across the lineup. Reasoning tokens are billed as output on reasoning-enabled models. OpenAI is winding down the fine-tuning platform (still available on GPT-4.1 family and o4-mini for existing users).
When to Use OpenAI API
Start with GPT-5.4 Nano or Mini for cost-sensitive routing and extraction, move to GPT-5.4 for balanced production workloads, and reach for GPT-5.5 (or Pro) when you need the highest reasoning quality. Use GPT-5.3 Codex for coding agents. Switch to Batch for non-urgent workloads (50% savings) and Priority for latency-critical user-facing flows.
Ideal for
- Production chat and agentic apps on GPT-5.5 or GPT-5.4
- Coding assistants and software engineering agents on GPT-5.3 Codex
- High-volume classification and extraction with GPT-5.4 Nano or Mini + Batch
- Deep-horizon enterprise reasoning with GPT-5.5 Pro or GPT-5.4 Pro
- Non-reasoning ChatGPT-style chat via the chat-latest aliases
Not ideal for
- Real-time applications requiring <50ms latency (even Priority tier)
- Simple text parsing where regex or rules-based code is cheaper
- Workloads needing guaranteed deterministic outputs
- Reasoning models on trivial tasks (hidden thinking tokens add cost)
OpenAI API Pricing Breakdown
Free Tier
New users receive $5 in free credits with no credit card required. These credits expire after 3 months and are automatically applied to your usage across all models and tiers.
- Sign up at platform.openai.com - no credit card required
- Receive $5 free credits instantly upon registration
- Credits expire after 3 months from grant date
- Works across all current models (GPT-5.5, GPT-5.4 family, GPT-5.3 Codex)
- Generate API key and start building immediately
Pricing Tiers Explained
Batch Tier (50% Discount)
Process non-urgent workloads within 24 hours at half price. Example: GPT-5.5 drops to $2.50/$15.00 per M tokens (vs $5/$30 standard). Perfect for data processing, content generation, and analysis tasks that aren't time-sensitive.
Flex Tier (Variable Latency)
Lower cost with variable latency on the GPT-5.5 and GPT-5.4 families (Pro variants excluded). Automatically routes requests during lower-demand periods. Good middle ground between Batch and Standard for non-critical applications.
Standard Tier (Default)
Default tier with balanced cost and performance, and the most common choice for production applications. Consistent latency and throughput. All models available. Rate limits scale with your usage tier, which rises with cumulative spend.
Priority Tier (Typically 2x Cost)
Higher cost (typically 2x standard) in exchange for faster processing. Best for user-facing applications requiring quick responses. Available on GPT-5.5, the GPT-5.4 family (including Mini and Nano), and GPT-5.3 Codex (Pro variants excluded).
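The tier descriptions above can be sketched as simple multipliers on the Standard price. This is an illustration only: the 2x Priority multiplier is the "typical" figure quoted above rather than a fixed rate, Flex is left out because no specific discount is stated for it, and the names `TIER_MULTIPLIER` and `tier_price` are hypothetical.

```python
# Multipliers relative to Standard pricing, as described above.
# Flex is omitted on purpose: the text gives no exact discount for it.
TIER_MULTIPLIER = {
    "batch": 0.5,     # 50% discount, results within 24 hours
    "standard": 1.0,  # default tier
    "priority": 2.0,  # "typically 2x" per the text; treat as approximate
}


def tier_price(standard_in, standard_out, tier):
    """Return (input, output) USD per 1M tokens for the given tier."""
    if tier not in TIER_MULTIPLIER:
        raise ValueError(f"no multiplier known for tier: {tier!r}")
    m = TIER_MULTIPLIER[tier]
    return standard_in * m, standard_out * m


# GPT-5.5 at $5/$30 Standard:
print(tier_price(5.00, 30.00, "batch"))     # (2.5, 15.0)
print(tier_price(5.00, 30.00, "priority"))  # (10.0, 60.0)
```

The Batch result matches the $2.50/$15.00 figure quoted for GPT-5.5 above. Remember that the Pro variants do not offer Flex or Priority, so a real calculator would also need a per-model availability check.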
Cost Optimization Features
Prompt Caching (90% Savings)
Cache frequently used prompts, system messages, or documents. Cached input is billed at 10% of the standard rate on the GPT-5.5 and GPT-5.4 families (e.g. GPT-5.5 input $5.00/M → cached $0.50/M). Cache persists 5-10 minutes. Works for chatbots with consistent system prompts, RAG applications, and repeated context.
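To see what the cached rate means for a real mix of cached and uncached input, one option is to compute a blended input price. The sketch below assumes cached tokens are billed at 10% of the standard rate, as stated above; `effective_input_rate` and the 80% hit fraction are illustrative choices, not figures from OpenAI.

```python
def effective_input_rate(standard_rate, cache_hit_fraction, cached_multiplier=0.10):
    """Blended input price (USD per 1M tokens) for a given cache-hit fraction.

    cache_hit_fraction is the share of input tokens served from the cache;
    cached_multiplier=0.10 reflects the 90% discount described above.
    """
    if not 0.0 <= cache_hit_fraction <= 1.0:
        raise ValueError("cache_hit_fraction must be between 0 and 1")
    cached_rate = standard_rate * cached_multiplier
    return (cache_hit_fraction * cached_rate
            + (1.0 - cache_hit_fraction) * standard_rate)


# GPT-5.5 input at $5.00/M with an assumed 80% of input tokens cached.
blended = effective_input_rate(5.00, 0.80)
print(f"${blended:.2f} per 1M input tokens")  # about $1.40
```

With no cache hits the function returns the standard $5.00, and with every token cached it returns $0.50, so the blended rate always sits between the two prices quoted above.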
Fine-Tuning (Winding Down)
OpenAI is winding down the fine-tuning platform as of May 2026, and it is no longer open to new users. Existing fine-tuning users can still create training jobs on GPT-4.1, GPT-4.1-mini (SFT/DPO) and o4-mini (RFT). Fine-tuned models remain available for inference until their base models are deprecated. The current GPT-5.5 / GPT-5.4 family is not available for fine-tuning.
Understanding Reasoning Tokens
Reasoning-enabled models (GPT-5.5, GPT-5.5 Pro, GPT-5.4 Pro, GPT-5.3 Codex) generate hidden 'thinking' tokens that are billed as output but not returned in the response. These tokens occupy context window space and can significantly increase costs. A response with 500 visible tokens may be billed as 2,000+ output tokens. Use chat-latest aliases or GPT-5.4 Mini/Nano when you don't need extended reasoning.
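The effect of hidden reasoning tokens on the bill can be worked through with a few lines of Python. This is a sketch under stated assumptions: the split of 500 visible and 1,500 reasoning tokens is an example consistent with the "2,000+" figure above, and `billed_output` is a hypothetical helper, not part of any SDK.

```python
def billed_output(visible_tokens, reasoning_tokens, out_price_per_m):
    """Return (billed output tokens, cost in USD).

    Reasoning tokens are billed at the output rate even though they are
    not returned in the response.
    """
    billed_tokens = visible_tokens + reasoning_tokens
    return billed_tokens, billed_tokens * out_price_per_m / 1_000_000


# Assumed example: 500 visible + 1,500 hidden reasoning tokens on GPT-5.5
# at $30 per 1M output tokens.
tokens, cost = billed_output(500, 1_500, 30.00)
_, visible_only_cost = billed_output(500, 0, 30.00)
print(tokens, f"${cost:.3f}", f"vs ${visible_only_cost:.3f} for visible tokens only")
```

In this example the request is billed for 2,000 output tokens (about $0.06) instead of 500 (about $0.015), a fourfold difference that never shows up in the visible response length.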
Prepaid Credits System
Buy credits upfront with no monthly fees. Credits expire after 12 months. Set auto-reload thresholds to never run out. Volume discounts available for enterprise. No setup fees or subscription costs.
Usage Tracking & Limits
Real-time usage monitoring in dashboard. Set spending limits and alerts. Failed requests aren't charged. Rate limits increase automatically with usage tier (based on cumulative spend).
OpenAI API Monthly Cost Estimates
Light Use: $5-40/mo
- Personal projects
- <1K requests/day
- GPT-5.4 Nano or Mini
Medium Use: $40-200/mo
- Small apps
- 1-5K requests/day
- GPT-5.4 + Batch tier mix
Heavy Use: $200-1.5K/mo
- Production apps
- 5-20K requests/day
- GPT-5.5 with caching enabled
Enterprise: $1.5K+/mo
- Large scale
- 20K+ requests/day
- GPT-5.5 Pro on Priority tier
7 OpenAI API Cost Optimization Tips
Watch Hidden Reasoning Token Costs
GPT-5.5, GPT-5.5 Pro, GPT-5.4 Pro, and GPT-5.3 Codex emit 'reasoning tokens' for internal thinking that are billed as output but not returned in the response. A response showing 500 visible output tokens can easily be billed as 2,000+ output tokens. Use the chat-latest non-reasoning aliases or GPT-5.4 Mini/Nano when extended reasoning isn't required.
Use Prompt Caching for 90% Off Input
Cached input is billed at 10% of the standard rate on the GPT-5.5 and GPT-5.4 families. Example: GPT-5.5 input drops from $5.00/M to $0.50/M. With 80% cache hits on GPT-5.5 at 10K requests/day, savings are around $1,000-$1,500/month, assuming roughly 1,000 to 1,400 cached input tokens per request. Works best for chatbots with consistent system prompts and RAG applications with repeated context.
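The monthly savings estimate above depends on how many input tokens per request are actually cacheable, which the text does not pin down. The sketch below makes that dependency explicit; the 1,200-token cached prefix and the 30-day month are assumptions chosen for illustration, and `monthly_cache_savings` is a hypothetical helper.

```python
def monthly_cache_savings(requests_per_day, cached_tokens_per_request,
                          hit_rate, standard_rate, cached_rate, days=30):
    """Estimated USD saved per month by prompt caching.

    Rates are USD per 1M input tokens; hit_rate is the share of requests
    whose cacheable prefix is actually served from the cache.
    """
    cached_tokens = (requests_per_day * days * hit_rate
                     * cached_tokens_per_request)
    return cached_tokens * (standard_rate - cached_rate) / 1_000_000


# GPT-5.5: $5.00/M standard vs $0.50/M cached, 10K requests/day, 80% hits,
# and an assumed 1,200-token cacheable prefix per request.
savings = monthly_cache_savings(10_000, 1_200, 0.80, 5.00, 0.50)
print(f"${savings:,.0f}/month")  # about $1,296
```

Halving the cacheable prefix halves the savings, so it is worth measuring real prompt sizes before relying on an estimate like this.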
Leverage Batch Tier for Non-Urgent Tasks
Get a 50% discount by processing non-urgent workloads within 24 hours via the Batch API. Perfect for data processing, content generation, analytics, and background jobs. GPT-5.5 batch: $2.50/$15 per M tokens vs $5/$30 standard. Can save thousands per month on high-volume workloads.
Start with the Smallest Model That Works
Default to GPT-5.4 Nano ($0.20/$1.25 per M tokens) for classification, routing, and simple extraction. Move to GPT-5.4 Mini ($0.75/$4.50) for balanced tasks, GPT-5.4 ($2.50/$15) for production chat, and only reach for GPT-5.5 or Pro variants when the task genuinely needs deeper reasoning.
Implement Smart Model Routing
Cascade by complexity: GPT-5.4 Nano → GPT-5.4 Mini → GPT-5.4 → GPT-5.5 (or GPT-5.3 Codex for coding). Use a cheap classifier model to score difficulty, then dispatch. Most production workloads can run 60-80% of requests on Nano or Mini and reserve GPT-5.5 for the long tail.
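A minimal sketch of the cascade described above might look like the following. The difficulty score is assumed to come from a separate cheap classifier and to lie between 0 and 1; the thresholds, the `route` function, and the `is_coding` flag are all hypothetical and would need tuning against real traffic.

```python
# (upper bound on difficulty score, model) pairs, cheapest first.
# Thresholds are illustrative assumptions, not recommended values.
CASCADE = [
    (0.25, "gpt-5.4-nano"),
    (0.50, "gpt-5.4-mini"),
    (0.80, "gpt-5.4"),
]
TOP_MODEL = "gpt-5.5"
TOP_CODING_MODEL = "gpt-5.3-codex"


def route(difficulty, is_coding=False):
    """Pick the cheapest model whose threshold covers the difficulty score."""
    if not 0.0 <= difficulty <= 1.0:
        raise ValueError("difficulty must be between 0 and 1")
    for threshold, model in CASCADE:
        if difficulty <= threshold:
            return model
    # Hardest requests go to the flagship, or to Codex for coding tasks.
    return TOP_CODING_MODEL if is_coding else TOP_MODEL


print(route(0.10))                  # gpt-5.4-nano
print(route(0.70))                  # gpt-5.4
print(route(0.95))                  # gpt-5.5
print(route(0.95, is_coding=True))  # gpt-5.3-codex
```

Keeping the thresholds in a single table makes it easy to shift more traffic to Nano or Mini once quality metrics confirm that the cheaper models are holding up.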
Optimize Token Usage
Minimize prompt tokens without sacrificing quality. Use concise system messages, drop redundant few-shot examples once you have a stable schema, set max_tokens limits on outputs, and use function calling instead of verbose JSON in prompts. Each 1,000 input tokens trimmed from a prompt on GPT-5.5 standard saves around $0.005 per request.
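The per-request saving quoted above is small, so it helps to scale it by request volume. The following sketch does that arithmetic; `trim_savings` is a hypothetical helper, and the 300,000 requests per month (10K per day over 30 days) is an assumed volume for illustration.

```python
def trim_savings(tokens_removed, input_price_per_m, requests=1):
    """USD saved by removing tokens_removed input tokens from each request."""
    return tokens_removed * requests * input_price_per_m / 1_000_000


# GPT-5.5 standard input at $5.00 per 1M tokens.
per_request = trim_savings(1_000, 5.00)
per_month = trim_savings(1_000, 5.00, requests=300_000)  # assumed volume
print(f"${per_request:.3f} per request, ${per_month:,.0f} per month")
```

At the assumed volume, trimming 1,000 input tokens per request is worth about $1,500 per month, although prompts that are already cached at the 90% discount would save proportionally less.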
Monitor OpenAI API Usage in Real-Time
Track OpenAI spending per model with CostGoat's token-level visibility. Get instant alerts when switching from Mini to premium models, when prompt caching savings drop unexpectedly, or when batch processing opportunities are missed. Prevent budget overruns before they happen.
OpenAI Model Selection Guide
Use Case: Customer Support Chat
Recommended Model: GPT-5.4 Mini or Nano (Fast & Affordable)
Monthly Cost (Est.): ~$15-80
Why This Model? Lowest cost for conversational tasks with prompt caching

Use Case: Code Generation & Review
Recommended Model: GPT-5.3 Codex (Coding Specialist)
Monthly Cost (Est.): ~$100-500
Why This Model? Frontier agentic coding model tuned for software engineering tasks

Use Case: Complex Reasoning & STEM
Recommended Model: GPT-5.5 (Latest Flagship)
Monthly Cost (Est.): ~$200-1500
Why This Model? Built-in reasoning at the highest quality (note: hidden reasoning tokens)

Use Case: Enterprise Deep Reasoning
Recommended Model: GPT-5.5 Pro (Premium Tier)
Monthly Cost (Est.): ~$500-5K
Why This Model? Deep-horizon reasoning for high-stakes enterprise workflows

Use Case: Content Writing
Recommended Model: GPT-5.4 (Balanced)
Monthly Cost (Est.): ~$50-300
Why This Model? Strong quality-to-cost ratio for creative and long-form content

Use Case: Data Extraction & Classification
Recommended Model: GPT-5.4 Nano + Batch (Budget-Friendly)
Monthly Cost (Est.): ~$5-60
Why This Model? Lowest cost with 50% batch discount for structured extraction

Use Case: Non-Reasoning ChatGPT-style Chat
Recommended Model: gpt-5.5-chat-latest (No hidden reasoning)
Monthly Cost (Est.): ~$50-400
Why This Model? GPT-5.5 quality without reasoning-token billing surprises
OpenAI API Rate Limits & Usage Tiers
Usage Tier | Spend Requirement | GPT-5.5 Limits | GPT-5.4 Limits
(not listed) | (not listed) | 200K TPM | 200K TPM
(not listed) | (not listed) | 2M TPM | 2M TPM
(not listed) | (not listed) | 10M TPM | 10M TPM
(not listed) | (not listed) | 30M TPM | 30M TPM
(not listed) | (not listed) | 100M+ TPM | 100M+ TPM
RPM: Requests Per Minute | TPM: Tokens Per Minute. Limits vary by model and tier. Contact sales for enterprise-level rate limits.
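A quick way to sanity-check a workload against a TPM limit is to multiply request rate by tokens per request. The sketch below is a simplification: it assumes both input and output tokens count toward the limit and that traffic is evenly spread over each minute, neither of which is stated above. The helper names and the example workload are hypothetical.

```python
def tokens_per_minute(requests_per_minute, input_tokens, output_tokens):
    """Approximate token throughput, counting input and output tokens."""
    return requests_per_minute * (input_tokens + output_tokens)


def fits_limit(requests_per_minute, input_tokens, output_tokens, tpm_limit):
    """True if the estimated throughput stays within the TPM limit."""
    return tokens_per_minute(requests_per_minute, input_tokens,
                             output_tokens) <= tpm_limit


# Assumed workload: 1,000 input + 500 output tokens per request.
print(fits_limit(100, 1_000, 500, 200_000))    # True: 150K TPM needed
print(fits_limit(200, 1_000, 500, 200_000))    # False: 300K TPM needed
print(fits_limit(200, 1_000, 500, 2_000_000))  # True under the 2M TPM limit
```

Real traffic is bursty, so leaving headroom below the limit is usually safer than planning right up to it.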

OpenAI API Pricing FAQ
Common questions about OpenAI API costs, billing, models, and optimization
