LLM Pricing Tracker: API and Subscription Costs

Mar 19, 2026

A tracker for leading LLM API token prices and consumer subscriptions, with official links, repo snapshot refreshes, price-history charts, and live benchmark/provider snapshots.

Note

Latest repo snapshot in this build: June 18, 2026. Click "Check for newer snapshot" below to query GitHub for a fresher snapshot. Each browser is limited to one remote check per day.

This page tracks public pricing from official provider pages for major frontier-model vendors I regularly compare: OpenAI, Google (Gemini and Gemma), Anthropic, xAI, DeepSeek, Qwen, Moonshot/Kimi, Xiaomi/MiMo, MiniMax, Together AI (including GLM-5 and public Llama endpoints), Meta/Llama references, and GitHub Copilot.

A few quick cautions before using the numbers:

  • API pricing and consumer subscription pricing are different products.
  • Some vendors publish tiered pricing by context length, region, or prompt type.
  • When a vendor does not publicly expose a comparable token-billing number, I mark that clearly instead of guessing.
  • The charts below use snapshots stored in this repo, including daily-refreshable Artificial Analysis benchmark/provider snapshots and manually curated pricing rows from official vendor pages.

Latest loaded snapshot: June 18, 2026
Manual snapshot refreshed against official provider pricing pages on June 18, 2026. API prices are list prices per 1M tokens unless a row explicitly notes a current promotion, region, or tier.
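
Because the rows are quoted per 1M tokens, the cost of a single request is a simple proportion. Here is a minimal sketch in Python. It assumes cached input tokens are billed at the cached rate and the rest of the input at the standard rate. The function name and the token counts are illustrative only; the prices are the GPT-5.5 list prices from the API table.

```python
from decimal import Decimal

PER_MILLION = Decimal(1_000_000)

def request_cost(input_tokens, cached_tokens, output_tokens,
                 input_price, cached_price, output_price):
    """Return the cost of one request, given prices per 1M tokens.

    cached_tokens is the portion of input_tokens served from the cache
    (an assumption about how the vendor meters cached input).
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached = input_tokens - cached_tokens
    total = (uncached * Decimal(input_price)
             + cached_tokens * Decimal(cached_price)
             + output_tokens * Decimal(output_price))
    return total / PER_MILLION

# GPT-5.5 list prices: $5.00 input, $0.50 cached input, $30.00 output.
# Illustrative request: 100K input tokens (40K of them cached), 2K output.
cost = request_cost(100_000, 40_000, 2_000, "5.00", "0.50", "30.00")
print(f"${cost:.2f}")
```

Prices are passed as strings and handled with Decimal so that small per-token rates such as DeepSeek's $0.0028 cache-hit price do not pick up floating-point noise.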

API Token Fees

Representative API pricing for major frontier-model vendors. Numbers are taken from official vendor pricing pages; each price stays in the currency the vendor quotes, with no conversion.

Vendor | Model / Product | Unit | Input | Cached input | Output | Notes | Official
--- | --- | --- | --- | --- | --- | --- | ---
OpenAI | GPT-5.5 (<272K context) | USD per 1M tokens | $5.00 | $0.50 | $30.00 | OpenAI developer pricing lists GPT-5.5 as a flagship model at the standard <272K-context tier; regional processing endpoints carry a 10% uplift. | OpenAI developer pricing
Google | Gemini 3.5 Flash | USD per 1M tokens | $1.50 | $0.15 | $9.00 | Current Gemini API paid-tier standard pricing for Gemini 3.5 Flash. Google describes it as its most intelligent model built for speed; context-cache storage is listed separately at $1.00 / 1M tokens per hour. | Gemini API pricing
Google | Gemma 4 | USD per 1M tokens | Free of charge | Free of charge | Free of charge | Google's current Gemini Developer API pricing page lists Gemma 4 with free input, output, and context caching, while the paid tier remains unavailable. | Google Gemini Developer API pricing
Anthropic | Claude Opus 4.8 | USD per 1M tokens | $5.00 | $0.50 (cache read) | $25.00 | Anthropic pricing lists Opus 4.8 as ideal for complex agentic coding and enterprise work. Prompt-cache writes are $6.25 / 1M tokens. Fable 5 is listed at $10 / $50 but is unavailable, so it is not used as the active API row. | Anthropic pricing
DeepSeek | deepseek-v4-flash | USD per 1M tokens | $0.14 (cache miss) | $0.0028 (cache hit) | $0.28 | Official DeepSeek V4 Flash overseas pricing. The compatibility names deepseek-chat and deepseek-reasoner are scheduled for deprecation on 2026-07-24 15:59 UTC. | DeepSeek models & pricing
DeepSeek | deepseek-v4-pro | USD per 1M tokens | $0.435 (cache miss) | $0.003625 (cache hit) | $0.87 | Official DeepSeek V4 Pro overseas pricing as listed on the current models and pricing page. | DeepSeek models & pricing
Qwen / Alibaba Cloud | qwen3-max | USD per 1M tokens | $1.20 (<=32K, International) | Context Cache discount available | $6.00 (<=32K, International) | Alibaba Cloud Model Studio now lists qwen3-max above qwen-max. International tiered pricing is $1.20/$6.00 up to 32K tokens, $2.40/$12.00 from 32K to 128K, and $3.00/$15.00 from 128K to 252K. | Alibaba Cloud Model Studio pricing
Together AI / Z AI | GLM-5 | USD per 1M tokens | $1.00 | - | $3.20 | Together AI's public serverless price for Z AI's GLM-5. The public model page does not list a separate cached-input rate. | Together AI GLM-5 pricing
Meta / Llama via Together AI | Llama 4 Maverick | USD per 1M tokens | $0.27 | - | $0.85 | Meta's Llama 4 Maverick served through Together AI's public serverless API. Meta's public developer docs describe the model family, while Together AI exposes a comparable public token price. | Together AI Llama 4 Maverick pricing
Moonshot AI / Kimi | kimi-k2.7-code | USD per 1M tokens | $0.95 (cache miss) | $0.19 (cache hit) | $4.00 | Kimi K2.7 Code is Kimi’s current most capable coding model. The high-speed variant is $1.90 cache-miss input, $0.38 cache-hit input, and $8.00 output per 1M tokens. | Kimi K2.7 Code pricing
Xiaomi / MiMo | MiMo-V2.5-Pro | USD per 1M tokens | $1.00 (<=256K cache miss) | $0.20 (<=256K cache hit) | $3.00 (<=256K) | Official overseas pricing for Xiaomi's flagship MiMo-V2.5-Pro / MiMo-V2-Pro tier. For prompts above 256K tokens, Xiaomi lists $2.00 input, $0.40 cached input, and $6.00 output per 1M tokens. | Xiaomi MiMo pricing
MiniMax | MiniMax-M3 | CNY per 1M tokens | ¥2.10 (<=512K, 50% off) | ¥0.42 (cache read, <=512K) | ¥8.40 (<=512K, 50% off) | Current MiniMax pay-as-you-go text pricing for MiniMax-M3 standard service after the listed permanent 50% discount. Prompts above 512K are listed at ¥4.20 input, ¥0.84 cache read, and ¥16.80 output while limited supply applies; priority service is 1.5x standard pricing. | MiniMax pay-as-you-go pricing
xAI | Grok 4.3 | USD per 1M tokens | $1.25 | $0.20 | $2.50 | xAI docs list Grok 4.3 at $1.25 input, $0.20 cached input, and $2.50 output per 1M tokens. | xAI Grok 4.3 pricing
GitHub Copilot | Copilot product pricing | N/A | N/A | N/A | N/A | GitHub Copilot is sold as a subscription product. GitHub does not publicly publish a Copilot per-token API price comparable to the other vendors here. | GitHub Copilot plans
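
Tiered rows need one extra step before the per-1M arithmetic: picking the tier. The sketch below uses the qwen3-max International tiers from the table. It rests on several assumptions of mine: the tier is selected by the request's input token count, "K" means 1,000 tokens, and each boundary belongs to the lower tier. The function and constant names are illustrative, and Alibaba Cloud's actual metering rules may differ.

```python
# (upper bound on input tokens, input USD per 1M, output USD per 1M)
# Values are the qwen3-max International tiers quoted in the table.
QWEN3_MAX_TIERS = [
    (32_000, 1.20, 6.00),
    (128_000, 2.40, 12.00),
    (252_000, 3.00, 15.00),
]

def qwen3_max_cost(input_tokens, output_tokens):
    """Estimate the USD cost of one request under tiered pricing.

    Assumes the whole request is billed at the tier that matches
    its input length (not confirmed by the pricing page).
    """
    for limit, in_price, out_price in QWEN3_MAX_TIERS:
        if input_tokens <= limit:
            return (input_tokens * in_price
                    + output_tokens * out_price) / 1_000_000
    raise ValueError("input exceeds the largest listed tier (252K tokens)")

short_prompt = qwen3_max_cost(20_000, 1_000)   # first tier
long_prompt = qwen3_max_cost(50_000, 1_000)    # second tier
print(round(short_prompt, 4), round(long_prompt, 4))
```

The same pattern covers the other tiered rows (GPT-5.5 at 272K, MiMo-V2.5-Pro at 256K, MiniMax-M3 at 512K) by swapping the tier list.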

Subscription Plans

Publicly listed consumer or team subscriptions from the official provider pages I checked for this tracker.

Vendor | Plan | Price | Notes | Official
--- | --- | --- | --- | ---
OpenAI | ChatGPT Plus | $20 / month | Consumer plan. API usage is billed separately. | ChatGPT pricing
OpenAI | ChatGPT Pro | $200 / month | Highest individual-access plan. | ChatGPT pricing
OpenAI | ChatGPT Business | $25 / user / month billed annually or $30 monthly | Shared workspace for teams and growing businesses. | ChatGPT pricing
Anthropic | Claude Pro | $20 / month | Individual paid plan for Claude. | Anthropic pricing
Anthropic | Claude Max 5x | $100 / month | 5x more usage than Pro. Includes Claude Code. | Claude Max
Anthropic | Claude Max 20x | $200 / month | 20x more usage than Pro. Includes Claude Code. | Claude Max
Anthropic | Claude Team | $25 / seat / month billed annually, $30 billed monthly | Standard team seats. Anthropic also lists premium seats at $150 / member / month, including Claude Code and higher usage. | Claude Team billing
Google | Google AI Pro | $19.99 / month | Formerly AI Premium. Consumer plan with Gemini app, Flow, NotebookLM and more. | Google AI plans
Google | Google AI Ultra | $249.99 / month | Highest Google AI subscription tier for the Gemini app and related tools. | Google AI plans
Xiaomi / MiMo | Token Plan (monthly) | $6 / $16 / $50 / $100 per month | Official monthly Token Plan tiers: Lite, Standard, Pro, and Max. The package covers MiMo-V2.5-Pro, MiMo-V2.5, and the rest of Xiaomi's current V2 / V2.5 programming-tool lineup. | MiMo Token Plan subscription
Xiaomi / MiMo | Token Plan (annual) | $63.36 / $168.96 / $528.00 / $1056.00 per year | Official annual Token Plan tiers: Lite, Standard, Pro, and Max. | MiMo Token Plan subscription
MiniMax | Coding Plan Starter | ¥29 / month | 40 prompts every 5 hours. | MiniMax Coding Plan
MiniMax | Coding Plan Plus | ¥49 / month | 100 prompts every 5 hours. | MiniMax Coding Plan
MiniMax | Coding Plan Max | ¥119 / month | 300 prompts every 5 hours. | MiniMax Coding Plan
xAI / Grok | X Premium+ | $40 / month in the US web pricing table | Help.x.com ties Premium+ to expanded Grok access. Regional prices vary. | X Premium pricing
GitHub Copilot | Copilot Pro | $10 / month or $100 / year | Individual developer plan. | GitHub Copilot plans
GitHub Copilot | Copilot Pro+ | $39 / month or $390 / year | Higher premium-request limits and broader model access. | GitHub Copilot plans
GitHub Copilot | Copilot Business | $19 / seat / month | Organization-managed plan from GitHub Docs. | GitHub Docs
GitHub Copilot | Copilot Enterprise | $39 / seat / month | Enterprise-managed plan from GitHub Docs. | GitHub Docs
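
When a plan lists both monthly and annual prices, the implied annual discount is easy to back out. A small sketch using the Xiaomi / MiMo Token Plan prices from the table; the helper name is mine, and the only assumption is that the annual price replaces twelve monthly payments.

```python
from decimal import Decimal

# (tier, monthly USD, annual USD) from the Xiaomi / MiMo Token Plan rows.
PLANS = [
    ("Lite", "6", "63.36"),
    ("Standard", "16", "168.96"),
    ("Pro", "50", "528.00"),
    ("Max", "100", "1056.00"),
]

def annual_discount(monthly, annual):
    """Fraction saved by paying annually instead of 12 monthly payments."""
    full_year = Decimal(monthly) * 12
    return (full_year - Decimal(annual)) / full_year

discounts = {name: annual_discount(m, a) for name, m, a in PLANS}
for name, d in discounts.items():
    print(f"{name}: {d:.0%}")
```

All four tiers come out at the same 12% discount. The same helper works for any other row that lists both a monthly and a yearly price, such as Copilot Pro.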

Some vendors in the API table are omitted here because I could not find a public official subscription plan for them in the sources checked for this snapshot: DeepSeek, Qwen / Alibaba Cloud, Moonshot AI / Kimi.

Price History

Switch provider, metric, and time grain to compare the official pricing checkpoints I have stored so far.

History lines use official pricing snapshots that are stored in this repo. Some providers only have one official snapshot recorded so far, while others mix successive flagship or promo models.

Artificial Analysis Benchmark Snapshot

The current top 10 models on the Artificial Analysis Intelligence Index. This list is intentionally model-level, so repeated vendors can appear more than once when they occupy multiple top-10 slots.

Prices and speeds below come from Artificial Analysis' benchmark snapshot, so they may differ from the manual API rows above when multiple deployments or reasoning modes exist. Source: Artificial Analysis models leaderboard (checked June 18, 2026).

Vendor | Benchmark model | Intelligence | Speed | Blended price | Prompt pricing | Details
--- | --- | --- | --- | --- | --- | ---
Anthropic | Claude Fable 5 (with fallback): Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) | 59.86 | 0 t/s | $ | Input $10 · Output $50 | Model details
Anthropic | Claude Opus 4.8 (max): Claude Opus 4.8 (Adaptive Reasoning, Max Effort) | 55.69 | 60.3 t/s | $ | Input $5 · Output $25 | Model details
OpenAI | GPT-5.5 (xhigh) | 54.84 | 56.35 t/s | $ | Input $5 · Output $30 | Model details
Anthropic | Claude Opus 4.7 (max): Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | 53.53 | 50.25 t/s | $ | Input $5 · Output $25 | Model details
OpenAI | GPT-5.5 (high) | 53.13 | 48.82 t/s | $ | Input $5 · Output $30 | Model details
Z AI | GLM-5.2 (max) | 50.67 | 104.83 t/s | $ | Input $1.4 · Output $4.4 | Model details
Google | Gemini 3.5 Flash: Gemini 3.5 Flash (high) | 50.21 | 48.2 t/s | $ | Input $1.5 · Output $9 | Model details
Anthropic | Claude Sonnet 4.6 (max): Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | 47.21 | 50.48 t/s | $ | Input $3 · Output $15 | Model details
OpenAI | GPT-5.5 (medium) | 47.14 | 52.65 t/s | $ | Input $5 · Output $30 | Model details
Google | Gemini 3.1 Pro Preview | 46.46 | 122.4 t/s | $ | Input $2 · Output $12 | Model details
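
A blended price collapses the input and output prices in the Prompt pricing column into one number. The sketch below assumes the 3:1 input-to-output weighting that Artificial Analysis describes for its blended price. The weights are left as parameters in case the methodology differs, and the function name is illustrative.

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average of input and output prices (USD per 1M tokens).

    The default 3:1 weighting is an assumption about the Artificial
    Analysis methodology, not something stated on this page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price
            + output_weight * output_price) / total_weight

# Prompt pricing from the table: Claude Opus 4.8 and GPT-5.5.
opus = blended_price(5, 25)
gpt = blended_price(5, 30)
print(opus, gpt)
```

Under that weighting, two models with the same input price can still differ noticeably once output cost is folded in, which is why I keep input and output as separate columns in the manual API table.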

Top-25 API Provider Leaderboard

Artificial Analysis provider leaderboard snapshot, aggregated to the best currently benchmarked endpoint for each provider. Switch metrics to compare intelligence, blended price, latency, and context window separately.

Each provider rank uses that provider's best currently benchmarked endpoint for the selected metric, so the representative model can change across metrics. Source: Artificial Analysis provider leaderboard (checked June 18, 2026).
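
The aggregation described above can be sketched as a group-by followed by a best-of pick per provider. Everything in this example is hypothetical: the provider and model names and the numbers are placeholders, the field names are my own, and the direction of each metric (higher or lower is better) is an assumption rather than the leaderboard's documented behavior.

```python
# Placeholder endpoint records; none of these values come from the snapshot.
ENDPOINTS = [
    {"provider": "provider-a", "model": "model-x", "intelligence": 50.0, "blended_price": 4.0},
    {"provider": "provider-a", "model": "model-y", "intelligence": 40.0, "blended_price": 1.0},
    {"provider": "provider-b", "model": "model-z", "intelligence": 45.0, "blended_price": 2.0},
]

# Assumed metric directions: True means a larger value ranks higher.
HIGHER_IS_BETTER = {
    "intelligence": True,
    "blended_price": False,
    "latency": False,
    "context_window": True,
}

def best_per_provider(endpoints, metric):
    """Pick each provider's best endpoint for a metric, then rank providers."""
    higher = HIGHER_IS_BETTER[metric]
    pick = max if higher else min
    grouped = {}
    for endpoint in endpoints:
        grouped.setdefault(endpoint["provider"], []).append(endpoint)
    best = [pick(rows, key=lambda r: r[metric]) for rows in grouped.values()]
    return sorted(best, key=lambda r: r[metric], reverse=higher)

by_intelligence = [e["model"] for e in best_per_provider(ENDPOINTS, "intelligence")]
by_price = [e["model"] for e in best_per_provider(ENDPOINTS, "blended_price")]
print(by_intelligence, by_price)
```

In the placeholder data, provider-a is represented by model-x when ranking by intelligence and by model-y when ranking by price. That is the effect the note above describes: the representative model can change across metrics.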

Scale and Price Frontier

A Star History-style yearly line for open-weight model sizes and the highest reviewed output-token prices since 2021.

Lines start in 2021 and combine curated source-backed historical records with the latest Artificial Analysis model metadata. Model size tracks the largest open-weight/open-access LLM by total disclosed parameters for each year, so sparse MoE and dense models are not quality-equivalent. Output-token price uses public USD text-token API prices and excludes tool-call, image, audio, video, and subscription pricing. Source: Artificial Analysis models (checked June 18, 2026).