AI Coding Plans — Pricing & Usage Reference

Last updated: 2026-05-20 | Auto-synced daily

Compiled April 2026. All plans compared across major providers.


BytePlus ModelArk Subscribe

ByteDance’s AI inference platform. Coding Plan = subscription with shared quota across supported tools. Now integrates with OpenClaw and Hermes Agent as supported coding tools.

Available Models

Model Type Notes
Dola-Seed-2.0-Pro Coding (flagship) ByteDance’s top-tier coding model. 256K context. Multimodal. Officially launched on BytePlus ModelArk May 2026
Dola-Seed-2.0-Lite Coding (budget) Cheaper, good for routine tasks
Ark-Coding-Pro Coding BytePlus proprietary coding model
O-Pro Coding BytePlus proprietary
ByteDance-Seed-Code Coding Code-specific optimization
GLM-5.1 Frontier (Z.ai) 754B MoE, SWE-Bench Pro 58.4%. Quota drain: ~3x
GLM-4.7 General (Z.ai) Reliable daily driver. 1x quota
Kimi-K2.5 Frontier (Moonshot) 1T MoE, 256K ctx. Quota drain: ~2x
DeepSeek-V3.2 General 73.0% SWE-Bench Verified
GPT-OSS-120B Open-weight 117B params, Apache 2.0

Quotas

Plan Price Per 5h Per Week Per Month
Lite ~$10/mo ~800 ~3,200 ~24,000
Pro ~$40/mo ~4,000 ~16,000 ~120,000

Personal Review (May 2026): The BytePlus-provided models like Dola-Seed-2.0-Pro, O-Pro, and Ark-Coding-Pro are significantly weaker than alternatives available for free on other platforms — e.g. MiniMax M2.5 (80.2% SWE-Bench, free via OpenRouter) easily outperforms them. The real issue is with frontier models like GLM-5.1: despite being listed as available, the quota multiplier (3x drain during peak hours) means you get only about 10–15 actual requests per 5h window. The advertised quotas (“800 per 5h”) are based on the cheapest base models at 1x drain — not usable on any capable model. In practice, the plan is misleading for anyone wanting to use frontier models. Not recommended unless you only need the weak ByteDance models for simple completions.


GitHub Copilot Plans

Source: docs.github.com, github.blog, winbuzzer.com

Plans (Current — May 2026)

Plan Price Premium Req/mo (old) / AI Credits (new) Completions Agent/CLI Status
Free $0 50 2,000/mo Limited Available
Student Free (verified) Unlimited (old only) Unlimited Yes Signups paused since Apr 20
Pro $10/mo ($100/yr) 300 PRU / 1,000 base + 500 flex credits Unlimited Yes Signups paused since Apr 20
Pro+ $39/mo ($390/yr) 1,500 PRU / 3,900 base + 3,100 flex credits Unlimited Yes Signups paused since Apr 20
Business $19/seat/mo $19 AI Credits/seat/mo ($30 promo Jun-Aug) Unlimited Yes Available
Enterprise $39/seat/mo $39 AI Credits/seat/mo ($70 promo Jun-Aug) Unlimited Yes Available
Max (new) $100/mo $100 base + $100 flex ($200 total) Unlimited Yes Starts Jun 1, 2026

Key Changes (Apr–May 2026)

Per-Token Pricing (Jun 2026 — Usage-Based)

Per-token rates from docs.github.com. 1 AI credit = $0.01 USD. Over-usage billed at these rates.

OpenAI: | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | GPT-5.5 | $5.00 | $0.50 | $30.00 | | GPT-5.4 | $2.50 | $0.25 | $15.00 | | GPT-5.4 mini | $0.75 | $0.075 | $4.50 | | GPT-5.4 nano | $0.20 | $0.02 | $1.25 | | GPT-5.3-Codex | $1.75 | $0.175 | $14.00 | | GPT-5.2 | $1.75 | $0.175 | $14.00 | | GPT-5.2-Codex | $1.75 | $0.175 | $14.00 | | GPT-4.1 | $2.00 | $0.50 | $8.00 | | GPT-5 mini | $0.25 | $0.025 | $2.00 |

Anthropic (includes cache write cost): | Model | Input /1M | Cached Input | Cache Write | Output /1M | |——-|———-:|:————:|:———–:|———–:| | Claude Opus 4.7 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Opus 4.6 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Opus 4.5 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Sonnet 4.6 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Sonnet 4.5 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Sonnet 4 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Haiku 4.5 | $1.00 | $0.10 | $1.25 | $5.00 |

Google: | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | Gemini 3.1 Pro | $2.00 | $0.20 | $12.00 | | Gemini 2.5 Pro | $1.25 | $0.125 | $10.00 | | Gemini 3 Flash | $0.50 | $0.05 | $3.00 |

Fine-tuned (GitHub): | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | Raptor mini | $0.25 | $0.025 | $2.00 | | Goldeneye | $1.25 | $0.125 | $10.00 |

Plans Summary (Jun 2026)

Plan Price Base Flex Total
Free $0 50 completions/mo + limited chat/agent
Pro $10/mo $10 $5 $15 total included
Pro+ $39/mo $39 $31 $70 total included
Max $100/mo $100 $100 $200 total included
Business $19/seat/mo $19 Flex promo Jun-Aug
Enterprise $39/seat/mo $39 Flex promo Jun-Aug

Community Reviews

From techsifted (Apr 2026): “Microsoft paused new signups for Copilot Pro, Pro+, and Student plans. Then they removed Opus models from the Pro tier entirely. If you’d built workflows around Opus inside GitHub Copilot at $10/month, those workflows are broken now.”

From piunikaweb (Mar 2026): “To manage costs, GitHub has decided to remove access to flagship Claude models such as Opus and Sonnet, on the student plan.”

From Reddit (r/github): Users widely criticized the student plan downgrades and Pro signup pause. Many students reported being locked out of models they relied on for coursework.

Personal Review (May 2026): GitHub Copilot was the most reliable and good coding plan I used. A lot of models were available and the IDE integration was seamless. I used it on the Student plan which was fantastic initially. The bad part: they progressively removed all frontier models from the Student plan — Claude Opus/Sonnet went first, then GPT-5.3-Codex was pulled from the model picker. What started as a generous free plan for students got hollowed out over time. Now with usage-based billing coming June 1, the value proposition is even more uncertain — your $10 Pro gets you $10 worth of AI Credits, and heavy agent sessions will burn through that fast. Still a solid product for light use, but heavy users will need the $100 Max plan.


GLM / Z.ai Subscribe

5h sliding window + weekly cap. Overseas price raised Apr 12 2026.
Powered by GLM-5 (Feb 2026) and GLM-5.1 (Mar 27 2026).

Plan Domestic Overseas Per 5h Per Week Monthly Cap
Plan Price (Quarterly) Monthly Equivalent Per 5h Per Week Notes
Lite $30/quarter ($27 Q2) ~$10/mo ~80 prompts ~320 GLM-5.1, 5-Turbo, 4.7, 4.6, 4.5-Air
Pro $90/quarter ($81 Q2) ~$30/mo ~400 prompts ~2,000 Unlocks GLM-5 flagship. ~5x Lite
Max $240/quarter ($216 Q2) ~$80/mo ~1,600 prompts ~6,400 ~4x Pro. Highest throughput

Model Quota Multipliers (Critical)

Model / Time Multiplier Effective Lite prompts/5h
GLM-5.1 / GLM-5-Turbo at peak (14-18 UTC+8) 3x drain ~26
GLM-5.1 / GLM-5-Turbo off-peak 2x drain ~40
GLM-5.1 / GLM-5-Turbo off-peak (promo ended Apr 30) 1x drain ~80
GLM-5 (base) off-peak 2x drain ~40
GLM-4.7 all times 1x drain ~80
GLM-4.5-Air all times 1x drain ~80

OpenCode Go Plan Subscribe

OpenCode’s official low-cost subscription. $5 first month, then $10/month. Dollar-based limits (not request counts). Models hosted in US, EU, Singapore. Zero-retention policy.

Window Dollar Limit
Per 5 hours $12
Per week $30
Per month $60

Available Models (12)

Source: opencode.ai/docs/go

Model Provider Per 5h Per Week Per Month Notes
DeepSeek V4 Flash DeepSeek 31,650 79,050 158,150 Cheapest, huge quota
Qwen3.5 Plus Alibaba 10,200 25,200 50,500 Best raw request count
MiniMax M2.5 MiniMax 6,300 15,900 31,800 80.2% SWE-Bench
MiniMax M2.7 MiniMax 3,400 8,500 17,000 Newer, 196K ctx
Qwen3.6 Plus Alibaba 3,300 8,200 16,300 78.8% SWE-Bench
DeepSeek V4 Pro DeepSeek 3,450 8,550 17,150 80.6% SWE-Bench
MiMo-V2.5-Omni Xiaomi 2,150 5,450 10,900 Multimodal
MiMo-V2.5-Pro Xiaomi 1,290 3,225 6,450 Coding focus
Kimi K2.5 Moonshot 1,850 4,630 9,250 256K ctx, 76.8% SWE-Bench
Kimi K2.6 Moonshot 1,150 2,880 5,750 Updated K2.5, 58.6% SWE-Bench Pro
GLM-5 Z.ai 1,150 2,880 5,750 77.8% SWE-Bench
GLM-5.1 Z.ai 880 2,150 4,300 Best reasoning. 58.4% SWE-Bench Pro

Free Model

Big Pickle (~GLM-4.6, 200K ctx): 200 requests/5h - no subscription needed.

Community Reviews

From Reddit (Mar 2026): Described as “genuinely the worst coding plan I have ever used” - 94% upvotes. Criticism centered on quantized models and aggressive rate limits on reasoning models. (r/LocalLLaMA)

From APIYI review: “Getting three flagship open-source models for $10/month … you’re getting 6x the model invocation value. MiniMax M2.5 is the most cost-effective choice - it has the highest limits and the strongest coding ability.”

From Thomas Wiegold (Apr 2026): “MiniMax gives you a good number of requests … up to 31,800 per month with M2.5. These aren’t toy models - M2.5 scored 80.2% on SWE-Bench Verified, within spitting distance of Claude Opus 4.6’s 80.8%. The catch is that reasoning-heavy models like GLM-5.1 burn through limits fast.”

Personal Review (May 2026): Best and cheapest coding plan available. Generous quotas on MiniMax and Qwen models, super fast inference. Mostly reliable with top open-weight models like GLM-5.1 and Kimi K2.6 at good limits. MiniMax M2.5 at $10/month is unmatched value. DeepSeek V4 Flash gets 158K requests/month - absurdly cheap. Strongly recommended.

Notes


Claude Code (Anthropic) Pricing

Terminal coding agent. All usage shared across claude.ai, Claude Code CLI, and Desktop. Source: claude.com/pricing, community instrumentation.

Individual Plans

Plan Price Multiplier Est. Msgs/5h (Opus) Context CLI
Pro $20/mo ($17/yr) 1x (doubled) ~45-90 200K (1M ext) Yes
Max 5x $100/mo 5x (doubled) ~225-450 1M Yes
Max 20x $200/mo 20x (doubled) ~900-1,800 1M Yes

Team Plans

Plan Price Multiplier Weekly Cap Min Seats
Team Standard $25/seat/mo ($20 annual) ~1.25x Pro (doubled) 7-day, 1 cap 5
Team Premium $125/seat/mo ($100 annual) ~5x Pro 7-day, 2 caps 5-150

API Pay-per-Token

Model Input /1M Output /1M Cache Read Batch (50% off)
Opus 4.7 $5.00 $25.00 $0.50 Yes
Sonnet 4.6 $3.00 $15.00 $0.30 Yes
Haiku 4.5 $1.00 $5.00 $0.10 Yes

Recent Changes (May 2026)

Known Issues

Community Reviews

From Reddit (r/ClaudeAI): “The difference of Claude Pro and Max5 plan usage limit are enormous. It is not only 5x” – users report Max 5x is a dramatically better experience for daily coding, not just numerically 5x better.

From dev.to review: “Pro costs ~$0.44 per Opus message; Max 5x costs the same per message but prevents hitting rate limits sooner. Max 20x cuts per-message cost in half (~$0.22) and is the only plan offering true volume discount.”

From findskill.ai: “Pro $20/mo for solo devs, Max 5x $100/mo for full-time on Claude Code, Max 20x $200/mo for pair-programming all day. The v2.1.100 token inflation bug is real – burns quotas ~40% faster.”

From felloai review: “Max 5x at $100/month gives five times the Pro usage and priority access during peak demand. It is the sweet spot for full-time developers who use Claude Code as their default coding partner.”

Personal Review (May 2026): Claude Code Pro ($20/mo) is decent for light use but the 5-hour rolling window makes it impractical for serious daily development – you WILL hit the wall. The good news: Anthropic just doubled rate limits and removed peak hours throttling (thanks to SpaceX Colossus 1), so Pro is now much more usable — ~90 Opus messages per 5h instead of 45. Max 5x ($100/mo) is still the minimum for real work, but the gap is narrower. On the plus side, code quality is unmatched – Opus 4.7 is genuinely the smartest model for complex refactoring and architecture decisions. The main frustration is Anthropic’s opaque quota system and the tokenizer/v2.1.100 bugs silently eating into your limits. Compared to OpenCode Go ($10/mo with MiniMax M2.5), Claude Code is still 5-10x the price for incrementally better quality. Only worth it if you need Opus-level reasoning for complex codebases. The v2.1.100 bug may have been fixed — keep an eye on releases.


GPT Codex (OpenAI) Pricing

OpenAI’s agentic coding tool — CLI, IDE extension, ChatGPT app, and cloud. Codex is bundled into ChatGPT plans (no separate subscription).
Source: developers.openai.com/codex/pricing

ChatGPT Plans (Codex Included)

Plan Price Best For
Free $0 Limited Codex Mini access
Go $20/mo Budget-friendly Codex Mini (select regions). Reduced limits
Plus $20/mo Entry point for Codex CLI + IDE + cloud
Pro ($100) $100/mo 5x (10x promo until May 31) higher limits. Includes GPT-5.3-Codex-Spark preview
Pro ($200) $200/mo 20x higher limits. Heavy daily use
Business $25/seat/mo Team workspace + admin controls
API Key Pay-per-token CI/CD, automation, programmatic

Codex Usage Limits (Plus — 5-hour rolling window)

Model Local Msgs/5h Cloud Tasks/5h Code Reviews/5h
GPT-5.5 15-80
GPT-5.4 20-100
GPT-5.4-mini 60-350
GPT-5.3-Codex 30-150 10-60 20-50

Pro tiers: 5x to 20x these limits depending on tier. Promo: Pro $100 gets 2x (10x total) until May 31, 2026.

API Pay-per-Token (Codex API)

Model Input /1M Output /1M
GPT-5.5 $2.50 $10.00
GPT-5.5 mini $0.40 $1.60
GPT-5.5 nano $0.10 $0.40
Codex (specialized) $5.00 $15.00

Notes

Community Reviews

From aitoolsrecap (2026): “The April 16, 2026 update added computer use, memory, image generation, and 90+ plugins – turning it from a coding assistant into a full developer workstation. Included from $20/mo on ChatGPT Plus.”

From elite-ai-assisted-coding review: “Codex CLI performed well throughout testing. GPT-5’s capabilities were the key differentiator – the model followed instructions precisely, learned from documentation effectively, and applied knowledge appropriately.”

Personal Review (May 2026): Overall positive experience. The limits are fine and good enough for daily use – the mini models (GPT-5.4-mini) were the best thing about the plan, offering generous quotas for routine tasks. The main problem is with the frontier models (GPT-5.4 and GPT-5.5): when they understand you correctly from the start, they work really well and do exactly what’s asked. But if they misinterpret the intent, things go spectacularly wrong – they commit to wrong approaches confidently and produce deeply broken code. It’s an all-or-nothing experience. The Plus plan at $20/mo is reasonable value, but Pro at $200/mo is hard to justify vs OpenCode Go or BytePlus.


Antigravity (Google IDE) Download - Pricing

Google’s agent-first IDE (fork of VS Code). Runs multiple AI agents in parallel.
Includes Claude models (Sonnet & Opus) alongside Gemini — all in one IDE.
Source: antigravity.google/pricing

Plans

Plan Price Models Notes
Free $0 All models (rate-limited) Access to all 7 models. Quota refreshes ~5h (unreliable). 92% quota cut since Dec 2025
AI Pro $20/mo All models (higher limits) Built-in credits (amount undisclosed). Best for hobbyists. Multi-day lockouts reported
AI Ultra $249.99/mo All models (highest limits) Most expensive AI coding sub on the market. Still not immune to quota issues (Mar 2026+)
Pay-as-you-go $25/2,500 credits Full access Overflow / burst usage. $0.01/credit

Available Models

Model Provider Type Notes
Gemini 3.1 Pro Google Reasoning (High/Low) Google’s flagship. Good for frontend work only
Gemini 3 Flash Google Fast Very capable for frontend. Reliable
Claude Opus 4.6 Anthropic Premium $200/mo value, available here. Best for backend
Claude Sonnet 4.6 Anthropic Mid-tier Fast, reliable for backend code
GPT-OSS 120B OpenAI Open-weight Free model

Usage Tips

Known Issues

Community Reviews

From aitoolanalysis (Mar 2026): “The most promising and most frustrating editor I’ve ever used.” Free tier gives access to Claude Opus 4.6 ($200/mo value) for $0. But the rate limit crisis is real – Pro subscribers hit 7-day lockouts instead of 5-hour refresh.

From vibecoding.app: “One developer documented a single Claude Opus 4.6 session consuming 635 out of 1,000 credits. One or two complex coding sessions can trigger a lockout for the rest of the week.”

From claude-world.com: “Google Antigravity users reported multi-day account lockouts, a 92% free-tier quota cut, and pricing that pushes developers toward a $250/month plan.”

Personal Review (May 2026): I’ve used it extensively. The best thing is having Claude models (Sonnet & Opus) alongside Gemini in one IDE – I use Claude for backend and Gemini for frontend, each with separate limits. Gemini 3 Flash is genuinely capable for frontend work. The downside: limits are terrible. It was generous at the start but degraded badly – on some days I’d get only 5-10 requests with Claude models before hitting the wall. The $250 Ultra plan is absurdly priced. Worth trying the free tier for the multi-model access, but don’t rely on it for daily production work.


Xiaomi MiMo Token Plan Subscribe

Launched Apr 2 2026. Pure monthly credit pool — NO 5h windows, NO weekly limits.
Credits expire month-end, no rollover. Mid-month upgrades OK, downgrades not.
Now supports V2.5 model series — one subscription unlocks all 8 MiMo models.
Auto-renewal discounts: up to $144 off annual plans.

Plan Price Credits/Mo ~Tasks/mo (V2-Omni 1:1)
Lite $6/mo (¥39) 60,000,000 ~120
Standard $16/mo (¥99) 200,000,000 ~400
Pro $50/mo (¥329) 700,000,000 ~1,400
Max $100/mo (¥659) 1,600,000,000 ~3,200

Credit Multiplier by Model

Model Context Rate
MiMo-V2-Omni up to 256K 1 token = 1 Credit
MiMo-V2-Pro up to 256K 1 token = 2 Credits
MiMo-V2-Pro 256K–1M 1 token = 4 Credits
MiMo-V2.5 up to 256K 1 token = ~1 Credit (Omni-class)
MiMo-V2.5-Pro up to 256K 1 token = ~2 Credits
MiMo-V2-TTS n/a Free (limited time)

API rates (V2.5): $0.4/$2.00 per 1M (256K). 88% off first purchase.


Kimi / Moonshot AI Kimi Code

Two products: Kimi membership (app quotas) and Kimi Code (developer).
API billed separately — NOT included in membership.

Current model: K2.6 (released Apr 18–21 2026).
K2.5 predecessor: 1T params, 32B active, MoE, 384 experts, 256K ctx, MIT license.
K2 (original) discontinued — stops serving May 25, 2026 (6 days from now). Migrate to K2.5 or K2.6.

K2.6 Improvements

Capability K2.5 K2.6
SWE-Bench Pro 58.6%
Multilingual 76.7%
BrowseComp 78.4% 83.2%
Parallel sub-agents 100 300
Long-horizon Hours 4,000+ tool calls, 12+ hr

API Pricing (K2.6)

  Input (Cache Miss) Input (Cache Hit) Output
Per 1M tokens $0.95 $0.16 $4.00

Context window: 262,144 tokens.

Membership Plans (Kimi App)

Tier Price (annual/mo) Agent Usage/mo
Adagio Free 6
Moderato $15/mo 60
Allegretto $31/mo 150
Allegro $79/mo 360
Vivace $159/mo 720

Day-0 support on: vLLM, OpenRouter, Cloudflare, Baseten, MLX, Hermes, OpenCode.