AI Coding Plans — Pricing & Usage Reference

Last updated: 2026-05-20 | Auto-synced daily

Compiled April 2026. All plans compared across major providers.

ByteDance’s AI inference platform. Coding Plan = subscription with shared quota across supported tools. Now integrates with OpenClaw and Hermes Agent as supported coding tools.

Available Models

Model	Type	Notes
Dola-Seed-2.0-Pro	Coding (flagship)	ByteDance’s top-tier coding model. 256K context. Multimodal. Officially launched on BytePlus ModelArk May 2026
Dola-Seed-2.0-Lite	Coding (budget)	Cheaper, good for routine tasks
Ark-Coding-Pro	Coding	BytePlus proprietary coding model
O-Pro	Coding	BytePlus proprietary
ByteDance-Seed-Code	Coding	Code-specific optimization
GLM-5.1	Frontier (Z.ai)	754B MoE, SWE-Bench Pro 58.4%. Quota drain: ~3x
GLM-4.7	General (Z.ai)	Reliable daily driver. 1x quota
Kimi-K2.5	Frontier (Moonshot)	1T MoE, 256K ctx. Quota drain: ~2x
DeepSeek-V3.2	General	73.0% SWE-Bench Verified
GPT-OSS-120B	Open-weight	117B params, Apache 2.0

Quotas

Plan	Price	Per 5h	Per Week	Per Month
Lite	~$10/mo	~800	~3,200	~24,000
Pro	~$40/mo	~4,000	~16,000	~120,000

Quota shared across: Claude Code, Cursor, Cline, Codex CLI, Kilo Code, Roo Code, OpenCode, OpenClaw, Hermes Agent
5-hour sliding window + weekly reset. Monday 00:00 UTC+8
No overflow billing — depleted = wait for next cycle
Free tier: 500K tokens for new users (regular API only, NOT Coding Plan)
New Team plan now available for enterprise: scalability, management controls, digital employee capabilities

Personal Review (May 2026): The BytePlus-provided models like Dola-Seed-2.0-Pro, O-Pro, and Ark-Coding-Pro are significantly weaker than alternatives available for free on other platforms — e.g. MiniMax M2.5 (80.2% SWE-Bench, free via OpenRouter) easily outperforms them. The real issue is with frontier models like GLM-5.1: despite being listed as available, the quota multiplier (3x drain during peak hours) means you get only about 10–15 actual requests per 5h window. The advertised quotas (“800 per 5h”) are based on the cheapest base models at 1x drain — not usable on any capable model. In practice, the plan is misleading for anyone wanting to use frontier models. Not recommended unless you only need the weak ByteDance models for simple completions.

GitHub Copilot Plans

Source: docs.github.com, github.blog, winbuzzer.com

Plans (Current — May 2026)

Plan	Price	Premium Req/mo (old) / AI Credits (new)	Completions	Agent/CLI	Status
Free	$0	50	2,000/mo	Limited	Available
Student	Free (verified)	Unlimited (old only)	Unlimited	Yes	Signups paused since Apr 20
Pro	$10/mo ($100/yr)	300 PRU / 1,000 base + 500 flex credits	Unlimited	Yes	Signups paused since Apr 20
Pro+	$39/mo ($390/yr)	1,500 PRU / 3,900 base + 3,100 flex credits	Unlimited	Yes	Signups paused since Apr 20
Business	$19/seat/mo	$19 AI Credits/seat/mo ($30 promo Jun-Aug)	Unlimited	Yes	Available
Enterprise	$39/seat/mo	$39 AI Credits/seat/mo ($70 promo Jun-Aug)	Unlimited	Yes	Available
Max (new)	$100/mo	$100 base + $100 flex ($200 total)	Unlimited	Yes	Starts Jun 1, 2026

Key Changes (Apr–May 2026)

Usage-based billing: All Copilot plans transition to GitHub AI Credits on June 1, 2026. PRUs replaced by token-based metering
New signups paused: Pro, Pro+, and Student plans paused since Apr 20, 2026. No timeline for reopening
Max plan confirmed (May 12): $100/mo with $100 base + $100 flex = $200 total included usage. Heavy daily use tier
Flex allotments for Pro/Pro+: Pro gets $10 base + $5 flex = $15/mo. Pro+ gets $39 base + $31 flex = $70/mo. Flex allotments may vary over time
Opus removed from Pro: Claude Opus 4.7 is now exclusive to Pro+ ($39/mo). Pro users lost Opus access
Student plan downgraded: Claude Opus/Sonnet removed (Mar 14). GPT-5.3-Codex removed from manual picker (Apr 27). Only lower-tier models remain
Code completions remain unlimited on all paid plans — not billed in AI Credits
Copilot code review now also consumes GitHub Actions minutes

Per-Token Pricing (Jun 2026 — Usage-Based)

Per-token rates from docs.github.com. 1 AI credit = $0.01 USD. Over-usage billed at these rates.

OpenAI: | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | GPT-5.5 | $5.00 | $0.50 | $30.00 | | GPT-5.4 | $2.50 | $0.25 | $15.00 | | GPT-5.4 mini | $0.75 | $0.075 | $4.50 | | GPT-5.4 nano | $0.20 | $0.02 | $1.25 | | GPT-5.3-Codex | $1.75 | $0.175 | $14.00 | | GPT-5.2 | $1.75 | $0.175 | $14.00 | | GPT-5.2-Codex | $1.75 | $0.175 | $14.00 | | GPT-4.1 | $2.00 | $0.50 | $8.00 | | GPT-5 mini | $0.25 | $0.025 | $2.00 |

Anthropic (includes cache write cost): | Model | Input /1M | Cached Input | Cache Write | Output /1M | |——-|———-:|:————:|:———–:|———–:| | Claude Opus 4.7 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Opus 4.6 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Opus 4.5 | $5.00 | $0.50 | $6.25 | $25.00 | | Claude Sonnet 4.6 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Sonnet 4.5 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Sonnet 4 | $3.00 | $0.30 | $3.75 | $15.00 | | Claude Haiku 4.5 | $1.00 | $0.10 | $1.25 | $5.00 |

Google: | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | Gemini 3.1 Pro | $2.00 | $0.20 | $12.00 | | Gemini 2.5 Pro | $1.25 | $0.125 | $10.00 | | Gemini 3 Flash | $0.50 | $0.05 | $3.00 |

Fine-tuned (GitHub): | Model | Input /1M | Cached Input | Output /1M | |——-|———-:|:————:|———–:| | Raptor mini | $0.25 | $0.025 | $2.00 | | Goldeneye | $1.25 | $0.125 | $10.00 |

Plans Summary (Jun 2026)

Plan	Price	Base	Flex	Total
Free	$0	—	—	50 completions/mo + limited chat/agent
Pro	$10/mo	$10	$5	$15 total included
Pro+	$39/mo	$39	$31	$70 total included
Max	$100/mo	$100	$100	$200 total included
Business	$19/seat/mo	$19	—	Flex promo Jun-Aug
Enterprise	$39/seat/mo	$39	—	Flex promo Jun-Aug

Community Reviews

From techsifted (Apr 2026): “Microsoft paused new signups for Copilot Pro, Pro+, and Student plans. Then they removed Opus models from the Pro tier entirely. If you’d built workflows around Opus inside GitHub Copilot at $10/month, those workflows are broken now.”

From piunikaweb (Mar 2026): “To manage costs, GitHub has decided to remove access to flagship Claude models such as Opus and Sonnet, on the student plan.”

From Reddit (r/github): Users widely criticized the student plan downgrades and Pro signup pause. Many students reported being locked out of models they relied on for coursework.

Personal Review (May 2026): GitHub Copilot was the most reliable and good coding plan I used. A lot of models were available and the IDE integration was seamless. I used it on the Student plan which was fantastic initially. The bad part: they progressively removed all frontier models from the Student plan — Claude Opus/Sonnet went first, then GPT-5.3-Codex was pulled from the model picker. What started as a generous free plan for students got hollowed out over time. Now with usage-based billing coming June 1, the value proposition is even more uncertain — your $10 Pro gets you $10 worth of AI Credits, and heavy agent sessions will burn through that fast. Still a solid product for light use, but heavy users will need the $100 Max plan.

5h sliding window + weekly cap. Overseas price raised Apr 12 2026.
Powered by GLM-5 (Feb 2026) and GLM-5.1 (Mar 27 2026).

Plan	Domestic	Overseas	Per 5h	Per Week	Monthly Cap
Plan	Price (Quarterly)	Monthly Equivalent	Per 5h	Per Week	Notes
Lite	$30/quarter ($27 Q2)	~$10/mo	~80 prompts	~320	GLM-5.1, 5-Turbo, 4.7, 4.6, 4.5-Air
Pro	$90/quarter ($81 Q2)	~$30/mo	~400 prompts	~2,000	Unlocks GLM-5 flagship. ~5x Lite
Max	$240/quarter ($216 Q2)	~$80/mo	~1,600 prompts	~6,400	~4x Pro. Highest throughput

Model Quota Multipliers (Critical)

Model / Time	Multiplier	Effective Lite prompts/5h
GLM-5.1 / GLM-5-Turbo at peak (14-18 UTC+8)	3x drain	~26
GLM-5.1 / GLM-5-Turbo off-peak	2x drain	~40
GLM-5.1 / GLM-5-Turbo off-peak (promo ended Apr 30)	1x drain	~80
GLM-5 (base) off-peak	2x drain	~40
GLM-4.7 all times	1x drain	~80
GLM-4.5-Air all times	1x drain	~80

Each prompt invokes the model ~15–20 times internally
GLM-5.1 at 44.3 tok/sec is ~6x slower than Grok 4.20
MCP quotas: Lite 100/mo, Pro 1,000/mo, Max 4,000/mo
Q2 2026 quarterly discounts applied to all tiers
Pricing restructured Feb 2026: Promotional $3/mo first-purchase pricing removed. Overseas price hikes moderated with switch to quarterly billing. Q2 discount: Lite $27/qtr, Pro $81/qtr, Max $216/qtr
Free models (no sub): GLM-4.7-Flash, GLM-4.5-Flash

OpenCode’s official low-cost subscription. $5 first month, then $10/month. Dollar-based limits (not request counts). Models hosted in US, EU, Singapore. Zero-retention policy.

Window	Dollar Limit
Per 5 hours	$12
Per week	$30
Per month	$60

Available Models (12)

Source: opencode.ai/docs/go

Model	Provider	Per 5h	Per Week	Per Month	Notes
DeepSeek V4 Flash	DeepSeek	31,650	79,050	158,150	Cheapest, huge quota
Qwen3.5 Plus	Alibaba	10,200	25,200	50,500	Best raw request count
MiniMax M2.5	MiniMax	6,300	15,900	31,800	80.2% SWE-Bench
MiniMax M2.7	MiniMax	3,400	8,500	17,000	Newer, 196K ctx
Qwen3.6 Plus	Alibaba	3,300	8,200	16,300	78.8% SWE-Bench
DeepSeek V4 Pro	DeepSeek	3,450	8,550	17,150	80.6% SWE-Bench
MiMo-V2.5-Omni	Xiaomi	2,150	5,450	10,900	Multimodal
MiMo-V2.5-Pro	Xiaomi	1,290	3,225	6,450	Coding focus
Kimi K2.5	Moonshot	1,850	4,630	9,250	256K ctx, 76.8% SWE-Bench
Kimi K2.6	Moonshot	1,150	2,880	5,750	Updated K2.5, 58.6% SWE-Bench Pro
GLM-5	Z.ai	1,150	2,880	5,750	77.8% SWE-Bench
GLM-5.1	Z.ai	880	2,150	4,300	Best reasoning. 58.4% SWE-Bench Pro

Free Model

Big Pickle (~GLM-4.6, 200K ctx): 200 requests/5h - no subscription needed.

Community Reviews

From Reddit (Mar 2026): Described as “genuinely the worst coding plan I have ever used” - 94% upvotes. Criticism centered on quantized models and aggressive rate limits on reasoning models. (r/LocalLLaMA)

From APIYI review: “Getting three flagship open-source models for $10/month … you’re getting 6x the model invocation value. MiniMax M2.5 is the most cost-effective choice - it has the highest limits and the strongest coding ability.”

From Thomas Wiegold (Apr 2026): “MiniMax gives you a good number of requests … up to 31,800 per month with M2.5. These aren’t toy models - M2.5 scored 80.2% on SWE-Bench Verified, within spitting distance of Claude Opus 4.6’s 80.8%. The catch is that reasoning-heavy models like GLM-5.1 burn through limits fast.”

Personal Review (May 2026): Best and cheapest coding plan available. Generous quotas on MiniMax and Qwen models, super fast inference. Mostly reliable with top open-weight models like GLM-5.1 and Kimi K2.6 at good limits. MiniMax M2.5 at $10/month is unmatched value. DeepSeek V4 Flash gets 158K requests/month - absurdly cheap. Strongly recommended.

Notes

API endpoint: opencode.ai/zen/go/v1/
Cancel anytime. Top-up credit if needed.
Optional “Use balance” fallback to Zen credits after limits reached
Models are periodically rotated as new ones are tested

Claude Code (Anthropic) Pricing

Terminal coding agent. All usage shared across claude.ai, Claude Code CLI, and Desktop. Source: claude.com/pricing, community instrumentation.

Individual Plans

Plan	Price	Multiplier	Est. Msgs/5h (Opus)	Context	CLI
Pro	$20/mo ($17/yr)	1x (doubled)	~45-90	200K (1M ext)	Yes
Max 5x	$100/mo	5x (doubled)	~225-450	1M	Yes
Max 20x	$200/mo	20x (doubled)	~900-1,800	1M	Yes

Team Plans

Plan	Price	Multiplier	Weekly Cap	Min Seats
Team Standard	$25/seat/mo ($20 annual)	~1.25x Pro (doubled)	7-day, 1 cap	5
Team Premium	$125/seat/mo ($100 annual)	~5x Pro	7-day, 2 caps	5-150

Max 5x: ~$0.44/message (Opus). Max 20x: ~$0.22/message – actual volume discount kicks in here
Max 5x: auto-switch Opus->Sonnet at 20% limit. Max 20x: at 50%
Peak hours throttling removed as of May 2026 (SpaceX Colossus 1 deal)
Team Standard added Claude Code access late April 2026 (was Premium-only before)
Pro had Claude Code briefly removed (Apr 2026 test on 2% of signups) then restored within hours

API Pay-per-Token

Model	Input /1M	Output /1M	Cache Read	Batch (50% off)
Opus 4.7	$5.00	$25.00	$0.50	Yes
Sonnet 4.6	$3.00	$15.00	$0.30	Yes
Haiku 4.5	$1.00	$5.00	$0.10	Yes

Recent Changes (May 2026)

Rate limits doubled for Pro, Max, Team, and Enterprise plans (SpaceX Colossus 1 deal — 220K+ NVIDIA GPUs, 300+ MW capacity). Announced May 6 at Code with Claude event
Peak hours throttling removed — same rate limits at 3 AM and 3 PM
Opus API rate limits raised — higher maximums across all API tiers
Managed Agents launched — Dreaming, Outcomes, multi-agent orchestration now available at claude.ai/code
Claude Cowork now bundled into Pro (visual canvas-based workspace, GA since Apr 2026)

Known Issues

v2.1.100 bug (Apr 2026): ~20K invisible tokens added per request, burning quota ~40% faster. Root cause: broken prompt caching forcing full re-processing on every turn. Six releases shipped through v2.1.133 (May 8) with features but no public fix. Workaround: downgrade to v2.1.34 or reinstall via npm
Opus 4.7 tokenizer: New tokenizer can use up to 35% more tokens for equivalent text vs Opus 4.6
5-hour rolling window: Hit limit at 2pm? Wait until 7pm. Not a daily reset

Community Reviews

From Reddit (r/ClaudeAI): “The difference of Claude Pro and Max5 plan usage limit are enormous. It is not only 5x” – users report Max 5x is a dramatically better experience for daily coding, not just numerically 5x better.

From dev.to review: “Pro costs ~$0.44 per Opus message; Max 5x costs the same per message but prevents hitting rate limits sooner. Max 20x cuts per-message cost in half (~$0.22) and is the only plan offering true volume discount.”

From findskill.ai: “Pro $20/mo for solo devs, Max 5x $100/mo for full-time on Claude Code, Max 20x $200/mo for pair-programming all day. The v2.1.100 token inflation bug is real – burns quotas ~40% faster.”

From felloai review: “Max 5x at $100/month gives five times the Pro usage and priority access during peak demand. It is the sweet spot for full-time developers who use Claude Code as their default coding partner.”

Personal Review (May 2026): Claude Code Pro ($20/mo) is decent for light use but the 5-hour rolling window makes it impractical for serious daily development – you WILL hit the wall. The good news: Anthropic just doubled rate limits and removed peak hours throttling (thanks to SpaceX Colossus 1), so Pro is now much more usable — ~90 Opus messages per 5h instead of 45. Max 5x ($100/mo) is still the minimum for real work, but the gap is narrower. On the plus side, code quality is unmatched – Opus 4.7 is genuinely the smartest model for complex refactoring and architecture decisions. The main frustration is Anthropic’s opaque quota system and the tokenizer/v2.1.100 bugs silently eating into your limits. Compared to OpenCode Go ($10/mo with MiniMax M2.5), Claude Code is still 5-10x the price for incrementally better quality. Only worth it if you need Opus-level reasoning for complex codebases. The v2.1.100 bug may have been fixed — keep an eye on releases.

GPT Codex (OpenAI) Pricing

OpenAI’s agentic coding tool — CLI, IDE extension, ChatGPT app, and cloud. Codex is bundled into ChatGPT plans (no separate subscription).
Source: developers.openai.com/codex/pricing

ChatGPT Plans (Codex Included)

Plan	Price	Best For
Free	$0	Limited Codex Mini access
Go	$20/mo	Budget-friendly Codex Mini (select regions). Reduced limits
Plus	$20/mo	Entry point for Codex CLI + IDE + cloud
Pro ($100)	$100/mo	5x (10x promo until May 31) higher limits. Includes GPT-5.3-Codex-Spark preview
Pro ($200)	$200/mo	20x higher limits. Heavy daily use
Business	$25/seat/mo	Team workspace + admin controls
API Key	Pay-per-token	CI/CD, automation, programmatic

Codex Usage Limits (Plus — 5-hour rolling window)

Model	Local Msgs/5h	Cloud Tasks/5h	Code Reviews/5h
GPT-5.5	15-80	—	—
GPT-5.4	20-100	—	—
GPT-5.4-mini	60-350	—	—
GPT-5.3-Codex	30-150	10-60	20-50

Pro tiers: 5x to 20x these limits depending on tier. Promo: Pro $100 gets 2x (10x total) until May 31, 2026.

API Pay-per-Token (Codex API)

Model	Input /1M	Output /1M
GPT-5.5	$2.50	$10.00
GPT-5.5 mini	$0.40	$1.60
GPT-5.5 nano	$0.10	$0.40
Codex (specialized)	$5.00	$15.00

Notes

Usage shared across all Codex surfaces (CLI, IDE, ChatGPT, cloud)
Soft caps slow you down; hard caps cut off until window resets
Extra credits can be purchased on Plus/Pro plans
Pro $100 2x promo expires May 31 2026
Go plan ($20/mo): Budget Codex Mini tier available in select regions. Reduced limits, same models

Community Reviews

From aitoolsrecap (2026): “The April 16, 2026 update added computer use, memory, image generation, and 90+ plugins – turning it from a coding assistant into a full developer workstation. Included from $20/mo on ChatGPT Plus.”

From elite-ai-assisted-coding review: “Codex CLI performed well throughout testing. GPT-5’s capabilities were the key differentiator – the model followed instructions precisely, learned from documentation effectively, and applied knowledge appropriately.”

Personal Review (May 2026): Overall positive experience. The limits are fine and good enough for daily use – the mini models (GPT-5.4-mini) were the best thing about the plan, offering generous quotas for routine tasks. The main problem is with the frontier models (GPT-5.4 and GPT-5.5): when they understand you correctly from the start, they work really well and do exactly what’s asked. But if they misinterpret the intent, things go spectacularly wrong – they commit to wrong approaches confidently and produce deeply broken code. It’s an all-or-nothing experience. The Plus plan at $20/mo is reasonable value, but Pro at $200/mo is hard to justify vs OpenCode Go or BytePlus.

Antigravity (Google IDE) Download - Pricing

Google’s agent-first IDE (fork of VS Code). Runs multiple AI agents in parallel.
Includes Claude models (Sonnet & Opus) alongside Gemini — all in one IDE.
Source: antigravity.google/pricing

Plans

Plan	Price	Models	Notes
Free	$0	All models (rate-limited)	Access to all 7 models. Quota refreshes ~5h (unreliable). 92% quota cut since Dec 2025
AI Pro	$20/mo	All models (higher limits)	Built-in credits (amount undisclosed). Best for hobbyists. Multi-day lockouts reported
AI Ultra	$249.99/mo	All models (highest limits)	Most expensive AI coding sub on the market. Still not immune to quota issues (Mar 2026+)
Pay-as-you-go	$25/2,500 credits	Full access	Overflow / burst usage. $0.01/credit

Available Models

Model	Provider	Type	Notes
Gemini 3.1 Pro	Google	Reasoning (High/Low)	Google’s flagship. Good for frontend work only
Gemini 3 Flash	Google	Fast	Very capable for frontend. Reliable
Claude Opus 4.6	Anthropic	Premium	$200/mo value, available here. Best for backend
Claude Sonnet 4.6	Anthropic	Mid-tier	Fast, reliable for backend code
GPT-OSS 120B	OpenAI	Open-weight	Free model

Usage Tips

Use Claude models for backend work — they handle complex logic, refactoring, and architecture
Use Gemini models for frontend — Gemini 3 Flash in particular is excellent for UI work
Never use Gemini for backend or complex logic tasks — it performs poorly
Limits are separate per provider — running out on Claude? Switch to Gemini and keep working
Multi-agent missions can assign different models to different agents within the same task

Known Issues

Quota cuts: Free tier quotas cut 92% since Dec 2025 (250 -> 20 requests/day)
Pro lockouts: Multiple reports of 7-day lockouts even with low usage. Advertised 5-hour refresh unreliable
Credit system opaque: Credit-to-token conversion rate undisclosed. Per-model credit costs unknown
Ultra lockouts: Even $250/mo Ultra users report unexpected quota restrictions since Mar 2026

Community Reviews

From aitoolanalysis (Mar 2026): “The most promising and most frustrating editor I’ve ever used.” Free tier gives access to Claude Opus 4.6 ($200/mo value) for $0. But the rate limit crisis is real – Pro subscribers hit 7-day lockouts instead of 5-hour refresh.

From vibecoding.app: “One developer documented a single Claude Opus 4.6 session consuming 635 out of 1,000 credits. One or two complex coding sessions can trigger a lockout for the rest of the week.”

From claude-world.com: “Google Antigravity users reported multi-day account lockouts, a 92% free-tier quota cut, and pricing that pushes developers toward a $250/month plan.”

Personal Review (May 2026): I’ve used it extensively. The best thing is having Claude models (Sonnet & Opus) alongside Gemini in one IDE – I use Claude for backend and Gemini for frontend, each with separate limits. Gemini 3 Flash is genuinely capable for frontend work. The downside: limits are terrible. It was generous at the start but degraded badly – on some days I’d get only 5-10 requests with Claude models before hitting the wall. The $250 Ultra plan is absurdly priced. Worth trying the free tier for the multi-model access, but don’t rely on it for daily production work.

Launched Apr 2 2026. Pure monthly credit pool — NO 5h windows, NO weekly limits.
Credits expire month-end, no rollover. Mid-month upgrades OK, downgrades not.
Now supports V2.5 model series — one subscription unlocks all 8 MiMo models.
Auto-renewal discounts: up to $144 off annual plans.

Plan	Price	Credits/Mo	~Tasks/mo (V2-Omni 1:1)
Lite	$6/mo (¥39)	60,000,000	~120
Standard	$16/mo (¥99)	200,000,000	~400
Pro	$50/mo (¥329)	700,000,000	~1,400
Max	$100/mo (¥659)	1,600,000,000	~3,200

Credit Multiplier by Model

Model	Context	Rate
MiMo-V2-Omni	up to 256K	1 token = 1 Credit
MiMo-V2-Pro	up to 256K	1 token = 2 Credits
MiMo-V2-Pro	256K–1M	1 token = 4 Credits
MiMo-V2.5	up to 256K	1 token = ~1 Credit (Omni-class)
MiMo-V2.5-Pro	up to 256K	1 token = ~2 Credits
MiMo-V2-TTS	n/a	Free (limited time)

API rates (V2.5): $0.4/$2.00 per 1M (256K). 88% off first purchase.

Kimi / Moonshot AI Kimi Code

Two products: Kimi membership (app quotas) and Kimi Code (developer).
API billed separately — NOT included in membership.

Current model: K2.6 (released Apr 18–21 2026).
K2.5 predecessor: 1T params, 32B active, MoE, 384 experts, 256K ctx, MIT license.
K2 (original) discontinued — stops serving May 25, 2026 (6 days from now). Migrate to K2.5 or K2.6.

K2.6 Improvements

Capability	K2.5	K2.6
SWE-Bench Pro	—	58.6%
Multilingual	—	76.7%
BrowseComp	78.4%	83.2%
Parallel sub-agents	100	300
Long-horizon	Hours	4,000+ tool calls, 12+ hr

API Pricing (K2.6)

	Input (Cache Miss)	Input (Cache Hit)	Output
Per 1M tokens	$0.95	$0.16	$4.00

Context window: 262,144 tokens.

Membership Plans (Kimi App)

Tier	Price (annual/mo)	Agent Usage/mo
Adagio	Free	6
Moderato	$15/mo	60
Allegretto	$31/mo	150
Allegro	$79/mo	360
Vivace	$159/mo	720

Day-0 support on: vLLM, OpenRouter, Cloudflare, Baseten, MLX, Hermes, OpenCode.

AI Coding Plans — Pricing & Usage Reference

BytePlus ModelArk Subscribe

Available Models

Quotas

GitHub Copilot Plans

Plans (Current — May 2026)

Key Changes (Apr–May 2026)

Per-Token Pricing (Jun 2026 — Usage-Based)

Plans Summary (Jun 2026)

Community Reviews

GLM / Z.ai Subscribe

Model Quota Multipliers (Critical)

OpenCode Go Plan Subscribe

Available Models (12)

Free Model

Community Reviews

Notes

Claude Code (Anthropic) Pricing

Individual Plans

Team Plans

API Pay-per-Token

Recent Changes (May 2026)

Known Issues

Community Reviews

GPT Codex (OpenAI) Pricing

ChatGPT Plans (Codex Included)

Codex Usage Limits (Plus — 5-hour rolling window)

API Pay-per-Token (Codex API)

Notes

Community Reviews

Antigravity (Google IDE) Download - Pricing

Plans

Available Models

Usage Tips

Known Issues

Community Reviews

Xiaomi MiMo Token Plan Subscribe

Credit Multiplier by Model

Kimi / Moonshot AI Kimi Code

K2.6 Improvements

API Pricing (K2.6)

Membership Plans (Kimi App)