AI Coding Plans — Pricing & Usage Reference
Compiled April 2026. All plans compared across major providers.
BytePlus ModelArk
ByteDance’s AI inference platform. Coding Plan = subscription with shared quota across supported tools. Now integrates with OpenClaw and Hermes Agent as supported coding tools.
Available Models
| Model | Type | Notes |
|---|---|---|
| Dola-Seed-2.0-Pro | Coding (flagship) | ByteDance’s top-tier coding model. 256K context. Multimodal. Officially launched on BytePlus ModelArk May 2026 |
| Dola-Seed-2.0-Lite | Coding (budget) | Cheaper, good for routine tasks |
| Ark-Coding-Pro | Coding | BytePlus proprietary coding model |
| O-Pro | Coding | BytePlus proprietary |
| ByteDance-Seed-Code | Coding | Code-specific optimization |
| GLM-5.1 | Frontier (Z.ai) | 754B MoE, SWE-Bench Pro 58.4%. Quota drain: ~3x |
| GLM-4.7 | General (Z.ai) | Reliable daily driver. 1x quota |
| Kimi-K2.5 | Frontier (Moonshot) | 1T MoE, 256K ctx. Quota drain: ~2x |
| DeepSeek-V3.2 | General | 73.0% SWE-Bench Verified |
| GPT-OSS-120B | Open-weight | 117B params, Apache 2.0 |
Quotas
| Plan | Price | Per 5h | Per Week | Per Month |
|---|---|---|---|---|
| Lite | ~$10/mo | ~800 | ~3,200 | ~24,000 |
| Pro | ~$40/mo | ~4,000 | ~16,000 | ~120,000 |
- Quota shared across: Claude Code, Cursor, Cline, Codex CLI, Kilo Code, Roo Code, OpenCode, OpenClaw, Hermes Agent
- 5-hour sliding window plus a weekly reset (Monday 00:00 UTC+8)
- No overflow billing — depleted = wait for next cycle
- Free tier: 500K tokens for new users (regular API only, NOT Coding Plan)
- New Team plan now available for enterprise: scalability, management controls, digital employee capabilities
Personal Review (May 2026): The BytePlus-provided models like Dola-Seed-2.0-Pro, O-Pro, and Ark-Coding-Pro are significantly weaker than alternatives available for free on other platforms — e.g. MiniMax M2.5 (80.2% SWE-Bench, free via OpenRouter) easily outperforms them. The real issue is with frontier models like GLM-5.1: despite being listed as available, the quota multiplier (3x drain during peak hours) means you get only about 10–15 actual requests per 5h window. The advertised quotas (“800 per 5h”) are based on the cheapest base models at 1x drain — not usable on any capable model. In practice, the plan is misleading for anyone wanting to use frontier models. Not recommended unless you only need the weak ByteDance models for simple completions.
GitHub Copilot Plans
Source: docs.github.com, github.blog, winbuzzer.com
Plans (Current — May 2026)
| Plan | Price | Premium Req/mo (old) / AI Credits (new) | Completions | Agent/CLI | Status |
|---|---|---|---|---|---|
| Free | $0 | 50 | 2,000/mo | Limited | Available |
| Student | Free (verified) | Unlimited (old only) | Unlimited | Yes | Signups paused since Apr 20 |
| Pro | $10/mo ($100/yr) | 300 PRU / 1,000 base + 500 flex credits | Unlimited | Yes | Signups paused since Apr 20 |
| Pro+ | $39/mo ($390/yr) | 1,500 PRU / 3,900 base + 3,100 flex credits | Unlimited | Yes | Signups paused since Apr 20 |
| Business | $19/seat/mo | $19 AI Credits/seat/mo ($30 promo Jun-Aug) | Unlimited | Yes | Available |
| Enterprise | $39/seat/mo | $39 AI Credits/seat/mo ($70 promo Jun-Aug) | Unlimited | Yes | Available |
| Max (new) | $100/mo | $100 base + $100 flex ($200 total) | Unlimited | Yes | Starts Jun 1, 2026 |
Key Changes (Apr–May 2026)
- Usage-based billing: All Copilot plans transition to GitHub AI Credits on June 1, 2026. PRUs replaced by token-based metering
- New signups paused: Pro, Pro+, and Student plans paused since Apr 20, 2026. No timeline for reopening
- Max plan confirmed (May 12): $100/mo with $100 base + $100 flex = $200 total included usage. Heavy daily use tier
- Flex allotments for Pro/Pro+: Pro gets $10 base + $5 flex = $15/mo. Pro+ gets $39 base + $31 flex = $70/mo. Flex allotments may vary over time
- Opus removed from Pro: Claude Opus 4.7 is now exclusive to Pro+ ($39/mo). Pro users lost Opus access
- Student plan downgraded: Claude Opus/Sonnet removed (Mar 14). GPT-5.3-Codex removed from manual picker (Apr 27). Only lower-tier models remain
- Code completions remain unlimited on all paid plans — not billed in AI Credits
- Copilot code review now also consumes GitHub Actions minutes
Per-Token Pricing (Jun 2026 — Usage-Based)
Per-token rates from docs.github.com. 1 AI credit = $0.01 USD. Over-usage billed at these rates.
OpenAI:
| Model | Input /1M | Cached Input | Output /1M |
|---|---:|:---:|---:|
| GPT-5.5 | $5.00 | $0.50 | $30.00 |
| GPT-5.4 | $2.50 | $0.25 | $15.00 |
| GPT-5.4 mini | $0.75 | $0.075 | $4.50 |
| GPT-5.4 nano | $0.20 | $0.02 | $1.25 |
| GPT-5.3-Codex | $1.75 | $0.175 | $14.00 |
| GPT-5.2 | $1.75 | $0.175 | $14.00 |
| GPT-5.2-Codex | $1.75 | $0.175 | $14.00 |
| GPT-4.1 | $2.00 | $0.50 | $8.00 |
| GPT-5 mini | $0.25 | $0.025 | $2.00 |
Anthropic (includes cache write cost):
| Model | Input /1M | Cached Input | Cache Write | Output /1M |
|---|---:|:---:|:---:|---:|
| Claude Opus 4.7 | $5.00 | $0.50 | $6.25 | $25.00 |
| Claude Opus 4.6 | $5.00 | $0.50 | $6.25 | $25.00 |
| Claude Opus 4.5 | $5.00 | $0.50 | $6.25 | $25.00 |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Sonnet 4.5 | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Sonnet 4 | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Haiku 4.5 | $1.00 | $0.10 | $1.25 | $5.00 |
Google:
| Model | Input /1M | Cached Input | Output /1M |
|---|---:|:---:|---:|
| Gemini 3.1 Pro | $2.00 | $0.20 | $12.00 |
| Gemini 2.5 Pro | $1.25 | $0.125 | $10.00 |
| Gemini 3 Flash | $0.50 | $0.05 | $3.00 |
Fine-tuned (GitHub):
| Model | Input /1M | Cached Input | Output /1M |
|---|---:|:---:|---:|
| Raptor mini | $0.25 | $0.025 | $2.00 |
| Goldeneye | $1.25 | $0.125 | $10.00 |
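To make the credit arithmetic concrete, here is a minimal Python sketch that converts one request's token usage into AI Credits at the listed rates, using 1 AI credit = $0.01. The token counts are hypothetical and the function and dictionary names are illustrative. It assumes cached tokens are billed at the cached rate and the rest of the input at the full rate, and it ignores cache-write charges; the actual billing rules may differ.

```python
# Rates in USD per 1M tokens, copied from the tables above (subset).
RATES = {
    "Claude Sonnet 4.6": {"input": 3.00, "cached": 0.30, "output": 15.00},
    "GPT-5.4 mini": {"input": 0.75, "cached": 0.075, "output": 4.50},
}

USD_PER_CREDIT = 0.01  # 1 AI credit = $0.01


def request_cost_usd(model, input_tokens, cached_tokens, output_tokens):
    """Estimate the USD cost of one request.

    Assumption: cached_tokens are billed at the cached rate and the
    remaining input tokens at the full input rate. Cache writes are ignored.
    """
    r = RATES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * r["input"]
             + cached_tokens * r["cached"]
             + output_tokens * r["output"])
    return total / 1_000_000


def usd_to_credits(usd):
    """Convert a USD amount into AI Credits."""
    return usd / USD_PER_CREDIT


# Hypothetical request: 100K input tokens (none cached), 10K output tokens.
cost = request_cost_usd("Claude Sonnet 4.6", 100_000, 0, 10_000)
print(f"${cost:.2f} = {usd_to_credits(cost):.0f} credits")
```

Under these assumptions the example request costs about $0.45 (45 credits), so the $15 of included usage on Pro would cover roughly 33 requests of that size.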
Plans Summary (Jun 2026)
| Plan | Price | Base | Flex | Total |
|---|---|---|---|---|
| Free | $0 | — | — | 2,000 completions/mo + limited chat/agent |
| Pro | $10/mo | $10 | $5 | $15 total included |
| Pro+ | $39/mo | $39 | $31 | $70 total included |
| Max | $100/mo | $100 | $100 | $200 total included |
| Business | $19/seat/mo | $19 | — | Flex promo Jun-Aug |
| Enterprise | $39/seat/mo | $39 | — | Flex promo Jun-Aug |
Community Reviews
From techsifted (Apr 2026): “Microsoft paused new signups for Copilot Pro, Pro+, and Student plans. Then they removed Opus models from the Pro tier entirely. If you’d built workflows around Opus inside GitHub Copilot at $10/month, those workflows are broken now.”
From piunikaweb (Mar 2026): “To manage costs, GitHub has decided to remove access to flagship Claude models such as Opus and Sonnet, on the student plan.”
From Reddit (r/github): Users widely criticized the student plan downgrades and Pro signup pause. Many students reported being locked out of models they relied on for coursework.
Personal Review (May 2026): GitHub Copilot was the most reliable coding plan I have used. Many models were available and the IDE integration was seamless. I used it on the Student plan, which was fantastic initially. The bad part: they progressively removed all frontier models from the Student plan. Claude Opus/Sonnet went first, then GPT-5.3-Codex was pulled from the model picker. What started as a generous free plan for students got hollowed out over time. Now, with usage-based billing coming June 1, the value proposition is even more uncertain: the $10 Pro plan includes $15 of AI Credits ($10 base + $5 flex), and heavy agent sessions will burn through that fast. Still a solid product for light use, but heavy users will need the $100 Max plan.
GLM / Z.ai
5h sliding window + weekly cap. Overseas price raised Apr 12 2026.
Powered by GLM-5 (Feb 2026) and GLM-5.1 (Mar 27 2026).
| Plan | Price (Quarterly) | Monthly Equivalent | Per 5h | Per Week | Notes |
|---|---|---|---|---|---|
| Lite | $30/quarter ($27 Q2) | ~$10/mo | ~80 prompts | ~320 | GLM-5.1, 5-Turbo, 4.7, 4.6, 4.5-Air |
| Pro | $90/quarter ($81 Q2) | ~$30/mo | ~400 prompts | ~2,000 | Unlocks GLM-5 flagship. ~5x Lite |
| Max | $240/quarter ($216 Q2) | ~$80/mo | ~1,600 prompts | ~6,400 | ~4x Pro. Highest throughput |
Model Quota Multipliers (Critical)
| Model / Time | Multiplier | Effective Lite prompts/5h |
|---|---|---|
| GLM-5.1 / GLM-5-Turbo at peak (14-18 UTC+8) | 3x drain | ~26 |
| GLM-5.1 / GLM-5-Turbo off-peak | 2x drain | ~40 |
| GLM-5.1 / GLM-5-Turbo off-peak, promo rate (ended Apr 30) | 1x drain | ~80 |
| GLM-5 (base) off-peak | 2x drain | ~40 |
| GLM-4.7 all times | 1x drain | ~80 |
| GLM-4.5-Air all times | 1x drain | ~80 |
- Each prompt invokes the model ~15–20 times internally
- GLM-5.1 at 44.3 tok/sec is ~6x slower than Grok 4.20
- MCP quotas: Lite 100/mo, Pro 1,000/mo, Max 4,000/mo
- Q2 2026 quarterly discounts applied to all tiers
- Pricing restructured Feb 2026: Promotional $3/mo first-purchase pricing removed. Overseas price hikes moderated with switch to quarterly billing. Q2 discount: Lite $27/qtr, Pro $81/qtr, Max $216/qtr
- Free models (no sub): GLM-4.7-Flash, GLM-4.5-Flash
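The "Effective Lite prompts/5h" column follows from dividing the nominal quota by the drain multiplier. A minimal sketch, assuming the multiplier applies uniformly to every prompt in the window and that partial prompts round down (the dictionary keys and function name are illustrative):

```python
import math

LITE_PROMPTS_PER_5H = 80  # nominal Lite quota from the plan table

# Drain multipliers from the table above (subset).
MULTIPLIERS = {
    "GLM-5.1 peak": 3,
    "GLM-5.1 off-peak": 2,
    "GLM-4.7": 1,
}


def effective_prompts(nominal, multiplier):
    """Prompts available per window when each prompt drains `multiplier` units."""
    return math.floor(nominal / multiplier)


for name, multiplier in MULTIPLIERS.items():
    print(name, effective_prompts(LITE_PROMPTS_PER_5H, multiplier))
```

The same division applied to Pro (400 nominal prompts) gives about 133 peak-hour GLM-5.1 prompts per 5h window.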
OpenCode Go Plan
OpenCode’s official low-cost subscription. $5 first month, then $10/month. Dollar-based limits (not request counts). Models hosted in US, EU, Singapore. Zero-retention policy.
| Window | Dollar Limit |
|---|---|
| Per 5 hours | $12 |
| Per week | $30 |
| Per month | $60 |
Available Models (12)
Source: opencode.ai/docs/go
| Model | Provider | Per 5h | Per Week | Per Month | Notes |
|---|---|---|---|---|---|
| DeepSeek V4 Flash | DeepSeek | 31,650 | 79,050 | 158,150 | Cheapest, huge quota |
| Qwen3.5 Plus | Alibaba | 10,200 | 25,200 | 50,500 | Best raw request count |
| MiniMax M2.5 | MiniMax | 6,300 | 15,900 | 31,800 | 80.2% SWE-Bench |
| MiniMax M2.7 | MiniMax | 3,400 | 8,500 | 17,000 | Newer, 196K ctx |
| Qwen3.6 Plus | Alibaba | 3,300 | 8,200 | 16,300 | 78.8% SWE-Bench |
| DeepSeek V4 Pro | DeepSeek | 3,450 | 8,550 | 17,150 | 80.6% SWE-Bench |
| MiMo-V2.5-Omni | Xiaomi | 2,150 | 5,450 | 10,900 | Multimodal |
| MiMo-V2.5-Pro | Xiaomi | 1,290 | 3,225 | 6,450 | Coding focus |
| Kimi K2.5 | Moonshot | 1,850 | 4,630 | 9,250 | 256K ctx, 76.8% SWE-Bench |
| Kimi K2.6 | Moonshot | 1,150 | 2,880 | 5,750 | Updated K2.5, 58.6% SWE-Bench Pro |
| GLM-5 | Z.ai | 1,150 | 2,880 | 5,750 | 77.8% SWE-Bench |
| GLM-5.1 | Z.ai | 880 | 2,150 | 4,300 | Best reasoning. 58.4% SWE-Bench Pro |
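Because the limits are dollar-based, the per-model request counts imply an average cost per request. The sketch below divides the $12 per-5h limit by the listed counts. It assumes the counts were derived from the dollar limit at a fixed average request size, which is an inference from the numbers and not something stated in the source.

```python
DOLLAR_LIMIT_5H = 12.00  # USD per 5-hour window, from the limits table

# Requests per 5h from the model table above (subset).
REQUESTS_PER_5H = {
    "DeepSeek V4 Flash": 31_650,
    "MiniMax M2.5": 6_300,
    "GLM-5.1": 880,
}


def implied_cost_per_request(dollar_limit, requests):
    """Average USD per request implied by a dollar limit and a request count."""
    return dollar_limit / requests


for model, count in REQUESTS_PER_5H.items():
    cents = implied_cost_per_request(DOLLAR_LIMIT_5H, count) * 100
    print(f"{model}: ~{cents:.3f} cents/request")
```

On these figures a GLM-5.1 request is priced at roughly 36 times a DeepSeek V4 Flash request (31,650 / 880), which is why reasoning-heavy models exhaust the window so much sooner.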
Free Model
Big Pickle (~GLM-4.6, 200K ctx): 200 requests/5h - no subscription needed.
Community Reviews
From Reddit (Mar 2026): Described as “genuinely the worst coding plan I have ever used” - 94% upvotes. Criticism centered on quantized models and aggressive rate limits on reasoning models. (r/LocalLLaMA)
From APIYI review: “Getting three flagship open-source models for $10/month … you’re getting 6x the model invocation value. MiniMax M2.5 is the most cost-effective choice - it has the highest limits and the strongest coding ability.”
From Thomas Wiegold (Apr 2026): “MiniMax gives you a good number of requests … up to 31,800 per month with M2.5. These aren’t toy models - M2.5 scored 80.2% on SWE-Bench Verified, within spitting distance of Claude Opus 4.6’s 80.8%. The catch is that reasoning-heavy models like GLM-5.1 burn through limits fast.”
Personal Review (May 2026): Best and cheapest coding plan available. Generous quotas on MiniMax and Qwen models, super fast inference. Mostly reliable with top open-weight models like GLM-5.1 and Kimi K2.6 at good limits. MiniMax M2.5 at $10/month is unmatched value. DeepSeek V4 Flash gets 158K requests/month - absurdly cheap. Strongly recommended.
Notes
- API endpoint: opencode.ai/zen/go/v1/
- Cancel anytime. Top-up credit if needed.
- Optional “Use balance” fallback to Zen credits after limits reached
- Models are periodically rotated as new ones are tested
Claude Code (Anthropic) Pricing
Terminal coding agent. All usage shared across claude.ai, Claude Code CLI, and Desktop. Source: claude.com/pricing, community instrumentation.
Individual Plans
| Plan | Price | Multiplier | Est. Msgs/5h (Opus) | Context | CLI |
|---|---|---|---|---|---|
| Pro | $20/mo ($17/mo billed annually) | 1x (doubled) | ~45-90 | 200K (1M ext) | Yes |
| Max 5x | $100/mo | 5x (doubled) | ~225-450 | 1M | Yes |
| Max 20x | $200/mo | 20x (doubled) | ~900-1,800 | 1M | Yes |
Team Plans
| Plan | Price | Multiplier | Weekly Cap | Min Seats |
|---|---|---|---|---|
| Team Standard | $25/seat/mo ($20 annual) | ~1.25x Pro (doubled) | 7-day, 1 cap | 5 |
| Team Premium | $125/seat/mo ($100 annual) | ~5x Pro | 7-day, 2 caps | 5-150 |
- Max 5x: ~$0.44/message (Opus). Max 20x: ~$0.22/message – actual volume discount kicks in here
- Max 5x: auto-switch Opus->Sonnet at 20% limit. Max 20x: at 50%
- Peak hours throttling removed as of May 2026 (SpaceX Colossus 1 deal)
- Team Standard added Claude Code access late April 2026 (was Premium-only before)
- Pro had Claude Code briefly removed (Apr 2026 test on 2% of signups) then restored within hours
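The per-message figures above match the monthly price divided by the lower-bound Opus message estimate per 5-hour window. That is an inference from the numbers, not a documented formula, and because the window refills many times a month it is best read as a ratio for comparing tiers, not a true cost per message. A minimal sketch:

```python
# plan: (monthly price in USD, lower-bound Opus messages per 5h window)
PLANS = {
    "Pro": (20, 45),
    "Max 5x": (100, 225),
    "Max 20x": (200, 900),
}


def price_per_window_message(price, messages):
    """Monthly price divided by messages available in one 5h window."""
    return price / messages


for plan, (price, messages) in PLANS.items():
    print(f"{plan}: ~${price_per_window_message(price, messages):.2f}")
```

Pro and Max 5x come out equal at about $0.44, and only Max 20x drops to about $0.22. Using the upper (doubled) estimates instead halves every ratio but leaves the relationship between tiers unchanged.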
API Pay-per-Token
| Model | Input /1M | Output /1M | Cache Read | Batch (50% off) |
|---|---|---|---|---|
| Opus 4.7 | $5.00 | $25.00 | $0.50 | Yes |
| Sonnet 4.6 | $3.00 | $15.00 | $0.30 | Yes |
| Haiku 4.5 | $1.00 | $5.00 | $0.10 | Yes |
Recent Changes (May 2026)
- Rate limits doubled for Pro, Max, Team, and Enterprise plans (SpaceX Colossus 1 deal — 220K+ NVIDIA GPUs, 300+ MW capacity). Announced May 6 at Code with Claude event
- Peak hours throttling removed — same rate limits at 3 AM and 3 PM
- Opus API rate limits raised — higher maximums across all API tiers
- Managed Agents launched — Dreaming, Outcomes, multi-agent orchestration now available at claude.ai/code
- Claude Cowork now bundled into Pro (visual canvas-based workspace, GA since Apr 2026)
Known Issues
- v2.1.100 bug (Apr 2026): ~20K invisible tokens added per request, burning quota ~40% faster. Root cause: broken prompt caching forcing full re-processing on every turn. Six releases shipped through v2.1.133 (May 8) with features but no public fix. Workaround: downgrade to v2.1.34 or reinstall via npm
- Opus 4.7 tokenizer: New tokenizer can use up to 35% more tokens for equivalent text vs Opus 4.6
- 5-hour rolling window: Hit limit at 2pm? Wait until 7pm. Not a daily reset
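The rolling-window behaviour can be sketched as a limiter that counts each message against the quota for five hours after it is sent. This is a hypothetical model for illustration: the class name and the limit of 3 are made up, and Anthropic's actual accounting (for example, whether the window is anchored to the first message) is not specified here.

```python
from collections import deque

WINDOW_HOURS = 5.0


class RollingQuota:
    """Hypothetical rolling-window limiter; times are hours on one clock."""

    def __init__(self, limit, window=WINDOW_HOURS):
        self.limit = limit
        self.window = window
        self.sent = deque()  # timestamps of messages still inside the window

    def _expire(self, now):
        # Drop messages that are at least one full window old.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()

    def try_send(self, now):
        """Record a message at `now` if quota remains; return success."""
        self._expire(now)
        if len(self.sent) < self.limit:
            self.sent.append(now)
            return True
        return False

    def next_available(self, now):
        """Earliest time at which another message would be accepted."""
        self._expire(now)
        if len(self.sent) < self.limit:
            return now
        return self.sent[0] + self.window


quota = RollingQuota(limit=3)
for _ in range(3):
    quota.try_send(14.0)  # a burst at 2pm uses the whole quota
blocked = quota.try_send(16.0)        # rejected: burst is still in the window
retry_at = quota.next_available(16.0)  # 2pm + 5h
print(blocked, retry_at)
```

In this model a burst at 14:00 blocks further messages until 19:00, whatever the time of day, which is the "hit at 2pm, wait until 7pm" case described above.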
Community Reviews
From Reddit (r/ClaudeAI): “The difference of Claude Pro and Max5 plan usage limit are enormous. It is not only 5x” – users report Max 5x is a dramatically better experience for daily coding, not just numerically 5x better.
From dev.to review: “Pro costs ~$0.44 per Opus message; Max 5x costs the same per message but prevents hitting rate limits sooner. Max 20x cuts per-message cost in half (~$0.22) and is the only plan offering true volume discount.”
From findskill.ai: “Pro $20/mo for solo devs, Max 5x $100/mo for full-time on Claude Code, Max 20x $200/mo for pair-programming all day. The v2.1.100 token inflation bug is real – burns quotas ~40% faster.”
From felloai review: “Max 5x at $100/month gives five times the Pro usage and priority access during peak demand. It is the sweet spot for full-time developers who use Claude Code as their default coding partner.”
Personal Review (May 2026): Claude Code Pro ($20/mo) is decent for light use but the 5-hour rolling window makes it impractical for serious daily development – you WILL hit the wall. The good news: Anthropic just doubled rate limits and removed peak hours throttling (thanks to SpaceX Colossus 1), so Pro is now much more usable — ~90 Opus messages per 5h instead of 45. Max 5x ($100/mo) is still the minimum for real work, but the gap is narrower. On the plus side, code quality is unmatched – Opus 4.7 is genuinely the smartest model for complex refactoring and architecture decisions. The main frustration is Anthropic’s opaque quota system and the tokenizer/v2.1.100 bugs silently eating into your limits. Compared to OpenCode Go ($10/mo with MiniMax M2.5), Claude Code is still 5-10x the price for incrementally better quality. Only worth it if you need Opus-level reasoning for complex codebases. The v2.1.100 bug may have been fixed — keep an eye on releases.
GPT Codex (OpenAI) Pricing
OpenAI’s agentic coding tool — CLI, IDE extension, ChatGPT app, and cloud. Codex is bundled into ChatGPT plans (no separate subscription).
Source: developers.openai.com/codex/pricing
ChatGPT Plans (Codex Included)
| Plan | Price | Best For |
|---|---|---|
| Free | $0 | Limited Codex Mini access |
| Go | $20/mo | Budget-friendly Codex Mini (select regions). Reduced limits |
| Plus | $20/mo | Entry point for Codex CLI + IDE + cloud |
| Pro ($100) | $100/mo | 5x Plus limits (10x during promo until May 31). Includes GPT-5.3-Codex-Spark preview |
| Pro ($200) | $200/mo | 20x higher limits. Heavy daily use |
| Business | $25/seat/mo | Team workspace + admin controls |
| API Key | Pay-per-token | CI/CD, automation, programmatic |
Codex Usage Limits (Plus — 5-hour rolling window)
| Model | Local Msgs/5h | Cloud Tasks/5h | Code Reviews/5h |
|---|---|---|---|
| GPT-5.5 | 15-80 | — | — |
| GPT-5.4 | 20-100 | — | — |
| GPT-5.4-mini | 60-350 | — | — |
| GPT-5.3-Codex | 30-150 | 10-60 | 20-50 |
Pro tiers: 5x to 20x these limits depending on tier. Promo: Pro $100 gets 2x (10x total) until May 31, 2026.
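As a rough guide, the Pro ranges can be estimated by scaling the Plus table. The sketch below assumes the multiplier applies linearly to both ends of each range, which the source does not confirm; the tier labels are illustrative.

```python
# Plus limits (local messages per 5h) from the table above.
PLUS_LOCAL_MSGS = {
    "GPT-5.5": (15, 80),
    "GPT-5.4": (20, 100),
    "GPT-5.4-mini": (60, 350),
    "GPT-5.3-Codex": (30, 150),
}

# Multipliers relative to Plus, as stated in the text.
TIER_MULTIPLIER = {
    "Plus": 1,
    "Pro $100": 5,
    "Pro $100 (promo)": 10,
    "Pro $200": 20,
}


def scaled_range(model, tier):
    """Scale the Plus (low, high) range by the tier multiplier."""
    low, high = PLUS_LOCAL_MSGS[model]
    multiplier = TIER_MULTIPLIER[tier]
    return low * multiplier, high * multiplier


for tier in TIER_MULTIPLIER:
    print(tier, scaled_range("GPT-5.5", tier))
```

For GPT-5.5 this gives 75-400 local messages per 5h on Pro $100 (150-800 during the promo) and 300-1,600 on Pro $200.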
API Pay-per-Token (Codex API)
| Model | Input /1M | Output /1M |
|---|---|---|
| GPT-5.5 | $2.50 | $10.00 |
| GPT-5.5 mini | $0.40 | $1.60 |
| GPT-5.5 nano | $0.10 | $0.40 |
| Codex (specialized) | $5.00 | $15.00 |
Notes
- Usage shared across all Codex surfaces (CLI, IDE, ChatGPT, cloud)
- Soft caps slow you down; hard caps cut off until window resets
- Extra credits can be purchased on Plus/Pro plans
- Pro $100 2x promo expires May 31 2026
- Go plan ($20/mo): Budget Codex Mini tier available in select regions. Reduced limits, same models
Community Reviews
From aitoolsrecap (2026): “The April 16, 2026 update added computer use, memory, image generation, and 90+ plugins – turning it from a coding assistant into a full developer workstation. Included from $20/mo on ChatGPT Plus.”
From elite-ai-assisted-coding review: “Codex CLI performed well throughout testing. GPT-5’s capabilities were the key differentiator – the model followed instructions precisely, learned from documentation effectively, and applied knowledge appropriately.”
Personal Review (May 2026): Overall positive experience. The limits are fine and good enough for daily use – the mini models (GPT-5.4-mini) were the best thing about the plan, offering generous quotas for routine tasks. The main problem is with the frontier models (GPT-5.4 and GPT-5.5): when they understand you correctly from the start, they work really well and do exactly what’s asked. But if they misinterpret the intent, things go spectacularly wrong – they commit to wrong approaches confidently and produce deeply broken code. It’s an all-or-nothing experience. The Plus plan at $20/mo is reasonable value, but Pro at $200/mo is hard to justify vs OpenCode Go or BytePlus.
Antigravity (Google IDE) Pricing
Google’s agent-first IDE (fork of VS Code). Runs multiple AI agents in parallel.
Includes Claude models (Sonnet & Opus) alongside Gemini — all in one IDE.
Source: antigravity.google/pricing
Plans
| Plan | Price | Models | Notes |
|---|---|---|---|
| Free | $0 | All models (rate-limited) | Access to all 7 models. Quota refreshes ~5h (unreliable). 92% quota cut since Dec 2025 |
| AI Pro | $20/mo | All models (higher limits) | Built-in credits (amount undisclosed). Best for hobbyists. Multi-day lockouts reported |
| AI Ultra | $249.99/mo | All models (highest limits) | Most expensive AI coding sub on the market. Still not immune to quota issues (Mar 2026+) |
| Pay-as-you-go | $25/2,500 credits | Full access | Overflow / burst usage. $0.01/credit |
Available Models
| Model | Provider | Type | Notes |
|---|---|---|---|
| Gemini 3.1 Pro | Google | Reasoning (High/Low) | Google’s flagship. Good for frontend work only |
| Gemini 3 Flash | Google | Fast | Very capable for frontend. Reliable |
| Claude Opus 4.6 | Anthropic | Premium | $200/mo value, available here. Best for backend |
| Claude Sonnet 4.6 | Anthropic | Mid-tier | Fast, reliable for backend code |
| GPT-OSS 120B | OpenAI | Open-weight | Free model |
Usage Tips
- Use Claude models for backend work — they handle complex logic, refactoring, and architecture
- Use Gemini models for frontend — Gemini 3 Flash in particular is excellent for UI work
- Never use Gemini for backend or complex logic tasks — it performs poorly
- Limits are separate per provider — running out on Claude? Switch to Gemini and keep working
- Multi-agent missions can assign different models to different agents within the same task
Known Issues
- Quota cuts: Free tier quotas cut 92% since Dec 2025 (250 -> 20 requests/day)
- Pro lockouts: Multiple reports of 7-day lockouts even with low usage. Advertised 5-hour refresh unreliable
- Credit system opaque: Credit-to-token conversion rate undisclosed. Per-model credit costs unknown
- Ultra lockouts: Even $250/mo Ultra users report unexpected quota restrictions since Mar 2026
Community Reviews
From aitoolanalysis (Mar 2026): “The most promising and most frustrating editor I’ve ever used.” Free tier gives access to Claude Opus 4.6 ($200/mo value) for $0. But the rate limit crisis is real – Pro subscribers hit 7-day lockouts instead of 5-hour refresh.
From vibecoding.app: “One developer documented a single Claude Opus 4.6 session consuming 635 out of 1,000 credits. One or two complex coding sessions can trigger a lockout for the rest of the week.”
From claude-world.com: “Google Antigravity users reported multi-day account lockouts, a 92% free-tier quota cut, and pricing that pushes developers toward a $250/month plan.”
Personal Review (May 2026): I’ve used it extensively. The best thing is having Claude models (Sonnet & Opus) alongside Gemini in one IDE – I use Claude for backend and Gemini for frontend, each with separate limits. Gemini 3 Flash is genuinely capable for frontend work. The downside: limits are terrible. It was generous at the start but degraded badly – on some days I’d get only 5-10 requests with Claude models before hitting the wall. The $250 Ultra plan is absurdly priced. Worth trying the free tier for the multi-model access, but don’t rely on it for daily production work.
Xiaomi MiMo Token Plan
Launched Apr 2 2026. Pure monthly credit pool — NO 5h windows, NO weekly limits.
Credits expire month-end, no rollover. Mid-month upgrades OK, downgrades not.
Now supports V2.5 model series — one subscription unlocks all 8 MiMo models.
Auto-renewal discounts: up to $144 off annual plans.
| Plan | Price | Credits/Mo | ~Tasks/mo (V2-Omni 1:1) |
|---|---|---|---|
| Lite | $6/mo (¥39) | 60,000,000 | ~120 |
| Standard | $16/mo (¥99) | 200,000,000 | ~400 |
| Pro | $50/mo (¥329) | 700,000,000 | ~1,400 |
| Max | $100/mo (¥659) | 1,600,000,000 | ~3,200 |
Credit Multiplier by Model
| Model | Context | Rate |
|---|---|---|
| MiMo-V2-Omni | up to 256K | 1 token = 1 Credit |
| MiMo-V2-Pro | up to 256K | 1 token = 2 Credits |
| MiMo-V2-Pro | 256K–1M | 1 token = 4 Credits |
| MiMo-V2.5 | up to 256K | 1 token = ~1 Credit (Omni-class) |
| MiMo-V2.5-Pro | up to 256K | 1 token = ~2 Credits |
| MiMo-V2-TTS | n/a | Free (limited time) |
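The credit multipliers change how far a monthly pool goes. A minimal sketch, assuming about 500,000 tokens per task; that figure is inferred from the plan table (60,000,000 credits for ~120 tasks at 1:1), not published by Xiaomi, and the names below are illustrative.

```python
PLAN_CREDITS = {
    "Lite": 60_000_000,
    "Standard": 200_000_000,
    "Pro": 700_000_000,
    "Max": 1_600_000_000,
}

# Credits charged per token, from the multiplier table (up to 256K context).
CREDITS_PER_TOKEN = {"MiMo-V2-Omni": 1, "MiMo-V2-Pro": 2}

# Inferred from the plan table: 60,000,000 credits / ~120 tasks at 1:1.
TOKENS_PER_TASK = 500_000


def tasks_per_month(plan, model):
    """Approximate tasks per month for a plan and model."""
    tokens = PLAN_CREDITS[plan] // CREDITS_PER_TOKEN[model]
    return tokens // TOKENS_PER_TASK


for plan in PLAN_CREDITS:
    print(plan, tasks_per_month(plan, "MiMo-V2-Omni"),
          tasks_per_month(plan, "MiMo-V2-Pro"))
```

At the 2-credit rate every plan's task estimate halves, so Lite drops from about 120 tasks to about 60 when run entirely on MiMo-V2-Pro.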
API rates (V2.5): $0.40 input / $2.00 output per 1M tokens (up to 256K context). 88% off first purchase.
Kimi / Moonshot AI
Two products: Kimi membership (app quotas) and Kimi Code (developer).
API billed separately — NOT included in membership.
Current model: K2.6 (released Apr 18–21 2026).
K2.5 predecessor: 1T params, 32B active, MoE, 384 experts, 256K ctx, MIT license.
K2 (original) discontinued: stops serving May 25, 2026. Migrate to K2.5 or K2.6.
K2.6 Improvements
| Capability | K2.5 | K2.6 |
|---|---|---|
| SWE-Bench Pro | — | 58.6% |
| Multilingual | — | 76.7% |
| BrowseComp | 78.4% | 83.2% |
| Parallel sub-agents | 100 | 300 |
| Long-horizon | Hours | 4,000+ tool calls, 12+ hr |
API Pricing (K2.6)
| | Input (Cache Miss) | Input (Cache Hit) | Output |
|---|---|---|---|
| Per 1M tokens | $0.95 | $0.16 | $4.00 |
Context window: 262,144 tokens.
Membership Plans (Kimi App)
| Tier | Price (annual/mo) | Agent Usage/mo |
|---|---|---|
| Adagio | Free | 6 |
| Moderato | $15/mo | 60 |
| Allegretto | $31/mo | 150 |
| Allegro | $79/mo | 360 |
| Vivace | $159/mo | 720 |
Day-0 support on: vLLM, OpenRouter, Cloudflare, Baseten, MLX, Hermes, OpenCode.