Qwen Code: Quotas and Pricing (for Google-backed flows)

The quotas and pricing below apply when you use Google-backed flows that interoperate with Qwen Code. A summary of model usage is available through the /stats command and is also shown when a session ends. See privacy and terms for details. Note: published prices are list prices; negotiated commercial discounts may also apply.

This article outlines the specific quotas and pricing applicable to Google-backed usage paths.

1. Log in with Google (Gemini Code Assist Free Tier)

For users who authenticate by using their Google account to access Gemini Code Assist for individuals:

  • Quota:
    • 60 requests per minute
    • 1000 requests per day
    • Token usage is not applicable
  • Cost: Free
  • Details: Gemini Code Assist Quotas
  • Notes: Per-model quotas are not specified; model fallback may occur to preserve shared experience quality.
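
The free-tier limits above (60 requests per minute, 1000 per day) can be approximated on the client side. The sketch below is a minimal illustration, not part of Qwen Code or any Google SDK: `QuotaTracker` and `try_acquire` are hypothetical names, and the rolling 60-second and 24-hour windows are an assumption, since the source does not say how the service resets its counters.

```python
import time
from collections import deque
from typing import Callable, Deque


class QuotaTracker:
    """Client-side estimate of per-minute and per-day request quotas.

    Hypothetical helper for illustration only. The rolling-window
    behaviour is an assumption, not documented service behaviour.
    """

    def __init__(
        self,
        per_minute: int = 60,    # free-tier value from the list above
        per_day: int = 1000,     # free-tier value from the list above
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.per_minute = per_minute
        self.per_day = per_day
        self._clock = clock
        self._minute: Deque[float] = deque()  # requests in the last 60 s
        self._day: Deque[float] = deque()     # requests in the last 24 h

    def _evict(self, now: float) -> None:
        # Drop timestamps that have aged out of each window.
        while self._minute and now - self._minute[0] >= 60:
            self._minute.popleft()
        while self._day and now - self._day[0] >= 86_400:
            self._day.popleft()

    def try_acquire(self) -> bool:
        """Record one request and return True if both windows have room."""
        now = self._clock()
        self._evict(now)
        if len(self._minute) >= self.per_minute or len(self._day) >= self.per_day:
            return False
        self._minute.append(now)
        self._day.append(now)
        return True
```

A caller would check `try_acquire()` before sending a request and back off when it returns `False`. The server remains the authority on quota, so a local tracker like this can only reduce the number of rejected requests, not guarantee none.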

2. Gemini API Key (Unpaid)

If you are using a Gemini API key for the free tier:

  • Quota:
    • Flash model only
    • 10 requests per minute
    • 250 requests per day
  • Cost: Free
  • Details: Gemini API Rate Limits

3. Gemini API Key (Paid)

If you are using a Gemini API key with a paid plan:

4. Log in with Google (for Workspace or Licensed Code Assist users)

For users of Standard or Enterprise editions of Gemini Code Assist, quotas and pricing are based on a fixed price subscription with assigned license seats:

  • Standard Tier:
    • Quota: 120 requests per minute, 1500 requests per day
  • Enterprise Tier:
    • Quota: 120 requests per minute, 2000 requests per day
  • Cost: Fixed price included with your Gemini for Google Workspace or Gemini Code Assist subscription.
  • Details: Gemini Code Assist Quotas, Gemini Code Assist Pricing
  • Notes:
    • Per-model quotas are not specified; model fallback may occur to preserve shared experience quality.
    • Members of the Google Developer Program may have Gemini Code Assist licenses through their membership.
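
In every tier that lists fixed numbers, the per-day quota is much smaller than what the per-minute quota would allow over a full day. The short sketch below works through that arithmetic using only the figures stated in this document; the dictionary keys and the `minutes_to_exhaust` function are hypothetical names chosen for illustration.

```python
# (requests per minute, requests per day), copied from the sections above.
# The tier labels are illustrative, not official identifiers.
QUOTAS = {
    "code_assist_free": (60, 1000),
    "api_key_free": (10, 250),
    "code_assist_standard": (120, 1500),
    "code_assist_enterprise": (120, 2000),
}


def minutes_to_exhaust(tier: str) -> float:
    """Minutes of sustained maximum-rate traffic before the daily quota runs out."""
    per_minute, per_day = QUOTAS[tier]
    return per_day / per_minute


if __name__ == "__main__":
    for tier in QUOTAS:
        print(f"{tier}: {minutes_to_exhaust(tier):.1f} min at full rate")
```

For example, the Standard tier's daily quota lasts 1500 / 120 = 12.5 minutes at the maximum per-minute rate, so for long sessions the daily limit is usually the tighter constraint.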

5. Vertex AI (Express Mode)

If you are using Vertex AI in Express Mode:

  • Quota: Quotas are variable and specific to your account. See the source for more details.
  • Cost: After your Express Mode usage is consumed and you enable billing for your project, cost is based on standard Vertex AI Pricing.
  • Details: Vertex AI Express Mode Quotas

6. Vertex AI (Regular Mode)

If you are using the standard Vertex AI service:

7. Google One and Ultra plans, Gemini for Workspace plans

These plans currently apply only to Gemini web-based products provided by Google (for example, the Gemini web app or the Flow video editor). They do not apply to the API usage that powers CLI integrations.