Qwen Code: Quotas and Pricing (for Google-backed flows)

Quotas and pricing below apply when using Google-backed flows interoperable with Qwen Code. A summary of model usage is available through the /stats command and presented on exit at the end of a session. See privacy and terms for details. Note: published prices are list price; additional negotiated commercial discounting may apply.

This article outlines the specific quotas and pricing applicable to Google-backed usage paths.

1. Log in with Google (Gemini Code Assist Free Tier)

For users who authenticate by using their Google account to access Gemini Code Assist for individuals:

Quota:
- 60 requests per minute
- 1000 requests per day
- Token usage is not applicable
Cost: Free
Details: Gemini Code Assist Quotas
Notes: A specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.

2. Gemini API Key (Unpaid)

If you are using a Gemini API key for the free tier:

Quota:
- Flash model only
- 10 requests per minute
- 250 requests per day
Cost: Free
Details: Gemini API Rate Limits

3. Gemini API Key (Paid)

If you are using a Gemini API key with a paid plan:

Quota: Varies by pricing tier.
Cost: Varies by pricing tier and model/token usage.
Details: Gemini API Rate Limits, Gemini API Pricing

For users of Standard or Enterprise editions of Gemini Code Assist, quotas and pricing are based on a fixed price subscription with assigned license seats:

Standard Tier:
- Quota: 120 requests per minute, 1500 per day
Enterprise Tier:
- Quota: 120 requests per minute, 2000 per day
Cost: Fixed price included with your Gemini for Google Workspace or Gemini Code Assist subscription.
Details: Gemini Code Assist Quotas, Gemini Code Assist Pricing
Notes:
- Specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.
- Members of the Google Developer Program may have Gemini Code Assist licenses through their membership.

5. Vertex AI (Express Mode)

If you are using Vertex AI in Express Mode:

Quota: Quotas are variable and specific to your account. See the source for more details.
Cost: After your Express Mode usage is consumed and you enable billing for your project, cost is based on standard Vertex AI Pricing.
Details: Vertex AI Express Mode Quotas

6. Vertex AI (Regular Mode)

If you are using the standard Vertex AI service:

Quota: Governed by a dynamic shared quota system or pre-purchased provisioned throughput.
Cost: Based on model and token usage. See Vertex AI Pricing.
Details: Vertex AI Dynamic Shared Quota

7. Google One and Ultra plans, Gemini for Workspace plans

These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers CLI integrations.

3.8 KiB Raw Blame History