From 1be10c118675bbb77c9e0ba5dd0cff3a1da1e7e5 Mon Sep 17 00:00:00 2001 From: tanzhenxin Date: Wed, 20 Aug 2025 17:03:21 +0800 Subject: [PATCH] chore: update docs --- docs/quota-and-pricing.md | 70 --------------------------------------- docs/tos-privacy.md | 2 +- 2 files changed, 1 insertion(+), 71 deletions(-) delete mode 100644 docs/quota-and-pricing.md diff --git a/docs/quota-and-pricing.md b/docs/quota-and-pricing.md deleted file mode 100644 index 28e1eae9..00000000 --- a/docs/quota-and-pricing.md +++ /dev/null @@ -1,70 +0,0 @@ -# Qwen Code: Quotas and Pricing (for Google-backed flows) - -Quotas and pricing below apply when using Google-backed flows interoperable with Qwen Code. A summary of model usage is available through the `/stats` command and presented on exit at the end of a session. See [privacy and terms](./tos-privacy.md) for details. Note: published prices are list price; additional negotiated commercial discounting may apply. - -This article outlines the specific quotas and pricing applicable to Google-backed usage paths. - -## 1. Log in with Google (Gemini Code Assist Free Tier) - -For users who authenticate by using their Google account to access Gemini Code Assist for individuals: - -- **Quota:** - - 60 requests per minute - - 1000 requests per day - - Token usage is not applicable -- **Cost:** Free -- **Details:** [Gemini Code Assist Quotas](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli) -- **Notes:** A specific quota for different models is not specified; model fallback may occur to preserve shared experience quality. - -## 2. Gemini API Key (Unpaid) - -If you are using a Gemini API key for the free tier: - -- **Quota:** - - Flash model only - - 10 requests per minute - - 250 requests per day -- **Cost:** Free -- **Details:** [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits) - -## 3. Gemini API Key (Paid) - -If you are using a Gemini API key with a paid plan: - -- **Quota:** Varies by pricing tier. -- **Cost:** Varies by pricing tier and model/token usage. -- **Details:** [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits), [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing) - -## 4. Login with Google (for Workspace or Licensed Code Assist users) - -For users of Standard or Enterprise editions of Gemini Code Assist, quotas and pricing are based on a fixed price subscription with assigned license seats: - -- **Standard Tier:** - - **Quota:** 120 requests per minute, 1500 per day -- **Enterprise Tier:** - - **Quota:** 120 requests per minute, 2000 per day -- **Cost:** Fixed price included with your Gemini for Google Workspace or Gemini Code Assist subscription. -- **Details:** [Gemini Code Assist Quotas](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli), [Gemini Code Assist Pricing](https://cloud.google.com/products/gemini/pricing) -- **Notes:** - - Specific quota for different models is not specified; model fallback may occur to preserve shared experience quality. - - Members of the Google Developer Program may have Gemini Code Assist licenses through their membership. - -## 5. Vertex AI (Express Mode) - -If you are using Vertex AI in Express Mode: - -- **Quota:** Quotas are variable and specific to your account. See the source for more details. -- **Cost:** After your Express Mode usage is consumed and you enable billing for your project, cost is based on standard [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing). -- **Details:** [Vertex AI Express Mode Quotas](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas) - -## 6. Vertex AI (Regular Mode) - -If you are using the standard Vertex AI service: - -- **Quota:** Governed by a dynamic shared quota system or pre-purchased provisioned throughput. -- **Cost:** Based on model and token usage. See [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing). -- **Details:** [Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota) - -## 7. Google One and Ultra plans, Gemini for Workspace plans - -These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers CLI integrations. diff --git a/docs/tos-privacy.md b/docs/tos-privacy.md index f8142d77..046b1d08 100644 --- a/docs/tos-privacy.md +++ b/docs/tos-privacy.md @@ -23,7 +23,7 @@ When you authenticate using your qwen.ai account, these Terms of Service and Pri - **Terms of Service:** Your use is governed by the [Qwen Terms of Service](https://qwen.ai/termsservice). - **Privacy Notice:** The collection and use of your data is described in the [Qwen Privacy Policy](https://qwen.ai/privacypolicy). -For details about quotas, pricing, and features, see [Quotas and Pricing](./quota-and-pricing.md). +For details about authentication setup, quotas, and supported features, see [Authentication Setup](./cli/authentication.md). ## 2. If you are using OpenAI-Compatible API Authentication