From 1be10c118675bbb77c9e0ba5dd0cff3a1da1e7e5 Mon Sep 17 00:00:00 2001
From: tanzhenxin <tanzhenxing1987@gmail.com>
Date: Wed, 20 Aug 2025 17:03:21 +0800
Subject: [PATCH] chore: update docs

---
 docs/quota-and-pricing.md | 70 ---------------------------------------
 docs/tos-privacy.md       |  2 +-
 2 files changed, 1 insertion(+), 71 deletions(-)
 delete mode 100644 docs/quota-and-pricing.md

diff --git a/docs/quota-and-pricing.md b/docs/quota-and-pricing.md
deleted file mode 100644
index 28e1eae9..00000000
--- a/docs/quota-and-pricing.md
+++ /dev/null
@@ -1,70 +0,0 @@
-# Qwen Code: Quotas and Pricing (for Google-backed flows)
-
-Quotas and pricing below apply when using Google-backed flows interoperable with Qwen Code. A summary of model usage is available through the `/stats` command and presented on exit at the end of a session. See [privacy and terms](./tos-privacy.md) for details. Note: published prices are list price; additional negotiated commercial discounting may apply.
-
-This article outlines the specific quotas and pricing applicable to Google-backed usage paths.
-
-## 1. Log in with Google (Gemini Code Assist Free Tier)
-
-For users who authenticate by using their Google account to access Gemini Code Assist for individuals:
-
-- **Quota:**
-  - 60 requests per minute
-  - 1000 requests per day
-  - Token usage is not applicable
-- **Cost:** Free
-- **Details:** [Gemini Code Assist Quotas](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli)
-- **Notes:** A specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.
-
-## 2. Gemini API Key (Unpaid)
-
-If you are using a Gemini API key for the free tier:
-
-- **Quota:**
-  - Flash model only
-  - 10 requests per minute
-  - 250 requests per day
-- **Cost:** Free
-- **Details:** [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits)
-
-## 3. Gemini API Key (Paid)
-
-If you are using a Gemini API key with a paid plan:
-
-- **Quota:** Varies by pricing tier.
-- **Cost:** Varies by pricing tier and model/token usage.
-- **Details:** [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits), [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing)
-
-## 4. Login with Google (for Workspace or Licensed Code Assist users)
-
-For users of Standard or Enterprise editions of Gemini Code Assist, quotas and pricing are based on a fixed price subscription with assigned license seats:
-
-- **Standard Tier:**
-  - **Quota:** 120 requests per minute, 1500 per day
-- **Enterprise Tier:**
-  - **Quota:** 120 requests per minute, 2000 per day
-- **Cost:** Fixed price included with your Gemini for Google Workspace or Gemini Code Assist subscription.
-- **Details:** [Gemini Code Assist Quotas](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli), [Gemini Code Assist Pricing](https://cloud.google.com/products/gemini/pricing)
-- **Notes:**
-  - Specific quota for different models is not specified; model fallback may occur to preserve shared experience quality.
-  - Members of the Google Developer Program may have Gemini Code Assist licenses through their membership.
-
-## 5. Vertex AI (Express Mode)
-
-If you are using Vertex AI in Express Mode:
-
-- **Quota:** Quotas are variable and specific to your account. See the source for more details.
-- **Cost:** After your Express Mode usage is consumed and you enable billing for your project, cost is based on standard [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
-- **Details:** [Vertex AI Express Mode Quotas](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas)
-
-## 6. Vertex AI (Regular Mode)
-
-If you are using the standard Vertex AI service:
-
-- **Quota:** Governed by a dynamic shared quota system or pre-purchased provisioned throughput.
-- **Cost:** Based on model and token usage. See [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
-- **Details:** [Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota)
-
-## 7. Google One and Ultra plans, Gemini for Workspace plans
-
-These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers CLI integrations.
diff --git a/docs/tos-privacy.md b/docs/tos-privacy.md
index f8142d77..046b1d08 100644
--- a/docs/tos-privacy.md
+++ b/docs/tos-privacy.md
@@ -23,7 +23,7 @@ When you authenticate using your qwen.ai account, these Terms of Service and Pri
 - **Terms of Service:** Your use is governed by the [Qwen Terms of Service](https://qwen.ai/termsservice).
 - **Privacy Notice:** The collection and use of your data is described in the [Qwen Privacy Policy](https://qwen.ai/privacypolicy).
 
-For details about quotas, pricing, and features, see [Quotas and Pricing](./quota-and-pricing.md).
+For details about authentication setup, quotas, and supported features, see [Authentication Setup](./cli/authentication.md).
 
 ## 2. If you are using OpenAI-Compatible API Authentication