diff --git a/docs/resources/quota-and-pricing.md b/docs/resources/quota-and-pricing.md
index 16d6b407b8..18beb7c761 100644
--- a/docs/resources/quota-and-pricing.md
+++ b/docs/resources/quota-and-pricing.md
@@ -12,6 +12,21 @@ quota for your needs, see the [Plans page](https://geminicli.com/plans/).
 This article outlines the specific quotas and pricing applicable to Gemini CLI
 when using different authentication methods.
 
+The following table summarizes the available quotas and their respective limits:
+
+| Authentication method | Tier / Subscription             | Maximum requests per user per day |
+| :-------------------- | :------------------------------ | :-------------------------------- |
+| **Google account**    | Gemini Code Assist (Individual) | 1,000 requests                    |
+|                       | Google AI Pro                   | 1,500 requests                    |
+|                       | Google AI Ultra                 | 2,000 requests                    |
+| **Gemini API key**    | Free tier (Unpaid)              | 250 requests                      |
+|                       | Pay-as-you-go (Paid)            | Varies                            |
+| **Vertex AI**         | Express mode (Free)             | Varies                            |
+|                       | Pay-as-you-go (Paid)            | Varies                            |
+| **Google Workspace**  | Code Assist Standard            | 1,500 requests                    |
+|                       | Code Assist Enterprise          | 2,000 requests                    |
+|                       | Workspace AI Ultra              | 2,000 requests                    |
+
 Generally, there are three categories to choose from:
 
 - Free Usage: Ideal for experimentation and light use.
@@ -20,6 +35,9 @@ Generally, there are three categories to choose from:
 - Pay-As-You-Go: The most flexible option for professional use, long-running
   tasks, or when you need full control over your usage.
 
+Requests are limited per user per minute and are subject to the availability of
+the service in times of high demand.
+
 ## Free usage
 
 Access to Gemini CLI begins with a generous free tier, perfect for
@@ -33,8 +51,7 @@ authorization type.
 For users who authenticate by using their Google account to access Gemini Code
 Assist for individuals. This includes:
 
-- 1000 model requests / user / day
-- 60 model requests / user / minute
+- 1000 maximum model requests / user / day
 - Model requests will be made across the Gemini model family as determined by
   Gemini CLI.
 
@@ -46,8 +63,7 @@ Learn more at
 If you are using a Gemini API key, you can also benefit from a free tier. This
 includes:
 
-- 250 model requests / user / day
-- 10 model requests / user / minute
+- 250 maximum model requests / user / day
 - Model requests to Flash model only.
 
 Learn more at
@@ -59,7 +75,7 @@ Vertex AI offers an Express Mode without the need to enable billing. This
 includes:
 
 - 90 days before you need to enable billing.
-- Quotas and models are variable and specific to your account.
+- Quotas and models are specific to your account and their limits vary.
 
 Learn more at
 [Vertex AI Express Mode Limits](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas).
@@ -112,11 +128,9 @@ Standard/Plus and AI Expanded, are not supported._
 This includes the following request limits:
 
 - Gemini Code Assist Standard edition:
-  - 1500 model requests / user / day
-  - 120 model requests / user / minute
+  - 1500 maximum model requests / user / day
 - Gemini Code Assist Enterprise edition:
-  - 2000 model requests / user / day
-  - 120 model requests / user / minute
+  - 2000 maximum model requests / user / day
 - Model requests will be made across the Gemini model family as determined by
   Gemini CLI.