docs/quota-and-pricing.md

# Gemini CLI: Quotas and Pricing

Gemini CLI offers a generous free tier that covers the use cases for many individual developers. For enterprise / professional usage, or if you need higher limits, there are multiple possible avenues depending on what type of account you use to authenticate.

See [privacy and terms](./tos-privacy.md) for details on Privacy policy and Terms of Service.

Note: published prices are list price; additional negotiated commercial discounting may apply.

This article outlines the specific quotas and pricing applicable to the Gemini CLI when using different authentication methods.

Generally, there are three categories to choose from:

- Free Usage: Ideal for experimentation and light use.
- Paid Tier (fixed price): For individual developers or enterprises who need more generous daily quotas and predictable costs.
- Pay-As-You-Go: The most flexible option for professional use, long-running tasks, or when you need full control over your usage.

## Free Usage

Your journey begins with a generous free tier, perfect for experimentation and light use.

Your free usage limits depend on your authorization type.

### Log in with Google (Gemini Code Assist for individuals)

For users who authenticate by using their Google account to access Gemini Code Assist for individuals. This includes:

- 1000 model requests / user / day
- 60 model requests / user / minute
- Model requests will be made across the Gemini model family as determined by Gemini CLI.

Learn more at [Gemini Code Assist for Individuals Limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).

### Log in with Gemini API Key (Unpaid)

If you are using a Gemini API key, you can also benefit from a free tier. This includes:

- 250 model requests / user / day
- 10 model requests / user / minute
- Model requests to Flash model only.

Learn more at [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits).

### Log in with Vertex AI (Express Mode)

Vertex AI offers an Express Mode without the need to enable billing. This includes:

- 90 days before you need to enable billing.
- Quotas and models are variable and specific to your account.

Learn more at [Vertex AI Express Mode Limits](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas).

## Paid tier: Higher limits for a fixed cost

If you use up your initial number of requests, you can continue to benefit from Gemini CLI by upgrading to one of the following subscriptions:

- [Google AI Pro and AI Ultra](https://cloud.google.com/products/gemini/pricing) by signing up at [Set up Gemini Code Assist](https://goo.gle/set-up-gemini-code-assist). This is recommended for individual developers. Quotas and pricing are based on a fixed price subscription.

  For predictable costs, you can log in with Google.

  Learn more at [Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas)

- [Purchase a Gemini Code Assist Subscription through Google Cloud ](https://cloud.google.com/gemini/docs/codeassist/overview) by signing up in the Google Cloud console. Learn more at [Set up Gemini Code Assist] (https://cloud.google.com/gemini/docs/discover/set-up-gemini) Quotas and pricing are based on a fixed price subscription with assigned license seats. For predictable costs, you can sign in with Google.

  This includes:
  - Gemini Code Assist Standard edition:
    - 1500 model requests / user / day
    - 120 model requests / user / minute
  - Gemini Code Assist Enterprise edition:
    - 2000 model requests / user / day
    - 120 model requests / user / minute
  - Model requests will be made across the Gemini model family as determined by Gemini CLI.

  [Learn more about Gemini Code Assist Standard and Enterprise license limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).

## Pay As You Go

If you hit your daily request limits or exhaust your Gemini Pro quota even after upgrading, the most flexible solution is to switch to a pay-as-you-go model, where you pay for the specific amount of processing you use. This is the recommended path for uninterrupted access.

To do this, log in using a Gemini API key or Vertex AI.

- Vertex AI (Regular Mode):
  - Quota: Governed by a dynamic shared quota system or pre-purchased provisioned throughput.
  - Cost: Based on model and token usage.

Learn more at [Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota) and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).

- Gemini API key:
  - Quota: Varies by pricing tier.
  - Cost: Varies by pricing tier and model/token usage.

Learn more at [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits), [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing)

It’s important to highlight that when using an API key, you pay per token/call. This can be more expensive for many small calls with few tokens, but it's the only way to ensure your workflow isn't interrupted by quota limits.

## Gemini for Workspace plans

These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers the Gemini CLI. Supporting these plans is under active consideration for future support.

## Tips to Avoid High Costs

When using a Pay as you Go API key, be mindful of your usage to avoid unexpected costs.

- Don't blindly accept every suggestion, especially for computationally intensive tasks like refactoring large codebases.
- Be intentional with your prompts and commands. You are paying per call, so think about the most efficient way to get the job done.

## Gemini API vs. Vertex

- Gemini API (gemini developer api): This is the fastest way to use the Gemini models directly.
- Vertex AI: This is the enterprise-grade platform for building, deploying, and managing Gemini models with specific security and control requirements.

## Understanding your usage

A summary of model usage is available through the `/stats` command and presented on exit at the end of a session.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
+								# Gemini CLI: Quotas and Pricing
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Gemini CLI offers a generous free tier that covers the use cases for many individual developers. For enterprise / professional usage, or if you need higher limits, there are multiple possible avenues depending on what type of account you use to authenticate.
 								See [privacy and terms](./tos-privacy.md) for details on Privacy policy and Terms of Service.
 								Note: published prices are list price; additional negotiated commercial discounting may apply.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
 								This article outlines the specific quotas and pricing applicable to the Gemini CLI when using different authentication methods.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Generally, there are three categories to choose from:
 								- Free Usage: Ideal for experimentation and light use.
 								- Paid Tier (fixed price): For individual developers or enterprises who need more generous daily quotas and predictable costs.
 								- Pay-As-You-Go: The most flexible option for professional use, long-running tasks, or when you need full control over your usage.
 								## Free Usage
 								Your journey begins with a generous free tier, perfect for experimentation and light use.
 								Your free usage limits depend on your authorization type.
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								### Log in with Google (Gemini Code Assist for individuals)
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
 								For users who authenticate by using their Google account to access Gemini Code Assist for individuals. This includes:
 								- 1000 model requests / user / day
 								- 60 model requests / user / minute
 								- Model requests will be made across the Gemini model family as determined by Gemini CLI.
 								Learn more at [Gemini Code Assist for Individuals Limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								### Log in with Gemini API Key (Unpaid)
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								If you are using a Gemini API key, you can also benefit from a free tier. This includes:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- 250 model requests / user / day
 								- 10 model requests / user / minute
 								- Model requests to Flash model only.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Learn more at [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								### Log in with Vertex AI (Express Mode)
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Vertex AI offers an Express Mode without the need to enable billing. This includes:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- 90 days before you need to enable billing.
 								- Quotas and models are variable and specific to your account.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Learn more at [Vertex AI Express Mode Limits](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								## Paid tier: Higher limits for a fixed cost
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								If you use up your initial number of requests, you can continue to benefit from Gemini CLI by upgrading to one of the following subscriptions:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								- [Google AI Pro and AI Ultra](https://cloud.google.com/products/gemini/pricing) by signing up at [Set up Gemini Code Assist](https://goo.gle/set-up-gemini-code-assist). This is recommended for individual developers. Quotas and pricing are based on a fixed price subscription.
 								  For predictable costs, you can log in with Google.
 								  Learn more at [Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas)
 								- [Purchase a Gemini Code Assist Subscription through Google Cloud ](https://cloud.google.com/gemini/docs/codeassist/overview) by signing up in the Google Cloud console. Learn more at [Set up Gemini Code Assist] (https://cloud.google.com/gemini/docs/discover/set-up-gemini) Quotas and pricing are based on a fixed price subscription with assigned license seats. For predictable costs, you can sign in with Google.
 								  This includes:
 								  - Gemini Code Assist Standard edition:
 								    - 1500 model requests / user / day
 								    - 120 model requests / user / minute
 								  - Gemini Code Assist Enterprise edition:
 								    - 2000 model requests / user / day
 								    - 120 model requests / user / minute
 								  - Model requests will be made across the Gemini model family as determined by Gemini CLI.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								  [Learn more about Gemini Code Assist Standard and Enterprise license limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								## Pay As You Go
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								If you hit your daily request limits or exhaust your Gemini Pro quota even after upgrading, the most flexible solution is to switch to a pay-as-you-go model, where you pay for the specific amount of processing you use. This is the recommended path for uninterrupted access.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								To do this, log in using a Gemini API key or Vertex AI.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- Vertex AI (Regular Mode):
 								  - Quota: Governed by a dynamic shared quota system or pre-purchased provisioned throughput.
 								  - Cost: Based on model and token usage.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Learn more at [Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota) and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- Gemini API key:
 								  - Quota: Varies by pricing tier.
 								  - Cost: Varies by pricing tier and model/token usage.
 								Learn more at [Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits), [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing)
 								It’s important to highlight that when using an API key, you pay per token/call. This can be more expensive for many small calls with few tokens, but it's the only way to ensure your workflow isn't interrupted by quota limits.
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								## Gemini for Workspace plans
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
 								These plans currently apply only to the use of Gemini web-based products provided by Google-based experiences (for example, the Gemini web app or the Flow video editor). These plans do not apply to the API usage which powers the Gemini CLI. Supporting these plans is under active consideration for future support.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
 								## Tips to Avoid High Costs
 								When using a Pay as you Go API key, be mindful of your usage to avoid unexpected costs.
 								- Don't blindly accept every suggestion, especially for computationally intensive tasks like refactoring large codebases.
 								- Be intentional with your prompts and commands. You are paying per call, so think about the most efficient way to get the job done.
 								## Gemini API vs. Vertex
 								- Gemini API (gemini developer api): This is the fastest way to use the Gemini models directly.
 								- Vertex AI: This is the enterprise-grade platform for building, deploying, and managing Gemini models with specific security and control requirements.
 								## Understanding your usage
 								A summary of model usage is available through the `/stats` command and presented on exit at the end of a session.