
# Gemini CLI: Quotas and pricing

Gemini CLI offers a generous free tier that covers many individual developers'
use cases. For enterprise or professional usage, or if you need increased quota,
several options are available depending on your authentication method and
account type.

For a high-level comparison of available subscriptions and to select the right
quota for your needs, see the [Plans page](https://geminicli.com/plans/).

## Overview

This article outlines the specific quotas and pricing applicable to Gemini CLI
when using different authentication methods.

Generally, there are three categories to choose from:

- Free usage: Ideal for experimentation and light use.
- Paid tier (fixed price): For individual developers or enterprises who need
  more generous daily quotas and predictable costs.
- Pay as you go: The most flexible option for professional use, long-running
  tasks, or when you need full control over your usage.

## Free usage

Access to Gemini CLI begins with a generous free tier, perfect for
experimentation and light use.

Your free usage is governed by the following limits, which depend on your
authentication method.

### Log in with Google (Gemini Code Assist for individuals)

This tier applies to users who authenticate with their Google account to access
Gemini Code Assist for individuals. It includes:

- 1000 model requests / user / day
- 60 model requests / user / minute
- Model requests will be made across the Gemini model family as determined by
  Gemini CLI.

Learn more at
[Gemini Code Assist for Individuals Limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
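
To see how the per-minute and per-day limits interact, the following is a minimal arithmetic sketch. The `RequestQuota` class and its method names are hypothetical helpers written for this example and are not part of Gemini CLI; the numbers are the free-tier limits listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RequestQuota:
    """Hypothetical container for the two request limits listed on this page."""

    per_minute: int
    per_day: int

    def min_interval_seconds(self) -> float:
        # Smallest average gap between requests that stays within the
        # per-minute limit.
        return 60.0 / self.per_minute

    def minutes_until_daily_cap(self) -> float:
        # Minutes of sustained use at the per-minute maximum before the
        # daily limit is reached.
        return self.per_day / self.per_minute


# Free tier when logging in with Google: 60 requests/minute, 1000 requests/day.
google_login_free = RequestQuota(per_minute=60, per_day=1000)

print(google_login_free.min_interval_seconds())               # 1.0
print(round(google_login_free.minutes_until_daily_cap(), 1))  # 16.7
```

In other words, a session that runs continuously at the per-minute maximum would reach the daily limit in under 17 minutes, so the daily limit is usually the one to plan around.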

### Log in with Gemini API Key (unpaid)

If you are using a Gemini API key, you can also benefit from a free tier. This
includes:

- 250 model requests / user / day
- 10 model requests / user / minute
- Model requests go to the Flash model only.

Learn more at
[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits).

### Log in with Vertex AI (Express Mode)

Vertex AI offers an Express Mode that you can start using without enabling
billing. This includes:

- 90 days before you need to enable billing.
- Quotas and models are variable and specific to your account.

Learn more at
[Vertex AI Express Mode Limits](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas).

## Paid tier: Higher limits for a fixed cost

If you use up your initial number of requests, you can continue to benefit from
Gemini CLI by upgrading to one of the following subscriptions:

### Individuals

These tiers apply when you sign in with a personal account. To verify whether
you're on a personal account, visit
[Google One](https://one.google.com/about/plans?hl=en-US&g1_landing_page=0):

- If you are on a personal account, you will see your personal dashboard.
- If you are not on a personal account, you will see: "You're currently signed
  in to your Google Workspace Account."

**Supported tiers:** _Tiers not listed below, including Google AI Plus, are not
supported._

- [Google AI Pro and AI Ultra](https://gemini.google/subscriptions/). This is
  recommended for individual developers. Quotas and pricing are based on a fixed
  price subscription.

  For predictable costs, you can log in with Google.

  Learn more at
  [Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas).

### Through your organization

These tiers apply when you sign in with a Google Workspace account.

- To verify your account type, visit
  [the Google One page](https://one.google.com/about/plans?hl=en-US&g1_landing_page=0).
- You are on a Workspace account if you see the message "You're currently signed
  in to your Google Workspace Account."

**Supported tiers:** _Tiers not listed below, including Workspace AI
Standard/Plus and AI Expanded, are not supported._

- [Workspace AI Ultra Access](https://workspace.google.com/products/ai-ultra/).
- [Purchase a Gemini Code Assist Subscription through Google Cloud](https://cloud.google.com/gemini/docs/codeassist/overview).

  Quotas and pricing are based on a fixed price subscription with assigned
  license seats. For predictable costs, you can sign in with Google.

  This includes the following request limits:
  - Gemini Code Assist Standard edition:
    - 1500 model requests / user / day
    - 120 model requests / user / minute
  - Gemini Code Assist Enterprise edition:
    - 2000 model requests / user / day
    - 120 model requests / user / minute
  - Model requests will be made across the Gemini model family as determined by
    Gemini CLI.

  [Learn more about Gemini Code Assist license limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
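
As a rough way to compare the fixed daily quotas on this page, the sketch below picks the lowest listed limit that covers an expected number of requests per day. The dictionary labels and the `smallest_sufficient_tier` function are made up for illustration and are not Gemini CLI identifiers. The sketch compares daily request counts only; it ignores per-minute limits, model availability (for example, the unpaid API key tier is limited to the Flash model), and subscriptions whose limits are not listed here.

```python
from typing import Optional

# Daily request limits copied from this page. The keys are descriptive labels
# invented for this example.
DAILY_LIMITS = {
    "Gemini API key (unpaid)": 250,
    "Log in with Google (free)": 1000,
    "Gemini Code Assist Standard": 1500,
    "Gemini Code Assist Enterprise": 2000,
}


def smallest_sufficient_tier(requests_per_day: int) -> Optional[str]:
    """Return the lowest-limit option that covers the expected daily requests."""
    for name, limit in sorted(DAILY_LIMITS.items(), key=lambda item: item[1]):
        if requests_per_day <= limit:
            return name
    # Above every fixed quota listed here: pay as you go may fit better.
    return None


print(smallest_sufficient_tier(1200))  # Gemini Code Assist Standard
print(smallest_sufficient_tier(5000))  # None
```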

## Pay as you go

If you hit your daily request limits or exhaust your Gemini Pro quota even after
upgrading, the most flexible solution is to switch to a pay-as-you-go model,
where you pay for the specific amount of processing you use. This is the
recommended path for uninterrupted access.

To do this, log in using a Gemini API key or Vertex AI.

### Vertex AI (regular mode)

An enterprise-grade platform for building, deploying, and managing AI models,
including Gemini. It offers enhanced security, data governance, and integration
with other Google Cloud services.

- Quota: Governed by a dynamic shared quota system or pre-purchased provisioned
  throughput.
- Cost: Based on model and token usage.

Learn more at
[Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota)
and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).

### Gemini API key

Ideal for developers who want to quickly build applications with the Gemini
models. This is the most direct way to use the models.

- Quota: Varies by pricing tier.
- Cost: Varies by pricing tier and model/token usage.

Learn more at
[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits)
and [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing).

Note that when you use an API key, you pay per token or call. This can be more
expensive for many small calls with few tokens, but it's the only way to ensure
your workflow isn't interrupted by reaching a quota limit.
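
To get a feel for token-based billing, here is a minimal cost-estimate sketch. It assumes prices are quoted per one million tokens, with separate input and output rates. The function name and the prices in the usage example are placeholders chosen for round numbers, not actual rates; check the pricing pages linked above for current values.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float,
) -> float:
    """Estimate the cost of a session from its token counts.

    Prices are expressed in USD per one million tokens (an assumption of this
    sketch; confirm the unit on the pricing page for your model).
    """
    total = (
        input_tokens * input_price_per_million
        + output_tokens * output_price_per_million
    )
    return total / 1_000_000


# Placeholder prices for illustration only: 1.00 USD per million input tokens
# and 4.00 USD per million output tokens.
cost = estimate_cost_usd(
    input_tokens=200_000,
    output_tokens=50_000,
    input_price_per_million=1.00,
    output_price_per_million=4.00,
)
print(f"{cost:.2f}")  # 0.40
```

The token counts for a real session can be taken from the usage summary that Gemini CLI reports, as described in the next sections.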

## Gemini for Workspace plans

These plans currently apply only to web-based Gemini products provided by
Google (for example, the Gemini web app or the Flow video editor). They do not
apply to the API usage that powers Gemini CLI. Support for these plans is under
active consideration.

## Check usage and limits

You can check your current token usage and applicable limits using the
`/stats model` command. This command provides a snapshot of your current
session's token usage, as well as information about the limits associated with
your current quota.

For more information on the `/stats` command and its subcommands, see the
[Command Reference](../reference/commands.md#stats).

A summary of model usage is also presented on exit at the end of a session.

## Tips to avoid high costs

When using a pay-as-you-go plan, be mindful of your usage to avoid unexpected
costs.

- **Be selective with suggestions**: Before accepting a suggestion, especially
  for a computationally intensive task like refactoring a large codebase,
  consider if it's the most cost-effective approach.
- **Use precise prompts**: You are paying per call, so think about the most
  efficient way to get your desired result. A well-crafted prompt can often get
  you the answer you need in a single call, rather than multiple back-and-forth
  interactions.
- **Monitor your usage**: Use the `/stats model` command to track your token
  usage during a session. This can help you stay aware of your spending in real
  time.
-												Updated ToC on docs intro; updated title casing to match Google style (#13717)


											
										
										
											2025-12-01 11:38:48 -08:00
+								# Gemini CLI: Quotas and pricing
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												docs: Update 4 files (#13628)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
											
										
										
											2025-12-01 15:55:25 -08:00
+								Gemini CLI offers a generous free tier that covers many individual developers'
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								use cases. For enterprise or professional usage, or if you need increased quota,
-												docs: Update 4 files (#13628)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
											
										
										
											2025-12-01 15:55:25 -08:00
+								several options are available depending on your authentication account type.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								For a high-level comparison of available subscriptions and to select the right
-												Docs: Make documentation links relative (#21490)
											
										
										
											2026-03-09 08:23:00 -07:00
+								quota for your needs, see the [Plans page](https://geminicli.com/plans/).
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								## Overview
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												docs: Update 4 files (#13628)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
											
										
										
											2025-12-01 15:55:25 -08:00
+								This article outlines the specific quotas and pricing applicable to Gemini CLI
 								when using different authentication methods.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								Generally, there are three categories to choose from:
 								- Free Usage: Ideal for experimentation and light use.
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								- Paid Tier (fixed price): For individual developers or enterprises who need
 								  more generous daily quotas and predictable costs.
 								- Pay-As-You-Go: The most flexible option for professional use, long-running
 								  tasks, or when you need full control over your usage.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												Updated ToC on docs intro; updated title casing to match Google style (#13717)


											
										
										
											2025-12-01 11:38:48 -08:00
+								## Free usage
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								Access to Gemini CLI begins with a generous free tier, perfect for
 								experimentation and light use.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								Your free usage is governed by the following limits, which depend on your
 								authorization type.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								### Log in with Google (Gemini Code Assist for individuals)
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								For users who authenticate by using their Google account to access Gemini Code
 								Assist for individuals. This includes:
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
 								- 1000 model requests / user / day
 								- 60 model requests / user / minute
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								- Model requests will be made across the Gemini model family as determined by
 								  Gemini CLI.
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								Learn more at
 								[Gemini Code Assist for Individuals Limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Updated ToC on docs intro; updated title casing to match Google style (#13717)


											
										
										
											2025-12-01 11:38:48 -08:00
+								### Log in with Gemini API Key (unpaid)
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								If you are using a Gemini API key, you can also benefit from a free tier. This
 								includes:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- 250 model requests / user / day
 								- 10 model requests / user / minute
 								- Model requests to Flash model only.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								Learn more at
 								[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								### Log in with Vertex AI (Express Mode)
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								Vertex AI offers an Express Mode without the need to enable billing. This
 								includes:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								- 90 days before you need to enable billing.
 								- Quotas and models are variable and specific to your account.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								Learn more at
 								[Vertex AI Express Mode Limits](https://cloud.google.com/vertex-ai/generative-ai/docs/start/express-mode/overview#quotas).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								## Paid tier: Higher limits for a fixed cost
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								If you use up your initial number of requests, you can continue to benefit from
 								Gemini CLI by upgrading to one of the following subscriptions:
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota and pricing documentation with subscription tiers (#21351)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Jenna Inouye <jinouye@google.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2026-03-06 11:46:03 -08:00
+								### Individuals
 								These tiers apply when you sign in with a personal account. To verify whether
 								you're on a personal account, visit
 								[Google One](https://one.google.com/about/plans?hl=en-US&g1_landing_page=0):
 								- If you are on a personal account, you will see your personal dashboard.
 								- If you are not on a personal account, you will see: "You're currently signed
 								  in to your Google Workspace Account."
 								**Supported tiers:** _- Tiers not listed above, including Google AI Plus, are
 								not supported._
-												docs: Update 4 files (#13628)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
											
										
										
											2025-12-01 15:55:25 -08:00
+								- [Google AI Pro and AI Ultra](https://gemini.google/subscriptions/). This is
 								  recommended for individual developers. Quotas and pricing are based on a fixed
 								  price subscription.
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
 								  For predictable costs, you can log in with Google.
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								  Learn more at
 								  [Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas)
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
-												Update quota and pricing documentation with subscription tiers (#21351)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Jenna Inouye <jinouye@google.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2026-03-06 11:46:03 -08:00
+								### Through your organization
 								These tiers are applicable when you are signing in with a Google Workspace
 								account.
 								- To verify your account type, visit
 								  [the Google One page](https://one.google.com/about/plans?hl=en-US&g1_landing_page=0).
 								- You are on a workspace account if you see the message "You're currently signed
 								  in to your Google Workspace Account".
 								**Supported tiers:** _- Tiers not listed above, including Workspace AI
 								Standard/Plus and AI Expanded, are not supported._
 								- [Workspace AI Ultra Access](https://workspace.google.com/products/ai-ultra/).
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								- [Purchase a Gemini Code Assist Subscription through Google Cloud](https://cloud.google.com/gemini/docs/codeassist/overview).
-												docs: Update 4 files (#13628)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
											
										
										
											2025-12-01 15:55:25 -08:00
-												fixing minor formatting issues in quota-and-pricing.md (#11340)


											
										
										
											2025-10-30 17:46:23 -04:00
+								  Quotas and pricing are based on a fixed price subscription with assigned
 								  license seats. For predictable costs, you can sign in with Google.
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								  This includes the following request limits:
-												Document support for Google AI Pro and AI Ultra  (#9426)

Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2025-09-24 10:54:48 -07:00
+								  - Gemini Code Assist Standard edition:
 								    - 1500 model requests / user / day
 								    - 120 model requests / user / minute
 								  - Gemini Code Assist Enterprise edition:
 								    - 2000 model requests / user / day
 								    - 120 model requests / user / minute
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								  - Model requests will be made across the Gemini model family as determined by
 								    Gemini CLI.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota and pricing documentation with subscription tiers (#21351)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Jenna Inouye <jinouye@google.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
											
										
										
											2026-03-06 11:46:03 -08:00
+								  [Learn more about Gemini Code Assist license limits](https://developers.google.com/gemini-code-assist/resources/quotas#quotas-for-agent-mode-gemini-cli).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Updated ToC on docs intro; updated title casing to match Google style (#13717)


											
										
										
											2025-12-01 11:38:48 -08:00
+								## Pay as you go
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								If you hit your daily request limits or exhaust your Gemini Pro quota even after
 								upgrading, the most flexible solution is to switch to a pay-as-you-go model,
 								where you pay for the specific amount of processing you use. This is the
 								recommended path for uninterrupted access.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												Update quota-and-pricing.md to clarify billing (#6092)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Srinath Padmanabhan <srithreepo@google.com>
Co-authored-by: Srinath Padmanabhan <17151014+srithreepo@users.noreply.github.com>
											
										
										
											2025-08-15 15:00:13 -04:00
+								To do this, log in using a Gemini API key or Vertex AI.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
+								### Vertex AI (regular mode)
 								An enterprise-grade platform for building, deploying, and managing AI models,
 								including Gemini. It offers enhanced security, data governance, and integration
 								with other Google Cloud services.
 								- Quota: Governed by a dynamic shared quota system or pre-purchased provisioned
 								  throughput.
 								- Cost: Based on model and token usage.
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												cleanup(markdown): Prettier format all markdown @ 80 char width (#10714)


											
										
										
											2025-10-09 08:17:37 -04:00
+								Learn more at
 								[Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota)
 								and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
-												Docs: Add a page detailing quota and cost information (#2894)

Co-authored-by: Jenna Inouye <jinouye@google.com>
											
										
										
											2025-07-01 15:28:15 -07:00
-												DOCS: Update quota and pricing page (#21194)


											
										
										
											2026-03-05 10:09:14 -08:00
### Gemini API key

Ideal for developers who want to quickly build applications with the Gemini
models. This is the most direct way to use the models.

- Quota: Varies by pricing tier.
- Cost: Varies by pricing tier and model/token usage.

Learn more at
[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits)
and [Gemini API Pricing](https://ai.google.dev/gemini-api/docs/pricing).

When you use an API key, you pay per token or call. This can be more expensive
for many small calls with few tokens, but it's the only way to ensure your
workflow isn't interrupted by reaching a quota limit.

## Gemini for Workspace plans

These plans currently apply only to Google's web-based Gemini products (for
example, the Gemini web app or the Flow video editor). They do not apply to the
API usage that powers Gemini CLI. Support for these plans in Gemini CLI is under
active consideration.

## Check usage and limits

You can check your current token usage and applicable limits using the
`/stats model` command. This command provides a snapshot of your current
session's token usage, as well as information about the limits associated with
your current quota.

For more information on the `/stats` command and its subcommands, see the
[Command Reference](../reference/commands.md#stats).

A summary of model usage is also presented on exit at the end of a session.

## Tips to avoid high costs

When using a pay-as-you-go plan, be mindful of your usage to avoid unexpected
costs.

- **Be selective with suggestions**: Before accepting a suggestion, especially
  for a computationally intensive task like refactoring a large codebase,
  consider whether it's the most cost-effective approach.
- **Use precise prompts**: You are paying per call, so think about the most
  efficient way to get your desired result. A well-crafted prompt can often get
  you the answer you need in a single call, rather than multiple back-and-forth
  interactions.
- **Monitor your usage**: Use the `/stats model` command to track your token
  usage during a session. This can help you stay aware of your spending in real
  time.
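
As a rough illustration of the tips above, the following Python sketch compares
one precise request with a four-turn exchange. The per-token rates, the token
counts, and the assumption that each turn resends the earlier conversation are
all hypothetical. They are not real Gemini prices or measured Gemini CLI
behavior, so check the pricing pages linked earlier for actual figures.

```python
from dataclasses import dataclass

# Hypothetical rates in USD per million tokens. These are NOT real Gemini prices.
INPUT_RATE_PER_M = 1.00
OUTPUT_RATE_PER_M = 4.00


@dataclass
class Call:
    """Token counts for a single model request."""
    input_tokens: int
    output_tokens: int


def call_cost(call: Call) -> float:
    """Cost of one request under the hypothetical rates above."""
    return (
        call.input_tokens / 1_000_000 * INPUT_RATE_PER_M
        + call.output_tokens / 1_000_000 * OUTPUT_RATE_PER_M
    )


def session_cost(calls: list[Call]) -> float:
    """Total cost of a list of requests."""
    return sum(call_cost(c) for c in calls)


def back_and_forth(
    context_tokens: int, prompt_tokens: int, reply_tokens: int, turns: int
) -> list[Call]:
    """Model a conversation in which every turn resends the earlier prompts
    and replies as input (an assumption made for this sketch)."""
    calls = []
    history = context_tokens
    for _ in range(turns):
        history += prompt_tokens          # the new prompt joins the history
        calls.append(Call(history, reply_tokens))
        history += reply_tokens           # the reply is resent on the next turn
    return calls


# One detailed prompt versus four short prompts over the same 20,000-token context.
one_precise = [Call(input_tokens=20_000 + 400, output_tokens=1_500)]
four_vague = back_and_forth(
    context_tokens=20_000, prompt_tokens=100, reply_tokens=1_500, turns=4
)

print(f"single precise call: ${session_cost(one_precise):.4f}")
print(f"four-turn exchange:  ${session_cost(four_vague):.4f}")
```

To adapt the sketch, replace the two rate constants with the published rates
for your model and tier, and replace the token counts with figures reported by
`/stats model` for a representative session.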