mirror of
https://github.com/google-gemini/gemini-cli.git
synced 2026-04-25 04:24:51 -07:00
DOCS: Update quota and pricing page (#21194)
This commit is contained in:
@@ -4,6 +4,10 @@ To use Gemini CLI, you'll need to authenticate with Google. This guide helps you
|
|||||||
quickly find the best way to sign in based on your account type and how you're
|
quickly find the best way to sign in based on your account type and how you're
|
||||||
using the CLI.
|
using the CLI.
|
||||||
|
|
||||||
|
> **Note:** Looking for a high-level comparison of all available subscriptions?
|
||||||
|
> To compare features and find the right quota for your needs, see our
|
||||||
|
> [Plans page](/plans/).
|
||||||
|
|
||||||
For most users, we recommend starting Gemini CLI and logging in with your
|
For most users, we recommend starting Gemini CLI and logging in with your
|
||||||
personal Google account.
|
personal Google account.
|
||||||
|
|
||||||
|
|||||||
@@ -39,6 +39,10 @@ When you encounter that limit, you’ll be given the option to switch to Gemini
|
|||||||
2.5 Pro, upgrade for higher limits, or stop. You’ll also be told when your usage
|
2.5 Pro, upgrade for higher limits, or stop. You’ll also be told when your usage
|
||||||
limit resets and Gemini 3 Pro can be used again.
|
limit resets and Gemini 3 Pro can be used again.
|
||||||
|
|
||||||
|
> **Note:** Looking to upgrade for higher limits? To compare subscription
|
||||||
|
> options and find the right quota for your needs, see our
|
||||||
|
> [Plans page](/plans/).
|
||||||
|
|
||||||
Similarly, when you reach your daily usage limit for Gemini 2.5 Pro, you’ll see
|
Similarly, when you reach your daily usage limit for Gemini 2.5 Pro, you’ll see
|
||||||
a message prompting fallback to Gemini 2.5 Flash.
|
a message prompting fallback to Gemini 2.5 Flash.
|
||||||
|
|
||||||
|
|||||||
@@ -1,14 +1,13 @@
|
|||||||
# Gemini CLI: Quotas and pricing
|
# Gemini CLI: Quotas and pricing
|
||||||
|
|
||||||
Gemini CLI offers a generous free tier that covers many individual developers'
|
Gemini CLI offers a generous free tier that covers many individual developers'
|
||||||
use cases. For enterprise or professional usage, or if you need higher limits,
|
use cases. For enterprise or professional usage, or if you need increased quota,
|
||||||
several options are available depending on your authentication account type.
|
several options are available depending on your authentication account type.
|
||||||
|
|
||||||
See [privacy and terms](./tos-privacy.md) for details on the Privacy Policy and
|
For a high-level comparison of available subscriptions and to select the right
|
||||||
Terms of Service.
|
quota for your needs, see the [Plans page](/plans/).
|
||||||
|
|
||||||
> **Note:** Published prices are list price; additional negotiated commercial
|
## Overview
|
||||||
> discounting may apply.
|
|
||||||
|
|
||||||
This article outlines the specific quotas and pricing applicable to Gemini CLI
|
This article outlines the specific quotas and pricing applicable to Gemini CLI
|
||||||
when using different authentication methods.
|
when using different authentication methods.
|
||||||
@@ -23,10 +22,11 @@ Generally, there are three categories to choose from:
|
|||||||
|
|
||||||
## Free usage
|
## Free usage
|
||||||
|
|
||||||
Your journey begins with a generous free tier, perfect for experimentation and
|
Access to Gemini CLI begins with a generous free tier, perfect for
|
||||||
light use.
|
experimentation and light use.
|
||||||
|
|
||||||
Your free usage limits depend on your authorization type.
|
Your free usage is governed by the following limits, which depend on your
|
||||||
|
authorization type.
|
||||||
|
|
||||||
### Log in with Google (Gemini Code Assist for individuals)
|
### Log in with Google (Gemini Code Assist for individuals)
|
||||||
|
|
||||||
@@ -78,14 +78,12 @@ Gemini CLI by upgrading to one of the following subscriptions:
|
|||||||
Learn more at
|
Learn more at
|
||||||
[Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas)
|
[Gemini Code Assist Quotas and Limits](https://developers.google.com/gemini-code-assist/resources/quotas)
|
||||||
|
|
||||||
- [Purchase a Gemini Code Assist Subscription through Google Cloud ](https://cloud.google.com/gemini/docs/codeassist/overview)
|
- [Purchase a Gemini Code Assist Subscription through Google Cloud](https://cloud.google.com/gemini/docs/codeassist/overview).
|
||||||
by signing up in the Google Cloud console. Learn more at
|
|
||||||
[Set up Gemini Code Assist](https://cloud.google.com/gemini/docs/discover/set-up-gemini).
|
|
||||||
|
|
||||||
Quotas and pricing are based on a fixed price subscription with assigned
|
Quotas and pricing are based on a fixed price subscription with assigned
|
||||||
license seats. For predictable costs, you can sign in with Google.
|
license seats. For predictable costs, you can sign in with Google.
|
||||||
|
|
||||||
This includes:
|
This includes the following request limits:
|
||||||
- Gemini Code Assist Standard edition:
|
- Gemini Code Assist Standard edition:
|
||||||
- 1500 model requests / user / day
|
- 1500 model requests / user / day
|
||||||
- 120 model requests / user / minute
|
- 120 model requests / user / minute
|
||||||
@@ -106,18 +104,27 @@ recommended path for uninterrupted access.
|
|||||||
|
|
||||||
To do this, log in using a Gemini API key or Vertex AI.
|
To do this, log in using a Gemini API key or Vertex AI.
|
||||||
|
|
||||||
- Vertex AI (Regular Mode):
|
### Vertex AI (regular mode)
|
||||||
- Quota: Governed by a dynamic shared quota system or pre-purchased
|
|
||||||
provisioned throughput.
|
An enterprise-grade platform for building, deploying, and managing AI models,
|
||||||
- Cost: Based on model and token usage.
|
including Gemini. It offers enhanced security, data governance, and integration
|
||||||
|
with other Google Cloud services.
|
||||||
|
|
||||||
|
- Quota: Governed by a dynamic shared quota system or pre-purchased provisioned
|
||||||
|
throughput.
|
||||||
|
- Cost: Based on model and token usage.
|
||||||
|
|
||||||
Learn more at
|
Learn more at
|
||||||
[Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota)
|
[Vertex AI Dynamic Shared Quota](https://cloud.google.com/vertex-ai/generative-ai/docs/resources/dynamic-shared-quota)
|
||||||
and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
|
and [Vertex AI Pricing](https://cloud.google.com/vertex-ai/pricing).
|
||||||
|
|
||||||
- Gemini API key:
|
### Gemini API key
|
||||||
- Quota: Varies by pricing tier.
|
|
||||||
- Cost: Varies by pricing tier and model/token usage.
|
Ideal for developers who want to quickly build applications with the Gemini
|
||||||
|
models. This is the most direct way to use the models.
|
||||||
|
|
||||||
|
- Quota: Varies by pricing tier.
|
||||||
|
- Cost: Varies by pricing tier and model/token usage.
|
||||||
|
|
||||||
Learn more at
|
Learn more at
|
||||||
[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits),
|
[Gemini API Rate Limits](https://ai.google.dev/gemini-api/docs/rate-limits),
|
||||||
@@ -125,7 +132,8 @@ Learn more at
|
|||||||
|
|
||||||
It’s important to highlight that when using an API key, you pay per token/call.
|
It’s important to highlight that when using an API key, you pay per token/call.
|
||||||
This can be more expensive for many small calls with few tokens, but it's the
|
This can be more expensive for many small calls with few tokens, but it's the
|
||||||
only way to ensure your workflow isn't interrupted by quota limits.
|
only way to ensure your workflow isn't interrupted by reaching a limit on your
|
||||||
|
quota.
|
||||||
|
|
||||||
## Gemini for workspace plans
|
## Gemini for workspace plans
|
||||||
|
|
||||||
@@ -135,12 +143,12 @@ Flow video editor). These plans do not apply to the API usage which powers the
|
|||||||
Gemini CLI. Supporting these plans is under active consideration for future
|
Gemini CLI. Supporting these plans is under active consideration for future
|
||||||
support.
|
support.
|
||||||
|
|
||||||
## Check usage and quota
|
## Check usage and limits
|
||||||
|
|
||||||
You can check your current token usage and quota information using the
|
You can check your current token usage and applicable limits using the
|
||||||
`/stats model` command. This command provides a snapshot of your current
|
`/stats model` command. This command provides a snapshot of your current
|
||||||
session's token usage, as well as your overall quota and usage for the supported
|
session's token usage, as well as information about the limits associated with
|
||||||
models.
|
your current quota.
|
||||||
|
|
||||||
For more information on the `/stats` command and its subcommands, see the
|
For more information on the `/stats` command and its subcommands, see the
|
||||||
[Command Reference](../../reference/commands.md#stats).
|
[Command Reference](../../reference/commands.md#stats).
|
||||||
@@ -149,17 +157,16 @@ A summary of model usage is also presented on exit at the end of a session.
|
|||||||
|
|
||||||
## Tips to avoid high costs
|
## Tips to avoid high costs
|
||||||
|
|
||||||
When using a Pay as you Go API key, be mindful of your usage to avoid unexpected
|
When using a pay-as-you-go plan, be mindful of your usage to avoid unexpected
|
||||||
costs.
|
costs.
|
||||||
|
|
||||||
- Don't blindly accept every suggestion, especially for computationally
|
- **Be selective with suggestions**: Before accepting a suggestion, especially
|
||||||
intensive tasks like refactoring large codebases.
|
for a computationally intensive task like refactoring a large codebase,
|
||||||
- Be intentional with your prompts and commands. You are paying per call, so
|
consider if it's the most cost-effective approach.
|
||||||
think about the most efficient way to get the job done.
|
- **Use precise prompts**: You are paying per call, so think about the most
|
||||||
|
efficient way to get your desired result. A well-crafted prompt can often get
|
||||||
## Gemini API vs. Vertex
|
you the answer you need in a single call, rather than multiple back-and-forth
|
||||||
|
interactions.
|
||||||
- Gemini API (gemini developer api): This is the fastest way to use the Gemini
|
- **Monitor your usage**: Use the `/stats model` command to track your token
|
||||||
models directly.
|
usage during a session. This can help you stay aware of your spending in real
|
||||||
- Vertex AI: This is the enterprise-grade platform for building, deploying, and
|
time.
|
||||||
managing Gemini models with specific security and control requirements.
|
|
||||||
|
|||||||
Reference in New Issue
Block a user