13 KiB
Gemini CLI Companion Extension: Interface Specification
Last Updated: September 15, 2025
This document defines the contract for building a companion extension to enable Gemini CLI's IDE mode. For VS Code, these features (native diffing, context awareness) are provided by the official extension (marketplace). This specification is for contributors who wish to bring similar functionality to other editors like JetBrains IDEs, Sublime Text, etc.
I. The Communication Interface
Gemini CLI and the IDE extension communicate through a local communication channel.
1. Transport Layer: MCP over HTTP
The extension MUST run a local HTTP server that implements the Model Context Protocol (MCP).
- Protocol: The server must be a valid MCP server. We recommend using an existing MCP SDK for your language of choice if available.
- Endpoint: The server should expose a single endpoint (e.g.,
/mcp) for all MCP communication. - Port: The server MUST listen on a dynamically assigned port (i.e., listen on port
0).
2. Discovery Mechanism: The Port File
For Gemini CLI to connect, it needs to discover which IDE instance it's running in and what port your server is using. The extension MUST facilitate this by creating a "discovery file."
- How the CLI Finds the File: The CLI determines the Process ID (PID) of the IDE it's running in by traversing the process tree. It then looks for a discovery file that contains this PID in its name.
- File Location: The file must be created in a specific directory:
os.tmpdir()/gemini/ide/. Your extension must create this directory if it doesn't exist. - File Naming Convention: The filename is critical and MUST follow the pattern:
gemini-ide-server-${PID}-${PORT}.json${PID}: The process ID of the parent IDE process. Your extension must determine this PID and include it in the filename.${PORT}: The port your MCP server is listening on.
- File Content & Workspace Validation: The file MUST contain a JSON object with the following structure:
{ "port": 12345, "workspacePath": "/path/to/project1:/path/to/project2" }port(number): The port of the MCP server.workspacePath(string): A list of all open workspace root paths, delimited by the OS-specific path separator (:for Linux/macOS,;for Windows). The CLI uses this path to ensure it's running in the same project folder that's open in the IDE. If the CLI's current working directory is not a sub-directory ofworkspacePath, the connection will be rejected. Your extension MUST provide the correct, absolute path(s) to the root of the open workspace(s).
- Tie-Breaking with Environment Variables (Recommended): For the most reliable experience, your extension SHOULD both create the discovery file and set the
GEMINI_CLI_IDE_SERVER_PORTandGEMINI_CLI_IDE_WORKSPACE_PATHenvironment variables in the integrated terminal. The file serves as the primary discovery mechanism, but the environment variables are crucial for tie-breaking. If a user has multiple IDE windows open for the same workspace, the CLI uses theGEMINI_CLI_IDE_SERVER_PORTvariable to identify and connect to the correct window's server.- For prototyping, you may opt to only set the environment variables. However, this is not a robust solution for a production extension, as environment variables may not be reliably set in all terminal sessions (e.g., restored terminals), which can lead to connection failures.
- Authentication: To secure the connection, the extension SHOULD generate a unique, secret token and include it in the discovery file. The CLI will then include this token in all requests to the MCP server.
- Token Generation: The extension should generate a random string to be used as a bearer token.
- Discovery File Content: The
authTokenfield must be added to the JSON object in the discovery file:{ "port": 12345, "workspacePath": "/path/to/project", "authToken": "a-very-secret-token" } - Request Authorization: The CLI will read the
authTokenfrom the file and include it in theAuthorizationheader for all HTTP requests to the MCP server (e.g.,Authorization: Bearer a-very-secret-token). Your server MUST validate this token on every request and reject any that are unauthorized.
II. The Context Interface
To enable context awareness, the extension MAY provide the CLI with real-time information about the user's activity in the IDE.
ide/contextUpdate Notification
The extension MAY send an ide/contextUpdate notification to the CLI whenever the user's context changes.
-
Triggering Events: This notification should be sent (with a recommended debounce of 50ms) when:
- A file is opened, closed, or focused.
- The user's cursor position or text selection changes in the active file.
-
Payload (
IdeContext): The notification parameters MUST be anIdeContextobject:interface IdeContext { workspaceState?: { openFiles?: File[]; isTrusted?: boolean; }; } interface File { // Absolute path to the file path: string; // Last focused Unix timestamp (for ordering) timestamp: number; // True if this is the currently focused file isActive?: boolean; cursor?: { // 1-based line number line: number; // 1-based character number character: number; }; // The text currently selected by the user selectedText?: string; }Note: The
openFileslist should only include files that exist on disk. Virtual files (e.g., unsaved files without a path, editor settings pages) MUST be excluded.
How the CLI Uses This Context
After receiving the IdeContext object, the CLI performs several normalization and truncation steps before sending the information to the model.
- File Ordering: The CLI uses the
timestampfield to determine the most recently used files. It sorts theopenFileslist based on this value. Therefore, your extension MUST provide an accurate Unix timestamp for when a file was last focused. - Active File: The CLI considers only the most recent file (after sorting) to be the "active" file. It will ignore the
isActiveflag on all other files and clear theircursorandselectedTextfields. Your extension should focus on settingisActive: trueand providing cursor/selection details only for the currently focused file. - Truncation: To manage token limits, the CLI truncates both the file list (to 10 files) and the
selectedText(to 16KB).
While the CLI handles the final truncation, it is highly recommended that your extension also limits the amount of context it sends.
III. The Diffing Interface
To enable interactive code modifications, the extension MAY expose a diffing interface. This allows the CLI to request that the IDE open a diff view, showing proposed changes to a file. The user can then review, edit, and ultimately accept or reject these changes directly within the IDE.
openDiff Tool
The extension MUST register an openDiff tool on its MCP server.
-
Description: This tool instructs the IDE to open a modifiable diff view for a specific file.
-
Request (
OpenDiffRequest): The tool is invoked via atools/callrequest. Theargumentsfield within the request'sparamsMUST be anOpenDiffRequestobject.interface OpenDiffRequest { // The absolute path to the file to be diffed. filePath: string; // The proposed new content for the file. newContent: string; } -
Response (
CallToolResult): The tool MUST immediately return aCallToolResultto acknowledge the request and report whether the diff view was successfully opened.- On Success: If the diff view was opened successfully, the response MUST contain empty content (i.e.,
content: []). - On Failure: If an error prevented the diff view from opening, the response MUST have
isError: trueand include aTextContentblock in thecontentarray describing the error.
The actual outcome of the diff (acceptance or rejection) is communicated asynchronously via notifications.
- On Success: If the diff view was opened successfully, the response MUST contain empty content (i.e.,
closeDiff Tool
The extension MUST register a closeDiff tool on its MCP server.
-
Description: This tool instructs the IDE to close an open diff view for a specific file.
-
Request (
CloseDiffRequest): The tool is invoked via atools/callrequest. Theargumentsfield within the request'sparamsMUST be anCloseDiffRequestobject.interface CloseDiffRequest { // The absolute path to the file whose diff view should be closed. filePath: string; } -
Response (
CallToolResult): The tool MUST return aCallToolResult.- On Success: If the diff view was closed successfully, the response MUST include a single TextContent block in the content array containing the file's final content before closing.
- On Failure: If an error prevented the diff view from closing, the response MUST have
isError: trueand include aTextContentblock in thecontentarray describing the error.
ide/diffAccepted Notification
When the user accepts the changes in a diff view (e.g., by clicking an "Apply" or "Save" button), the extension MUST send an ide/diffAccepted notification to the CLI.
-
Payload: The notification parameters MUST include the file path and the final content of the file. The content may differ from the original
newContentif the user made manual edits in the diff view.{ // The absolute path to the file that was diffed. filePath: string; // The full content of the file after acceptance. content: string; }
ide/diffRejected Notification
When the user rejects the changes (e.g., by closing the diff view without accepting), the extension MUST send an ide/diffRejected notification to the CLI.
-
Payload: The notification parameters MUST include the file path of the rejected diff.
{ // The absolute path to the file that was diffed. filePath: string; }
IV. Supporting Additional IDEs
To add support for a new IDE, two main components in the Gemini CLI codebase need to be updated: the detection logic and the installer logic.
1. IDE Detection (@packages/core/src/ide/detect-ide.ts)
// TODO(skeshive): Determine whether we should discover the IDE via the port file
The CLI must be able to identify when it is running inside a specific IDE's integrated terminal. This is primarily done by checking for unique environment variables. As a fallback, it can also inspect process information (like the command name) to help distinguish between IDEs if a unique environment variable is not available.
- Add to
DetectedIdeEnum: First, add your new IDE to theDetectedIdeenum. - Update
detectIdeFromEnv: Add a check in this function for an environment variable specific to your IDE (e.g.,if (process.env['MY_IDE_VAR']) { return DetectedIde.MyIde; }). - Update
detectIde(Optional): If your IDE lacks a unique environment variable, you can add logic to thedetectIdefunction to inspectideProcessInfo(e.g.,ideProcessInfo.command) as a secondary detection mechanism.
2. Extension Installation (@packages/core/src/ide/ide-installer.ts)
The CLI provides a command (/ide install) to help users automatically install the companion extension. While optional, implementing an IdeInstaller for your IDE is highly recommended to provide a seamless setup experience.
- Create an Installer Class: Create a new class that implements the
IdeInstallerinterface. - Implement
install(): Theinstallmethod should:- Locate the IDE's command-line executable. The
VsCodeInstallerprovides a good example of searching common installation paths for different operating systems. - Execute the command to install the extension by its marketplace ID (e.g.,
"path/to/my-ide-cli" --install-extension my-publisher.my-extension-id). - Return a result object indicating success or failure.
- Locate the IDE's command-line executable. The
- Update
getIdeInstaller: Add a case to theswitchstatement in this factory function to return an instance of your new installer class when yourDetectedIdeenum is matched.
V. The Lifecycle Interface
The extension MUST manage its resources and the discovery file correctly based on the IDE's lifecycle.
- On Activation (IDE startup/extension enabled):
- Start the MCP server.
- Create the discovery file.
- On Deactivation (IDE shutdown/extension disabled):
- Stop the MCP server.
- Delete the discovery file.