mirror of
https://github.com/google-gemini/gemini-cli.git
synced 2026-07-04 23:27:00 -07:00
1620c7d82f
Implement the visual agent using the LocalAgentDefinition pattern: - VisualAgentDefinition: Agent metadata for coordinate-based visual tasks - delegateToVisualAgent.ts: Tool for semantic agent to delegate visual tasks - Uses gemini-2.5-computer-use-preview-10-2025 model for Computer Use capability The visual agent handles tasks requiring visual identification or precise coordinate-based actions that cannot be done via the accessibility tree.