Implement bot that performs time-series metric analysis and suggests repo management improvements (#25945)

2026-04-29 14:34:55 -07:00 · 2026-04-28 16:49:53 +00:00
parent 54b7586106
commit 58a57b72ae
15 changed files with 907 additions and 54 deletions
@@ -10,42 +10,67 @@ long-term strategic optimization.

 ### 1. System 1: The Pulse (Reflex Layer)

- **Purpose**: High-frequency, deterministic maintenance and data collection.
+- **Purpose**: High-frequency, deterministic maintenance.
 - **Frequency**: 30-minute cron (`.github/workflows/gemini-cli-bot-pulse.yml`).
 - **Implementation**: Pure TypeScript/JavaScript scripts.
- **Role**: Currently focuses on gathering repository metrics
-  (`tools/gemini-cli-bot/metrics/scripts`).
- **Output**: Action execution and `metrics-before.csv` artifact generation.
+- **Classification**: Optionally utilizes Gemini CLI for high-confidence
+  semantic classification (e.g., triage, labeling, sentiment) while preferring
+  deterministic logic for equivalent tasks.
+- **Phases**:
+  - **Reflex Execution**: Runs triage, routing, and automated maintenance
+    scripts in `reflexes/scripts/`.
+- **Output**: Real-time action execution.

 ### 2. System 2: The Brain (Reasoning Layer)

- **Purpose**: Strategic investigation, policy refinement, and
+- **Purpose**: Strategic investigation, policy refinement, and proactive
  self-optimization.
 - **Frequency**: 24-hour cron (`.github/workflows/gemini-cli-bot-brain.yml`).
 - **Implementation**: Agentic Gemini CLI phases.
- **Role**: Analyzing metric trends and running deeper repository health
-  investigations.
+- **Phases**:
+  - **Metrics Collection**: Executes scripts in `metrics/scripts/` to track
+    repository health (Open issues, PR latency, throughput, etc.).
+  - **Phase 1: Reasoning (Metrics & Root-Cause Analysis)**: Analyzes time-series
+    metric trends and repository state to identify bottlenecks or productivity
+    gaps, tests hypotheses, and proposes script or configuration changes to
+    improve repository health and maintainability.
+  - **Phase 2: Critique**: A technical and logical validation layer that reviews
+    proposed changes for robustness, actor-awareness, and anti-spam protocols.
+  - **Phase 3: Publish**: Automatically promotes approved changes to Pull
+    Requests, handles branch management, and responds to maintainer feedback.

 ## Directory Structure

- `metrics/`: Contains the deterministic runner (`index.ts`) and individual
-  TypeScript scripts (`scripts/`) that use the GitHub CLI to track metrics like
-  open issues, PR latency, throughput, and reviewer domain expertise.
- `processes/scripts/`: Placeholder directory for future deterministic triage
-  and routing scripts.
- `investigations/`: Placeholder directory for agentic root-cause analysis
-  phases.
- `critique/`: Placeholder directory for policy evaluation.
- `history/`: Storage for downloaded metrics artifacts from previous runs.
+- `metrics/`: Deterministic runner (`index.ts`) and scripts for tracking
+  repository metrics via GitHub CLI.
+- `reflexes/scripts/`: Deterministic triage and routing scripts executed by the
+  Pulse.
+- `brain/`: Prompt templates and logic for strategic root-cause analysis (Phase
+  1: `metrics.md`) and technical validation (Phase 2: `critique.md`).
+- `history/`: Persistent storage for time-series metrics artifacts.
+- `lessons-learned.md`: The bot's structured memory, containing the Task Ledger,
+  Hypothesis Ledger, and Decision Log.

 ## Usage

+### Local Metrics Collection
+
 To manually collect repository metrics locally, run the following command from
 the workspace root:

 ```bash
-npm run metrics
+npx tsx tools/gemini-cli-bot/metrics/index.ts
 ```

 This will execute all scripts within `metrics/scripts/` and output the results
-to a `metrics-before.csv` file in the root directory.
+to `tools/gemini-cli-bot/history/metrics-before.csv`.
+
+### Development
+
+When modifying the bot's logic:
+
+1. **Reflexes**: Add or update scripts in `reflexes/scripts/`.
+2. **Reasoning**: Update the prompts in `brain/` to refine how the bot
+   identifies bottlenecks.
+3. **Critique**: Update the prompts in `critique/` to strengthen the validation
+   of proposed changes.