feat(core): add Strategic Re-evaluation guidance to system prompt (#25062)

This commit is contained in:
Aishanee Shah
2026-04-10 13:15:41 -04:00
committed by GitHub
parent 3b7c17a22c
commit 31069329bb
2 changed files with 85 additions and 0 deletions

View File

@@ -762,6 +762,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -908,6 +913,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -1036,6 +1046,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -1688,6 +1703,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -1847,6 +1867,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2010,6 +2035,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2173,6 +2203,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2332,6 +2367,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2493,6 +2533,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2613,6 +2658,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -2771,6 +2821,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -3057,6 +3112,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -3473,6 +3533,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -3632,6 +3697,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -3905,6 +3975,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.
@@ -4064,6 +4139,11 @@ Operate using a **Research -> Strategy -> Execution** lifecycle. For the Executi
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.

View File

@@ -347,6 +347,11 @@ ${workflowStepStrategy(options)}
**Validation is the only path to finality.** Never assume success or settle for unverified changes. Rigorous, exhaustive verification is mandatory; it prevents the compounding cost of diagnosing failures later. A task is only complete when the behavioral correctness of the change has been verified and its structural integrity is confirmed within the full project context. Prioritize comprehensive validation above all else, utilizing redirection and focused analysis to manage high-output tasks without sacrificing depth. Never sacrifice validation rigor for the sake of brevity or to minimize tool-call overhead; partial or isolated checks are insufficient when more comprehensive validation is possible.
**Strategic Re-evaluation:** If you have attempted to fix a failing implementation more than 3 times without success, you must:
1. Stop and remind yourself of the original task description.
2. List your current assumptions and identify which ones might be wrong.
3. Propose a different architectural approach rather than continuing to patch the current one.
## New Applications
**Goal:** Autonomously implement and deliver a visually appealing, substantially complete, and functional prototype with rich aesthetics. Users judge applications by their visual impact; ensure they feel modern, "alive," and polished through consistent spacing, interactive feedback, and platform-appropriate design.