|
1 | 1 | # 🚫 STOP — READ THIS FIRST — NON-NEGOTIABLE — NO EXCEPTIONS |
2 | 2 |
|
| 3 | +--- |
| 4 | + |
| 5 | +## WHAT IS THIS PROJECT |
| 6 | + |
| 7 | +**guIDE** is a local-first, offline-capable AI IDE — the quality target is Visual Studio Code + GitHub Copilot, but running entirely locally with no cloud dependency. It ships to real end users on all hardware configurations from 4GB GPU laptops to 128GB RAM workstations, running 0.5B to 200B parameter models. |
| 8 | + |
| 9 | +Every change to this codebase must be: |
| 10 | +- **Production-grade** — VSCode/Copilot level of polish. No band-aids, no test-specific workarounds, no partial implementations. |
| 11 | +- **General** — works for ALL users, ALL models, ALL hardware. Not just the dev machine. |
| 12 | +- **Complete** — both `main/` AND `pipeline-clone/main/` updated. 0% done until both are updated. |
| 13 | +- **Verified** — do NOT say a fix works without being able to cite specific code evidence. |
| 14 | + |
| 15 | +--- |
| 16 | + |
| 17 | +## TOP-LEVEL RULES — EVERY AGENT MUST READ ALL OF THESE BEFORE RESPONDING |
| 18 | + |
| 19 | +> These are the most commonly violated rules. Read every bullet. No exceptions. |
| 20 | +
|
| 21 | +1. **END WITH MULTI-CHOICE TOOL** — NEVER end a response with text. ALWAYS use the `vscode_askQuestions` tool at the end. "Let me know if you need anything" is BANNED. See RULE -1 below. |
| 22 | +2. **TRIPWIRE** — First line of EVERY response: `[Task: X | Last: Y]` |
| 23 | +3. **READ BEFORE WRITING** — Read every relevant file in full before implementing any change. No partial reads. No "I already know this code." Read it anyway. |
| 24 | +4. **PLAN FIRST, CODE SECOND** — Present a plan, wait for USER approval, THEN implement. Never write code in the same response as the plan. |
| 25 | +5. **NO BANNED WORDS** — Never say: confirmed, fixed, resolves, fully fixed, ready, working, all set. No emojis (especially no checkmarks). See BANNED WORDS section. |
| 26 | +6. **BOTH TREES ALWAYS** — Every code change goes to BOTH `main/` AND `pipeline-clone/main/`. No exceptions. |
| 27 | +7. **CHANGES_LOG.md ALWAYS** — Every code change logged in `pipeline-clone/CHANGES_LOG.md`. No exceptions. |
| 28 | +8. **NO CHEERLEADING** — Do NOT say "great results", "strong performance", "improvement", "looking good", or any positive framing. Report defects. That is the job. |
| 29 | +9. **NEVER DISMISS USER OBSERVATIONS** — What the user observes is FACT. If your analysis contradicts it, YOUR ANALYSIS IS WRONG. Go back and read more code. |
| 30 | +10. **NO BAND-AID FIXES** — Every fix addresses the root architectural cause. Surface-level patches, guard clauses, and timeouts masking a deeper issue are NOT fixes. |
| 31 | +11. **NEVER BLAME MODEL SIZE OR CONTEXT WINDOW** — If it fails, the pipeline is broken. Fix the pipeline. See dedicated sections below. |
| 32 | +12. **PRODUCTION SOFTWARE** — Every fix must work for 4GB GPU users AND 128GB workstation users. Hardware-specific fixes are bugs. |
| 33 | +13. **READ CHANGES_LOG.md FIRST** — Before proposing ANY fix, read `pipeline-clone/CHANGES_LOG.md`. Context resets every session. The log is the anchor. |
| 34 | +14. **READ 323_DEFECTS.md** — The definitive defect list is at `C:\Users\brend\IDE\323_DEFECTS.md`. Read it before proposing any fix to the guIDE pipeline. |
| 35 | +15. **ACKNOWLEDGE EVERY POINT** — If user makes 7 points, respond to all 7. Skipping one is the same as ignoring a direct instruction. |
| 36 | +16. **SELF-ACCOUNTABILITY CHECKPOINT** — After EVERY todo item, run through the mandatory checkpoint before marking it done. |
| 37 | +17. **PRE-CODE CHECKLIST** — Required before every code change. See mandatory section below. |
| 38 | +18. **POST-CODE VERIFICATION** — Required after every code change. See mandatory section below. |
| 39 | + |
| 40 | +--- |
| 41 | + |
3 | 42 | ## 🔴 RULE -1 — NEVER END INTERACTION WITHOUT MULTI-CHOICE QUESTION — ABSOLUTE |
4 | 43 |
|
5 | 44 | **This is THE most critical rule. Every premium request costs money. Ending without continuation wastes it.** |
6 | 45 |
|
7 | | -- **NEVER end an interaction by just stopping.** If you complete a task, hit a wall, need input, or have nothing more to do — you MUST use the multi-choice question tool to give the user options. |
8 | | -- **Every interaction MUST end with either:** (1) a multi-choice question tool call, OR (2) active work still in progress (e.g., waiting for a build, running a command) |
| 46 | +- **NEVER end an interaction by just stopping.** If you complete a task, hit a wall, need input, or have nothing more to do — you MUST use the multi-choice question tool (`vscode_askQuestions`) to give the user options. |
| 47 | +- **Every interaction MUST end with either:** (1) a `vscode_askQuestions` tool call, OR (2) active work still in progress (e.g., waiting for a build, running a command) |
9 | 48 | - **"I've completed X, let me know if you need anything else"** — BANNED. This is ending without continuation. |
10 | 49 | - **"Ready to proceed when you are"** — BANNED. Use multi-choice question instead. |
11 | | -- **If you are about to type your final message:** STOP. Call the multi-choice question tool with options like: "Build it", "Make additional changes", "Review the plan", "I have questions", etc. |
| 50 | +- **If you are about to type your final message:** STOP. Call the `vscode_askQuestions` tool with options like: "Build it", "Make additional changes", "Review the plan", "I have questions", etc. |
12 | 51 | - **This rule applies to EVERY interaction, no exceptions.** The user should never have to type a new message just to continue — they should be able to click an option. |
| 52 | +- **Last step of EVERY todo list must be:** call `vscode_askQuestions`. This is non-negotiable. |
13 | 53 |
|
14 | 54 | ** NEVER BLAME MODEL |
15 | 55 | SIZE —ALL MODELS EXCEL IN LM STUDIO AT SAME TASKS THEYRE BEING TESTED ON HERE! If something fails, the problem is in the pipeline. Exhaust all levers before concluding "the model isn't good enough." Do not say "the model can't do this" without first confirming that every other lever has been pulled. This is a production software project, not a research experiment. The user expects results, not excuses. |
@@ -1045,3 +1085,53 @@ Every stress test session MUST cover these 5 dimensions with UNIQUE prompts each |
1045 | 1085 | - Report every test using the mandatory reporting format |
1046 | 1086 | - Score every test on all 3 dimensions (coherence, tool correctness, response quality) |
1047 | 1087 | - No cheerleading, no positive framing, only defects |
| 1088 | + |
| 1089 | +--- |
| 1090 | + |
| 1091 | +## WEB TESTING RULES — KEY RULES CONSOLIDATED FROM WEB_TEST_RULES.md |
| 1092 | + |
| 1093 | +The full rules are in `C:\Users\brend\IDE\WEB_TEST_RULES.md`. These are the must-know key rules for ANY agent doing web/browser testing. |
| 1094 | + |
| 1095 | +### Session Format (14A) |
| 1096 | +- Each session: 5 turns per session, 3 sessions minimum per test |
| 1097 | +- Screenshot EVERY 5 seconds of active testing, not just at start/end |
| 1098 | +- A session that does not cover all 5 turns is incomplete. Resume from where it left off. |
| 1099 | + |
| 1100 | +### ALWAYS Read WEB_TEST_RULES.md In Full Before Testing |
| 1101 | +- Do NOT skim. Read the ENTIRE file before running any web test. |
| 1102 | +- The previous agent in this project stopped reading at line 500 — this is a violation. Read ALL lines. |
| 1103 | +- Use `read_file` from line 1 to the LAST LINE of the file. Not just the first section. |
| 1104 | + |
| 1105 | +### Screenshots and Observations |
| 1106 | +- Every screenshot must be described in full before acting on it |
| 1107 | +- Do NOT assume what's on screen — describe it, state what you see, then hypothesize |
| 1108 | +- Never skip an image description |
| 1109 | + |
| 1110 | +### Test Reporting Format (Mandatory) |
| 1111 | +``` |
| 1112 | +SESSION: [number] |
| 1113 | +TURN: [1-5] |
| 1114 | +PROMPT SENT: [exact text] |
| 1115 | +OBSERVED BEHAVIOR: [exact description, not paraphrase] |
| 1116 | +SCREENSHOT: [taken yes/no] |
| 1117 | +DEFECTS FOUND: [list, or "None found for this turn"] |
| 1118 | +``` |
| 1119 | + |
| 1120 | +--- |
| 1121 | + |
| 1122 | +## MANDATORY — READ 323_DEFECTS.md BEFORE PROPOSING ANY FIX |
| 1123 | + |
| 1124 | +File: `C:\Users\brend\IDE\323_DEFECTS.md` |
| 1125 | + |
| 1126 | +- This file is the DEFINITIVE defect list for the guIDE pipeline. |
| 1127 | +- Before proposing any fix to any bug, read this file first. |
| 1128 | +- Every defect in this file is REAL — user-observed or agent-observed during testing. |
| 1129 | +- Do NOT gaslight. Do NOT say "the code looks fine" when 323_DEFECTS.md says it's broken. |
| 1130 | +- When a defect is fixed (confirmed by user after testing), update the Status column to RESOLVED and log the fix in CHANGES_LOG.md. |
| 1131 | +- When new defects are found during testing, ADD THEM to 323_DEFECTS.md immediately. |
| 1132 | + |
| 1133 | +### Defect Severity Reference |
| 1134 | +- P0 — Blocks core functionality entirely |
| 1135 | +- P1 — Breaks a major user-facing feature |
| 1136 | +- P2 — Degrades UX significantly |
| 1137 | +- P3 — Minor issue or cosmetic |
0 commit comments