diff --git a/prompts/evaluator/_base.md b/prompts/evaluator/_base.md index 318ff31..78d1094 100644 --- a/prompts/evaluator/_base.md +++ b/prompts/evaluator/_base.md @@ -67,51 +67,11 @@ Be concrete — "the function doesn't handle null input" not "there might be edg End your response with the same verdict block so it's visible in the terminal output. -## Runtime Verification (Web Projects) +## Runtime Verification -If the project has an `index.html` or is a web application, you MUST verify it actually runs: +Do not just read the code — **actually run it.** Use whatever tools are available to you (bash, MCP tools, etc.) to verify the project builds, runs, and behaves correctly. Code that looks correct but doesn't run is not complete. -1. **Start a local server** (if not already running): - ```bash - python3 -m http.server 8080 & - SERVER_PID=$! - sleep 1 - ``` - -2. **Check the page loads** — use curl to verify the server responds: - ```bash - curl -s -o /dev/null -w "%{http_code}" http://localhost:8080 - ``` - Expected: 200. If not, REJECT. - -3. **Check for JavaScript errors** — if Node.js is available, run a quick headless check: - ```bash - node -e " - const http = require('http'); - http.get('http://localhost:8080', res => { - let data = ''; - res.on('data', chunk => data += chunk); - res.on('end', () => { - const hasModules = data.includes('type=\"module\"'); - const hasCanvas = data.includes('/dev/null - ``` - -**Runtime errors = automatic REJECT.** Code that looks correct but doesn't run is not complete. +**Runtime errors = automatic REJECT.** ## What Warrants Rejection