refactor: remove domain-specific language from prompts — fully universal
This commit is contained in:
@@ -2,30 +2,16 @@
|
||||
|
||||
You are evaluating an implementation story. The generator claims to have built a feature.
|
||||
|
||||
## Implementation-Specific Checks
|
||||
|
||||
In addition to the base evaluation process:
|
||||
## Checks
|
||||
|
||||
1. **Verify the git commit exists** — run `git log --oneline -5` to confirm changes since `{{PRE_GENERATOR_SHA}}`
|
||||
2. **Check commit scope** — does `git diff {{PRE_GENERATOR_SHA}}..HEAD --name-only` only contain files relevant to this story?
|
||||
3. **Read the actual test output** — if the generator claims tests pass, verify by running them yourself
|
||||
4. **For UI stories:**
|
||||
- Check that the component actually renders (not just that it exists)
|
||||
- Verify event handlers are wired up (not just defined)
|
||||
- Check accessibility basics (labels, semantic elements)
|
||||
5. **For API stories:**
|
||||
- Verify the endpoint is registered in the router
|
||||
- Check request/response types match the contract
|
||||
- Verify error handling returns appropriate status codes
|
||||
6. **For database stories:**
|
||||
- Verify migration runs cleanly
|
||||
- Check indexes are created for query patterns
|
||||
- Verify foreign key constraints
|
||||
3. **Run tests yourself** — don't trust the generator's claim that tests pass
|
||||
4. **Verify it actually works** — build, run, or load the project. Use whatever tools are available.
|
||||
|
||||
## Common Generator Failures to Watch For
|
||||
## Common Generator Failures
|
||||
|
||||
- Created the file but didn't wire it into the application (route not registered, component not imported)
|
||||
- Tests exist but don't actually assert meaningful behavior
|
||||
- "Passes typecheck" but only because types are `any` or too loose
|
||||
- UI component renders but doesn't respond to interaction
|
||||
- API endpoint exists but returns hardcoded/mock data
|
||||
- Created the file but didn't wire it into the application
|
||||
- Tests exist but don't assert meaningful behavior
|
||||
- Passes typecheck only because types are overly loose
|
||||
- Code exists but doesn't actually run
|
||||
|
||||
Reference in New Issue
Block a user