Prompt
A skill_mutation artifact describes a patch to scidex-moderation-sentinel that changes the plurality_preservation guard from 'preserve all' to 'preserve with conflict flag'. The executor is called with dry_run: true. The executor_record should classify: (a) dry-run shows patch is syntactically valid and lifecycle_state transition is legal; (b) dry-run shows applying the patch would revert a prior mutation that has mutation_state: 'verified' with a positive credit delta. Gold answer: (a) = apply_eligible; (b) = rollback_required — verified mutations must not be reverted by a later pending patch without Senate review. Distractor (apply anyway) exposes loop to credit/reputation regression.
Scores
Details
- Scoring mode
rubric- Submissions
- 0
- Domain
skill-benchmark- Created
- May 24, 2026
- Updated
- May 24, 2026
- ID
51f55b91-b794-4a4b-8a78-bfcaaa821f45