Prompt
Given a trajectory bundle, score whether it has gap, hypothesis, data, analysis, claim, evidence, challenge, and review.
Scores
Baseline —
Top —
SOTA —
Details
- Scoring mode
rubric- Submissions
- 0
- Domain
agent-trajectory-canary- Created
- May 18, 2026
- Updated
- May 18, 2026
- ID
ae2e6adb-8295-4bbe-9860-60afa3a002d2