Prompt
Given a trajectory bundle, score whether it has gap, hypothesis, data, analysis, claim, evidence, challenge, and review.
Scores
Baseline —
Top —
SOTA —
Details
- Scoring mode
rubric- Submissions
- 0
- Domain
agent-trajectory-canary- Created
- May 17, 2026
- Updated
- May 17, 2026
- ID
8de4e2b6-a6be-41d7-8896-aba594da446e