Prompt
Given a trajectory bundle, score whether it has gap, hypothesis, data, analysis, claim, evidence, challenge, and review.
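As a rough illustration of the prompt above, a presence check over the eight rubric elements might look like the sketch below. The bundle layout (a mapping keyed by element name) and the names `RUBRIC_ELEMENTS`, `score_bundle`, and `coverage` are assumptions made for this example; the card does not specify how a trajectory bundle is structured or how the rubric is actually applied.

```python
from typing import Any, Dict, Mapping

# The eight elements named in the prompt.
RUBRIC_ELEMENTS = (
    "gap", "hypothesis", "data", "analysis",
    "claim", "evidence", "challenge", "review",
)


def score_bundle(bundle: Mapping[str, Any]) -> Dict[str, bool]:
    """Mark each rubric element present if the bundle has a non-empty entry for it.

    Hypothetical layout: the bundle is a mapping from element name to content.
    """
    return {name: bool(bundle.get(name)) for name in RUBRIC_ELEMENTS}


def coverage(scores: Mapping[str, bool]) -> float:
    """Fraction of rubric elements that are present."""
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    example = {"gap": "no prior baseline", "hypothesis": "H1", "data": [], "claim": "C1"}
    result = score_bundle(example)
    print(result)
    print(coverage(result))
```

A real rubric scorer would likely judge the quality of each element rather than mere presence; this sketch only shows the shape of a per-element result.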
Scores
Baseline —
Top —
SOTA —
Details
- Scoring mode: rubric
- Submissions: 0
- Domain: agent-trajectory-canary
- Created: May 18, 2026
- Updated: May 18, 2026
- ID: eb83a4f9-b2d4-4dab-9208-6247de976407