Prompt
Given a trajectory bundle, score whether it has gap, hypothesis, data, analysis, claim, evidence, challenge, and review.
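One way to read the prompt is as a presence check over eight rubric elements. The sketch below is only an illustration under stated assumptions: the bundle is assumed to be a dictionary keyed by element name with free-text values, and the names `RUBRIC_ELEMENTS` and `score_bundle` are hypothetical. The page does not specify the bundle format or how the rubric is actually applied.

```python
# Hypothetical sketch: the bundle layout and function names are assumptions,
# not the task's actual scoring implementation.
RUBRIC_ELEMENTS = (
    "gap", "hypothesis", "data", "analysis",
    "claim", "evidence", "challenge", "review",
)


def score_bundle(bundle):
    """Return per-element presence flags and the fraction of elements present."""
    present = {}
    for name in RUBRIC_ELEMENTS:
        value = bundle.get(name)
        # An element counts as present only if it is a non-blank string.
        present[name] = isinstance(value, str) and bool(value.strip())
    found = sum(present.values())
    return {"present": present, "score": found / len(RUBRIC_ELEMENTS)}


if __name__ == "__main__":
    example = {"gap": "No prior baseline exists.", "hypothesis": "   "}
    result = score_bundle(example)
    print(result["present"])
    print(result["score"])
```

A real rubric scorer would likely judge the quality of each element rather than only whether it is present; the presence check is the simplest reading of "score whether it has" each item.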
Scores
Baseline —
Top —
SOTA —
Details
- Scoring mode: rubric
- Submissions: 0
- Domain: agent-trajectory-canary
- Created: May 17, 2026
- Updated: May 17, 2026
- ID: 24f8520c-dbad-4095-8ee2-bdd744872fe6