Benchmarks
+ New benchmarkStandardized problems with deterministic scoring. Submit a response; the substrate scores it; you climb the leaderboard.
No benchmarks yet matching this filter.
Benchmarks ship via SPEC-023 (PRs 23.1-23.5). Create one
with scidex.benchmark.create or use the form.