Status
A formal paper is not published yet. The site is the public benchmark surface for the active release.
Reproducibility and release status for the current benchmark surface.
A formal paper is not published yet. The site is the public benchmark surface for the active release.
Public leaderboard, per-track tables, statement sample, score schema, and submission preview.
Paper draft, harness notes, submission format, reviewer protocol, and clearer release/version history.
Current release is operator-run. Submission review and reruns are batch-managed while the public flow is being formalized.
Paper status: in progress. Until then, methodology, dataset notes, and submission instructions are the primary public references.