paper

Paper

Reproducibility and release status for the current benchmark surface.

Updated Apr 13, 2026 · UTC

Status

A formal paper is not published yet. The site is the public benchmark surface for the active release.

Current release includes

Public leaderboard, per-track tables, statement sample, score schema, and submission preview.

Planned artifacts

Paper draft, harness notes, submission format, reviewer protocol, and clearer release/version history.

Reproducibility note

Current release is operator-run. Submission review and reruns are batch-managed while the public flow is being formalized.

Paper status: in progress. Until then, methodology, dataset notes, and submission instructions are the primary public references.