Code Audit
Usage
/audit <item>
What it does
Compares claims made in a paper against its public codebase. Surfaces mismatches, missing experiments, and reproducibility risks.
What it checks
- Do the reported hyperparameters match the code?
- Are all claimed experiments present in the repository?
- Does the training loop match the described methodology?
- Are there undocumented preprocessing steps?
- Do evaluation metrics match the paper’s claims?
Example
/audit 2401.12345
Output
An audit report with:
- Claim-by-claim verification
- Identified mismatches
- Missing components
- Reproducibility risk assessment