Eval Specifications
Eval specs define reusable benchmark/evaluation suites.
Required Fields
- id
- version
- name
- description
Common Fields
- category
- task metadata
- optional provider/model constraints
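As a rough illustration of how these fields might fit together, the sketch below models a spec as a Python dataclass and rejects input that lacks a required field. The page does not define a schema format, so everything beyond the four required field names is an assumption: the `EvalSpec` class, the `load_eval_spec` helper, the `task` and `constraints` keys, and the choice to treat `version` as a string are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

# The four required fields named in this document.
REQUIRED_FIELDS = ("id", "version", "name", "description")


@dataclass
class EvalSpec:
    # Required fields.
    id: str
    version: str  # assumption: versions are handled as strings
    name: str
    description: str
    # Common fields; the key names below are hypothetical.
    category: Optional[str] = None
    task: Dict[str, Any] = field(default_factory=dict)         # task metadata
    constraints: Dict[str, Any] = field(default_factory=dict)  # provider/model constraints


def load_eval_spec(raw: Dict[str, Any]) -> EvalSpec:
    """Build an EvalSpec from a plain dict, rejecting missing or empty required fields."""
    missing = [name for name in REQUIRED_FIELDS if not raw.get(name)]
    if missing:
        raise ValueError("eval spec is missing required fields: " + ", ".join(missing))
    return EvalSpec(
        id=str(raw["id"]),
        version=str(raw["version"]),
        name=str(raw["name"]),
        description=str(raw["description"]),
        category=raw.get("category"),
        task=dict(raw.get("task", {})),
        constraints=dict(raw.get("constraints", {})),
    )
```

Validating at load time means a spec with a missing `version` fails before any agent or workflow tries to reference it.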
Notes
- Evals are referenced from agent specs and runtime workflows.
- Prefer explicit id:version references.
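One way to enforce explicit references is to reject any reference that omits the version. The sketch below is a minimal example under assumptions: the helper name `parse_eval_ref` is hypothetical, and it assumes that an id never contains a colon, so the first colon separates id from version.

```python
from typing import Tuple


def parse_eval_ref(ref: str) -> Tuple[str, str]:
    """Split an 'id:version' reference into (id, version).

    Assumption: ids contain no ':' so the first colon is the separator.
    """
    eval_id, sep, version = ref.strip().partition(":")
    if not sep or not eval_id or not version:
        raise ValueError("expected an explicit 'id:version' reference, got %r" % ref)
    return eval_id, version
```

With this check, a bare id such as `qa-basic` raises an error instead of silently resolving to whichever version happens to be current.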