v0.1

@PabloCabaleiro released this 08 Sep 14:02
· 23 commits to main since this release
  • Schemas: CaseSpec, RubricCriterion/Rubric, RunResult, Judgment.
  • Judge: PydanticAIJudge (typed JSON result, model-agnostic).
  • API: evaluate_case(...) (sync wrapper around the async core).
  • CLI: pondera run <cases_dir> --runner ... --artifacts ....
  • Pytest helper: load_cases(), run_case(); sample test file using parametrize.
  • Artifacts: answer.md, judgment.json, summary.md, meta.json.
  • Docs: README, YAML schema reference, quickstart examples.
  • Tests: basic tests added.
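
The "sync wrapper calling async core" pattern mentioned for the API can be sketched generically as follows. This is a minimal illustration, not pondera's actual code: `SketchJudgment`, `evaluate_async_sketch`, `evaluate_sketch`, and the `threshold` parameter are hypothetical names invented for this example, and the scoring logic is a placeholder.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class SketchJudgment:
    # Hypothetical stand-in; NOT pondera's real Judgment schema.
    score: int
    passed: bool


async def evaluate_async_sketch(answer: str, threshold: int = 70) -> SketchJudgment:
    """Async core (hypothetical): where a runner and a judge would be awaited."""
    await asyncio.sleep(0)  # placeholder for real async I/O such as a model call
    score = 100 if answer.strip() else 0  # placeholder scoring, for illustration only
    return SketchJudgment(score=score, passed=score >= threshold)


def evaluate_sketch(answer: str, threshold: int = 70) -> SketchJudgment:
    """Sync wrapper (hypothetical): runs the async core on a fresh event loop."""
    return asyncio.run(evaluate_async_sketch(answer, threshold))


if __name__ == "__main__":
    print(evaluate_sketch("some answer"))
```

One design consequence of this pattern: `asyncio.run` raises a `RuntimeError` when called from a thread that already has a running event loop, so code that is itself async would await the async core directly rather than go through the sync wrapper.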