Commit 9f310fe

upgrading version
1 parent 5c9fcf9 commit 9f310fe

5 files changed: +16 -6 lines changed

CHANGELOG.md

Lines changed: 11 additions & 1 deletion
@@ -1,10 +1,18 @@
 # Changelog
 
+<!-- markdownlint-disable MD024 -->
+
 The format follows the principles of [Keep a Changelog](https://keepachangelog.com/en/1.0.0/) and uses [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
 ## [Unreleased](https://github.com/PabloCabaleiro/pondera/tree/main)
 
-### Added (Unreleased)
+## [v0.4.1](https://github.com/PabloCabaleiro/pondera/releases/tag/v0.4.1) - 2025-09-19
+
+<!-- markdownlint-disable-next-line MD024 -->
+### Changed
+
+- Internal: route all evaluations through the unified `multi_evaluate` function (single-case path now uses the same aggregation pipeline).
+- Prompting: moved part of the prior user prompt into the system prompt and refined the system prompt wording for clarity and consistency.
 
 ## [v0.4.0](https://github.com/PabloCabaleiro/pondera/releases/tag/v0.4.0) - 2025-09-12
 
@@ -18,6 +26,7 @@ The format follows the principles of [Keep a Changelog](https://keepachangelog.c
 - Structured error classes introduced: `RunnerError`, `JudgeError`, `TimeoutError` (subclass of `asyncio.TimeoutError`), and `ValidationError` with wrapping of raw exceptions in runner/judge execution and YAML load path.
 - Basic logging: added standard library logging calls (logger name `pondera`) in core API execution path and simple availability test.
 
+<!-- markdownlint-disable-next-line MD024 -->
 ### Changed
 
 - API now always returns `MultiEvaluationResult` (single run wrapped with one `EvaluationResult`) for a stable schema.
@@ -27,6 +36,7 @@ The format follows the principles of [Keep a Changelog](https://keepachangelog.c
 - Fail-fast on missing criterion scores when per-criterion thresholds provided (no silent 0 default); `compute_pass` now raises `ValidationError`.
 - BREAKING: normalized naming: removed `Judgment.pass_fail`/dual serialization; single boolean field `evaluation_passed` everywhere (tests & artifacts updated, no backward alias).
 
+<!-- markdownlint-disable-next-line MD024 -->
 ### Fixed
 
 - Updated tests to align with unified return type.
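
The v0.4.1 entry above says the single-case path now goes through `multi_evaluate`, and the v0.4.0 entry says the API always returns a `MultiEvaluationResult` wrapping one `EvaluationResult` for a single run. A minimal sketch of that shape follows. Only the names `multi_evaluate`, `MultiEvaluationResult`, `EvaluationResult`, and `evaluation_passed` come from the changelog; the fields, the `evaluate` wrapper, the `run_once` callable, the `repetitions` parameter, and the synchronous style are assumptions made for illustration and are not taken from the pondera source.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Callable


@dataclass
class EvaluationResult:
    # Hypothetical fields; the real pondera class is not shown in this commit.
    score: float
    evaluation_passed: bool


@dataclass
class MultiEvaluationResult:
    # Hypothetical aggregate: per-run results plus two summary values.
    evaluations: list[EvaluationResult] = field(default_factory=list)
    mean_score: float = 0.0
    pass_rate: float = 0.0


def multi_evaluate(
    run_once: Callable[[], EvaluationResult], repetitions: int = 1
) -> MultiEvaluationResult:
    """Run `run_once` `repetitions` times and aggregate the results."""
    if repetitions < 1:
        raise ValueError("repetitions must be >= 1")
    evaluations = [run_once() for _ in range(repetitions)]
    return MultiEvaluationResult(
        evaluations=evaluations,
        mean_score=mean(e.score for e in evaluations),
        pass_rate=sum(e.evaluation_passed for e in evaluations) / len(evaluations),
    )


def evaluate(run_once: Callable[[], EvaluationResult]) -> MultiEvaluationResult:
    """Single-case entry point: delegate to the same aggregation pipeline."""
    return multi_evaluate(run_once, repetitions=1)
```

With this arrangement, callers handle one return type whether they ran a case once or many times, which is the "stable schema" the v0.4.0 entry describes.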

README.md

Lines changed: 2 additions & 2 deletions
@@ -101,9 +101,9 @@ class ConstantJudge(JudgeProtocol):
 
 ```bash
 # Using uv (recommended)
-uv add 'git+ssh://git@github.com/PabloCabaleiro/pondera.git@v0.3.0'
+uv add 'git+ssh://git@github.com/PabloCabaleiro/pondera.git@v0.4.1'
 # or from source in editable mode
-uv pip install 'git+ssh://git@github.com/PabloCabaleiro/pondera.git@v0.3.0'
+uv pip install 'git+ssh://git@github.com/PabloCabaleiro/pondera.git@v0.4.1'
 ```
 
 The judge uses the pydantic-ai ecosystem. Configure provider credentials via env vars (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `AZURE_OPENAI_API_KEY`, etc.) plus optional `PONDERA_` settings.

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
 [project]
 name = "pondera"
-version = "0.2.0"
+version = "0.4.1"
 description = "YAML-first, pluggable runners & LLM-as-a-judge evaluation framework."
 readme = "README.md"
 requires-python = ">=3.10"

src/pondera/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,4 +22,4 @@
 "ValidationError",
 ]
 
-__version__ = "0.2.0"
+__version__ = "0.4.1"
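
This commit sets the version string by hand in two places, `pyproject.toml` and `src/pondera/__init__.py`, and both were still at `0.2.0` while the README already referenced `v0.3.0`. A small consistency check can catch that kind of drift. The sketch below is illustrative and not part of the repository: the helper names are hypothetical, and it scans the text line by line with a regular expression instead of using `tomllib`, since `tomllib` needs Python 3.11 and the project declares `requires-python = ">=3.10"`.

```python
import re


def read_pyproject_version(pyproject_text: str) -> str:
    """Return the `version` value from the [project] table of pyproject.toml text."""
    in_project_table = False
    for raw_line in pyproject_text.splitlines():
        line = raw_line.strip()
        if line.startswith("["):
            # Track the current TOML table; only [project] is relevant here.
            in_project_table = line == "[project]"
            continue
        if in_project_table:
            match = re.match(r'version\s*=\s*"([^"]+)"', line)
            if match:
                return match.group(1)
    raise ValueError("no version found in [project] table")


def read_module_version(init_text: str) -> str:
    """Return the value assigned to `__version__` in a module's source text."""
    match = re.search(r'^__version__\s*=\s*"([^"]+)"', init_text, re.MULTILINE)
    if match is None:
        raise ValueError("no __version__ assignment found")
    return match.group(1)


def versions_match(pyproject_text: str, init_text: str) -> bool:
    """True when both files declare the same version string."""
    return read_pyproject_version(pyproject_text) == read_module_version(init_text)


# Inline samples mirroring the post-commit state shown in the diffs above.
PYPROJECT_AFTER = '[project]\nname = "pondera"\nversion = "0.4.1"\n'
INIT_AFTER = '__all__ = ["ValidationError"]\n\n__version__ = "0.4.1"\n'
```

In a real checkout the two texts would be read from the files themselves, for example with `pathlib.Path.read_text`, and `versions_match` could run as a test or a pre-commit step.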

uv.lock

Lines changed: 1 addition & 1 deletion
Generated file; its diff is not rendered by default.
