Enhance types #93

mwildehahn · 2025-10-02T21:51:43Z

LMK how you feel about this -- I want to start on #61 but I wanted to have a more solid foundation of types before adding to that so I installed pyright and then leveraged codex to add the minimal types necessary to get pyright passing. I left out the examples and there were also some issues with dspy since dspy depends on gepa there is a circular import.

I'm not trying to open a can of worms with dynamic vs strictly typed python but I find it much easier to maintain / extend when there are clear typing contracts of the library.

LakshyAAAgrawal · 2025-10-06T19:31:29Z

Dear @mwildehahn ,

Thank you very much for the PR. I love types!

Pointers:

Can you look at the failing ruff above?
Can you introduce type checking as part of the CI pipeline or tests? I don't want strict type checking, as that can unnecessarily raise the bar to make changes to the code, but I would like to have soft type-checking, such that, the type checker ensures that all the type annotations that are already present are satisfied, but those that aren't are still OK. What do you think?

mwildehahn · 2025-10-06T20:15:38Z

Yup, sounds good.

…

On Mon, Oct 6, 2025 at 12:31 PM Lakshya A Agrawal ***@***.***> wrote: *LakshyAAAgrawal* left a comment (gepa-ai/gepa#93) <#93 (comment)> Dear @mwildehahn <https://github.com/mwildehahn> , Thank you very much for the PR. I love types! Pointers: 1. Can you look at the failing ruff above? 2. Can you introduce type checking as part of the CI pipeline or tests? I don't want strict type checking, as that can unnecessarily raise the bar to make changes to the code, but I would like to have soft type-checking, such that, the type checker ensures that all the type annotations that are already present are satisfied, but those that aren't are still OK. What do you think? — Reply to this email directly, view it on GitHub <#93 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFMAUNRRCNMSMWEVUMXEBD3WK7SRAVCNFSM6AAAAACIEYUIECVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTGNZTGU4DIOJQG4> . You are receiving this because you were mentioned.Message ID: ***@***.***>

LakshyAAAgrawal · 2025-10-06T21:14:50Z

pyrightconfig.json

+    "src/gepa/adapters/generic_rag_adapter",
+    "src/gepa/adapters/anymaths_adapter",
+    "src/gepa/adapters/terminal_bench_adapter",
+    "tests/test_rag_adapter/",


Shouldn't the entire tests directory be ignored?

LakshyAAAgrawal · 2025-10-08T08:41:26Z

tests/test_experiment_tracking.py

        assert experiment is not None
+        assert experiment is not None
+        assert experiment is not None
+        assert experiment is not None


This seems to be an error

LakshyAAAgrawal · 2025-10-13T06:22:49Z

src/gepa/adapters/default_adapter/default_adapter.py

            scores.append(score)

-            if capture_traces:
+            if capture_traces and trajectories is not None:


This should be an assert instead.

LakshyAAAgrawal · 2025-10-13T06:25:02Z

src/gepa/api.py

+def _is_default_datainst(candidate: object) -> TypeGuard[DefaultDataInst]:
+    if not isinstance(candidate, dict):
+        return False
+    input_text = candidate.get("input")
+    additional_context = candidate.get("additional_context")
+    answer = candidate.get("answer")
+    if not isinstance(input_text, str) or not isinstance(answer, str):
+        return False
+    if not isinstance(additional_context, dict):
+        return False
+    return all(isinstance(key, str) and isinstance(value, str) for key, value in additional_context.items())
+
+
+def _is_default_dataset(dataset: Sequence[object]) -> TypeGuard[list[DefaultDataInst]]:
+    if not isinstance(dataset, list):
+        return False
+    return all(_is_default_datainst(item) for item in dataset)
+

 def optimize(
    seed_candidate: dict[str, str],
-    trainset: list[DataInst],
-    valset: list[DataInst] | None = None,
-    adapter: GEPAAdapter[DataInst, Trajectory, RolloutOutput] | None = None,
-    task_lm: str | Callable | None = None,
+    trainset: list[DataInstT],
+    valset: list[DataInstT] | None = None,
+    adapter: GEPAAdapter[DataInstT, TrajectoryT, RolloutOutputT] | None = None,
+    task_lm: str | TaskModelCallable | None = None,


Could you please clarify why these methods are created?

mwildehahn · 2025-10-13T22:24:31Z

src/gepa/api.py

+RolloutOutputT = TypeVar("RolloutOutputT")
+
+
+def _is_default_dataset(value: object) -> TypeGuard[list[DefaultDataInst]]:


@LakshyAAAgrawal I couldn't find a way around adding this type guard and the optimize + _optimize_impl. The issue is trying to maintain a fully generic optimize function but then also provide the default adapter if none is provided.

We have to do the type guard to validate the default dataset and then can init the default adapter and pass it.

I was trying to avoid changing the public API for this, lmk if there is something else you'd want to see here.

LakshyAAAgrawal reviewed Oct 6, 2025

View reviewed changes

LakshyAAAgrawal reviewed Oct 8, 2025

View reviewed changes

LakshyAAAgrawal reviewed Oct 13, 2025

View reviewed changes

mwildehahn added 2 commits October 13, 2025 15:15

Install pyright

8620902

Enhance types

cf87e74

mwildehahn force-pushed the mh/pyright branch from 2168f2e to cf87e74 Compare October 13, 2025 22:18

mwildehahn commented Oct 13, 2025

View reviewed changes

mwildehahn added 3 commits October 13, 2025 15:34

Fix ruff errors

d593738

Run ruff

378549b

Add type checking to CI

d90e81b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhance types #93

Enhance types #93

Uh oh!

mwildehahn commented Oct 2, 2025

Uh oh!

LakshyAAAgrawal commented Oct 6, 2025

Uh oh!

mwildehahn commented Oct 6, 2025 via email

Uh oh!

LakshyAAAgrawal Oct 6, 2025

Uh oh!

LakshyAAAgrawal Oct 8, 2025

Uh oh!

LakshyAAAgrawal Oct 13, 2025

Uh oh!

LakshyAAAgrawal Oct 13, 2025

Uh oh!

mwildehahn Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		RolloutOutputT = TypeVar("RolloutOutputT")


		def _is_default_dataset(value: object) -> TypeGuard[list[DefaultDataInst]]:

Enhance types #93

Are you sure you want to change the base?

Enhance types #93

Uh oh!

Conversation

mwildehahn commented Oct 2, 2025

Uh oh!

LakshyAAAgrawal commented Oct 6, 2025

Uh oh!

mwildehahn commented Oct 6, 2025 via email

Uh oh!

LakshyAAAgrawal Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

LakshyAAAgrawal Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

LakshyAAAgrawal Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

LakshyAAAgrawal Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

mwildehahn Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants