Replies: 3 comments
-
Thanks for raising this! The agent should verify tests before moving on. I have seen a similar behavior where it added a test and then another test without running the previous test or making it pass. It definitely breaks the TDD flow. I have only seen it do that once though. I wonder if it is language-specific (more confident in Python?) or related to the recent model changes. The current design ensures that the validator never blocks the agent from adding a single new test. I wanted the agent to always be able to move forward by creating a new test, but this seems to allow it to break the TDD cycle. Your suggestion is definitely an improvement. However, Claude Code sometimes ignores postToolUse messages. Could we address this programmatically instead? We could enrich the validator's context with an action history: """
This gives the validator context about what happened before each modification. We can start simple: which file changed, test results: passed/failed and enrich it further later as needed. This would also let us create integration tests for this behavior. What are your thoughts? :) |
Beta Was this translation helpful? Give feedback.
-
I have been experimenting with idea that first time claude tries to finish a task, he sees a message (and it can retry it to pass) and move the next task For now, I have conflicting thoughts about the experiment, coz i noticed claude tries to cheat/cheese the system a lot, even the regular tests tdd implementation. |
Beta Was this translation helpful? Give feedback.
-
I do liked your idea of implementation, its going to be a lot "claude proof" than my experiment. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
A lot of times claude doesn't rerun his test after implementation to make sure it passes now.
We can force him on task change or even just output a print that reminds claude to do it.
Is it a good idea in tdd flow?
Beta Was this translation helpful? Give feedback.
All reactions