chore(tooling): add CLAUDE.md (ethereum#1749)

danceratopz · marioevz · fselmo · Carsons-Eels · commit 0c00c94036df · 2025-07-01T05:39:15.000-04:00
Co-authored-by: Mario Vega <marioevz@gmail.com>
Co-authored-by: felipe <fselmo2@gmail.com>
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,200 @@
+# CLAUDE.md - Ethereum Execution Spec Tests
+
+> **CRITICAL**: This repository aims to provide excellent tooling for generating JSON test vectors that test Ethereum execution layer clients. Correctness is the absolute priority. The repo prioritizes a contributor-first mindset.
+
+## 🎯 Core Purpose
+
+- `./tests/`: Python tests that **generate JSON test vectors (fixtures)** via `fill` command
+- `./src/pytest_plugins/filler/`: Implements `fill` command (test vector generation from Python source)
+- `./src/pytest_plugins/consume/`: Implements `consume` command (test vector execution)
+- `./src/pytest_plugins/execute/`: Implements `execute` command (live JSON-RPC testing from Python source)
+- `./src/ethereum_test_*`: Core framework libraries and data structures
+
+### Key Terminology (CRITICAL)
+
+**"Fixtures" has TWO meanings:**
+
+1. **Test Fixtures** (JSON files) - The test vectors this framework generates
+2. **Pytest Fixtures** - Standard pytest setup/teardown (`pre`, `state_test`, etc.)
+
+### Workflows
+
+```text
+Fill/Consume: Python Tests → fill → JSON Fixtures → consume → Client Testing
+Execute: Python Tests → execute → Live JSON-RPC Testing
+```
+
+## 🚀 Essential Commands
+
+Run commands through `uv`: use the `uv run` prefix for project commands and `uvx` for standalone tools.
+
+### Setup
+
+```bash
+uv sync --all-extras
+uv run solc-select use 0.8.24 --always-install
+uvx pre-commit install
+```
+
+### Core Workflow
+
+```bash
+# Create test
+uv run eest make test
+
+# Generate fixtures (PRIMARY WORKFLOW)
+uv run fill --fork=Prague path/to/test.py --clean -v -m "not slow"
+
+# Execute against client
+uv run consume direct --bin=evm fixtures/
+
+# Framework testing
+uv run pytest -c pytest-framework.ini path/to/test.py::test_function
+```
+
+### Quality Checks
+
+```bash
+# Check code style and errors
+uv run ruff check src tests .github/scripts
+
+# Format code
+uv run ruff format src tests .github/scripts
+
+# Fix auto-fixable issues
+uv run ruff check --fix src tests .github/scripts
+
+# Type checking
+uv run mypy src tests .github/scripts
+
+# Framework unit tests
+uv run pytest -c pytest-framework.ini -n auto -m "not run_in_serial"
+uv run pytest -c pytest-framework.ini -m run_in_serial
+
+# Run specific checks (fast checks)
+uvx --with=tox-uv tox -e lint,typecheck,spellcheck
+
+# Local docs check (fast mode: these warnings can be ignored "WARNING -  Doc file 'writing_tests/..."):
+export FAST_DOCS=true && export GEN_TEST_DOC_VERSION="tox" && uv run mkdocs build
+```
+
+## 🎯 Core Framework Rules
+
+### NEVER Use Hardcoded Addresses
+
+```python
+def test_example(pre: Alloc, state_test: StateTestFiller):
+    # ✅ Dynamic allocation
+    sender = pre.fund_eoa()
+    contract = pre.deploy_contract(code=Op.SSTORE(1, 1))
+    
+    tx = Transaction(sender=sender, to=contract, gas_limit=5_000_000)
+    state_test(pre=pre, tx=tx, post={contract: Account(storage={1: 1})})
+```
+
+### Key Methods
+
+- `pre.deploy_contract(code, **kwargs) -> Address`
+- `pre.fund_eoa(amount=None, **kwargs) -> EOA`
+- `pre.fund_address(address, amount)`
+
+### Gas Calculation Pattern
+
+```python
+intrinsic_gas_calculator = fork.transaction_intrinsic_cost_calculator()
+tx_gas_limit = intrinsic_gas_calculator(
+    calldata=tx_data,
+    contract_creation=False,
+    access_list=access_lists,
+) + 100_000
+```
+
+## 📁 Key Directories
+
+```text
+src/
+├── ethereum_test_tools/     # Core framework
+├── ethereum_test_types/     # Type definitions
+├── ethereum_test_fixtures/  # Pydantic models for test fixtures
+├── pytest_plugins/         # Plugin system
+tests/                       # Test cases by fork
+fixtures/                    # Generated test vectors (default output directory)
+```
+
+## ⚠️ Critical Anti-Patterns
+
+- ❌ Hardcoded addresses (use `pre` fixture)
+- ❌ `TestAddress` in new tests (use `pre.fund_eoa()`)
+- ❌ Missing `sender` parameter in transactions
+- ❌ Missing `@pytest.mark.valid_from("Fork")` markers
+- ❌ Manual nonce management
+
+## 🔧 Common Patterns
+
+### Fork Compatibility
+
+```python
+@pytest.mark.valid_from("Cancun")
+def test_example(pre: Alloc, state_test: StateTestFiller, fork: Fork):
+    if fork >= Cancun:  # `Cancun` and `Fork` come from `ethereum_test_forks`
+        ...  # Cancun-specific logic
+    else:
+        ...  # Pre-Cancun logic
+```
+
+### Parameterized Tests
+
+```python
+@pytest.mark.parametrize("value", [0, 1, 2**256-1])
+def test_with_params(value: int, pre: Alloc, state_test: StateTestFiller): ...
+```
+
+## 🐛 Debugging Test Filling
+
+### Generate EVM Traces
+
+```bash
+uv run fill --fork=Prague --evm-dump-dir=debug_output/ --traces path/to/test.py
+jq -r '.opName' debug_output/**/*.jsonl
+```
+
+### Common Issues
+
+- **Fill failures**: Check gas limits (increase if needed, use `transaction_intrinsic_cost_calculator`)
+- **Address conflicts**: Always use `pre` fixture for dynamic allocation
+- **Test collection**: Functions must start with `test_`
+- **Import errors**: Check dependencies in `pyproject.toml`, run `uv sync --all-extras`
+
+## 📝 Code Standards
+
+- **Line length**: 100 characters
+- **Type annotations**: Required
+- **Import style**: Explicit imports only, no `import *`, no local imports.
+- **Path handling**: Use `pathlib`
+- **Code**: Use idiomatic python, prioritize readability.
+- **Docstrings**: Always include them for methods and classes. Write one-liners on a single line, with the quotes on the same line as the text: `"""Use one single-line docstring with quotes on same line."""`
+
+## Commit Format
+
+```text
+<type>(<scope>): <description>
+
+# Types: feat, fix, docs, style, refactor, test, chore, new
+# Scopes: evm, forks, tools, pytest, tests, docs, ci, consume, fill, eest
+```
+
+## 🔍 Tool Preferences
+
+- **Search**: `rg "pattern" --type python` (not grep)
+- **JSON**: `jq -r '.field' file.json`
+- **GitHub**: `gh` CLI for all operations
+
+## 🎯 Development Workflow
+
+1. `uv run eest make test` - Create test
+2. Implement tests using `pre` fixture patterns
+3. `uv run fill --fork=Fork --clean -v tests/path/to/module` - Generate fixtures
+4. `uv run ruff check --fix` - Fix linting
+5. Commit with semantic format
+
+**Critical**: Always run linting and type checking. Use `--clean` when filling. Never use hardcoded addresses.
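
Note on the "Commit Format" section of `CLAUDE.md` above: the `<type>(<scope>): <description>` shape can be checked mechanically. The following is a minimal sketch and not part of the repository. `is_valid_commit_title` is a hypothetical helper, and treating the listed types and scopes as exhaustive is an assumption, since the section does not say the lists are closed.

```python
import re

# Types and scopes copied from the "Commit Format" section of CLAUDE.md.
# Assumption: both lists are exhaustive; relax them if the project allows more.
TYPES = ("feat", "fix", "docs", "style", "refactor", "test", "chore", "new")
SCOPES = ("evm", "forks", "tools", "pytest", "tests", "docs", "ci", "consume", "fill", "eest")

# Matches "<type>(<scope>): <description>" with a non-empty description.
COMMIT_TITLE_RE = re.compile(
    rf"^(?P<type>{'|'.join(TYPES)})"
    rf"\((?P<scope>{'|'.join(SCOPES)})\): "
    r"(?P<description>\S.*)$"
)


def is_valid_commit_title(title: str) -> bool:
    """Return True if the title follows the documented commit format."""
    return COMMIT_TITLE_RE.match(title) is not None
```

With this sketch, a hypothetical title such as `feat(fill): add a new flag` passes, while `update stuff` or `feat(fill):missing space` is rejected.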
diff --git a/HUMANS.md b/HUMANS.md
@@ -0,0 +1,169 @@
+# HUMANS.md - Working with Claude and LLMs in Ethereum Execution Spec Tests
+
+This guide helps human developers understand the dependencies and get the most out of Claude and other LLMs when working with this codebase.
+
+## 🤖 Why This Repository Has LLM Support
+
+Humans are faster when they use LLMs correctly.
+
+## 📋 Required Dependencies for LLM-Assisted Development
+
+### Requirements
+
+#### LLM Context File
+
+- **CLAUDE.md** - Primary LLM guidance (keep up-to-date). Use `#memorize` in Claude to update with new info.
+
+### Recommended Available Tooling (for use with Claude)
+
+1. **GitHub CLI**: `gh` for PR and issue management.
+2. **ripgrep**: `rg` for fast code searching.
+3. **jq**: For JSON analysis and EVM trace debugging.
+4. **markdownlint-cli**: To verify markdown files (CI enforced).
+5. **VS Code**: With recommended extensions (see [setup guide](docs/getting_started/setup_vs_code.md)). Run `claude` in VS Code for the best results.
+
+## 🎯 Getting the Best Results from Claude
+
+### 1. Provide Relevant Context
+
+**Always mention:**
+
+- What you're trying to accomplish.
+- Which part of the codebase you're working on (`tests/`, `src/`, `docs/`).
+- Any error messages or specific issues you're encountering.
+
+**Example - Good Context**:
+> "I'm writing a new test for EIP-7702 in `tests/prague/eip7702_set_code_tx/`. The test should verify that delegation target validation works correctly. The test fails to fill when running `uv run fill --fork=Prague path/to/test.py`"
+
+**Example - Poor Context**:
+> "My test isn't working"
+
+### 2. Reference Key Documentation
+
+When asking Claude for help, mention which documentation you've already checked:
+
+- "I've read the test patterns in CLAUDE.md but..."
+- "According to the debugging section in CLAUDE.md..."
+- "The CONTRIBUTING.md mentions X, but I need help with Y..."
+
+### 3. Share Specific Commands and Output
+
+Claude works best with concrete information:
+
+```bash
+# Share the exact command you ran
+uv run fill --fork=Prague tests/cancun/eip4844_blobs/test_blob_txs.py --clean -v
+
+# Include relevant error output
+ERROR: Failed to compile Yul source: ...
+```
+
+### 4. Ask for Complete Solutions
+
+Request end-to-end guidance rather than partial answers:
+
+- "Show me the complete test function with proper imports."
+- "What's the full workflow from creating the test to verifying it works?"
+- "Include the commands I need to run to validate this change."
+
+## 🚀 Optimizing LLM Workflows
+
+### Quick Start Template
+
+When starting a new (and complicated) task, provide Claude with something similar to the following template:
+
+```console
+I'm working on [describe task] in the Ethereum execution-spec-tests repository.
+
+**Context**:
+- Working directory: [tests/shanghai/, src/ethereum_test_tools/, etc.]
+- Trying to: [specific goal]
+- Current status: [what you've tried, any errors]
+
+**References**:
+- Checked CLAUDE.md section: [which sections you've read]
+- Related documentation: [any other docs you've reviewed]
+
+**Specific question**: [exactly what you need help with]
+```
+
+### Debugging Template
+
+For troubleshooting issues:
+
+````console
+I'm encountering [specific error] when [doing what].
+
+**Command run**:
+```bash
+[exact command that failed]
+```
+
+**Error output**:
+
+```
+[full error message]
+```
+
+**What I've tried**:
+
+- [list previous attempts].
+
+**Environment details**:
+
+```bash
+uv run eest info
+```
+
+**Request**: Please help me understand what's wrong and provide the fix.
+
+````
+
+### Code Review Template
+
+When asking Claude to review code:
+
+````console
+Please review this [test/function/module] for:
+
+- Compliance with project standards (CLAUDE.md, code_standards.md).
+- Correct usage of the `pre` fixture.
+- Proper error handling and type annotations.
+- Performance considerations.
+
+**Code**:
+
+```python
+[your code here]
+```
+
+**Specific concerns**: [any particular areas you're unsure about]
+````
+
+## 🧠 Understanding LLM Limitations
+
+### What Claude Excels At
+
+- ✅ **Code patterns and structure**.
+- ✅ **Following established conventions**.
+- ✅ **Debugging common issues**.
+- ✅ **Explaining complex concepts**.
+- ✅ **Generating boilerplate code**.
+- ✅ **Reviewing code for standards compliance**.
+
+### What to Verify Independently
+
+- ⚠️ **Latest dependency versions** (always check official docs).
+- ⚠️ **New EIP specifications** (verify against ethereum/EIPs).
+- ⚠️ **Breaking changes** in recent updates.
+- ⚠️ **Environment-specific issues** (OS, architecture).
+- ⚠️ **Security implications** of suggestions.
+
+### For Effective LLM Collaboration
+
+- Provide clear, specific prompts.
+- Break complex tasks into smaller pieces.
+- Always validate LLM output against standards.
+- Use the codebase's existing patterns as examples.
+
+Remember: **LLMs are powerful tools that work best when given good context and clear objectives.** The better you understand this codebase, the better you can guide Claude to help you effectively.
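
Note on "Share Specific Commands and Output" in `HUMANS.md` above: the advice amounts to capturing the exact command together with everything it printed. The following is a minimal sketch of a helper that does this using only the Python standard library. `format_command_report` is a hypothetical name that does not exist in the repository, and the section labels merely mirror the Debugging Template.

```python
import shlex
import subprocess
import sys


def format_command_report(command: list[str]) -> str:
    """Run a command and format its command line, exit code, and output as text."""
    # check=False: a failing command is exactly what we want to report, not raise on.
    result = subprocess.run(command, capture_output=True, text=True, check=False)
    lines = [
        "**Command run**:",
        shlex.join(command),
        "",
        f"**Exit code**: {result.returncode}",
        "",
        "**Output**:",
        (result.stdout + result.stderr).strip(),
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder command; substitute the invocation that actually failed.
    print(format_command_report([sys.executable, "-c", "print('hello')"]))
```

The returned text can be pasted directly into a prompt, so the command and its output stay together and unedited.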
diff --git a/docs/CHANGELOG.md b/docs/CHANGELOG.md
@@ -52,6 +52,7 @@ Users can select any of the artifacts depending on their testing needs for their
 - ✨ Add documentation "Running Tests" that explains the different methods available to run EEST tests and reference guides for running `consume` and `hive`: ([#1172](https://github.com/ethereum/execution-spec-tests/pull/1172)).
 - ✨ Added a new `eest` sub-command, `eest info`, to easily print a cloned EEST repository's version and the versions of relevant tools, e.g., `python`, `uv` ([#1621](https://github.com/ethereum/execution-spec-tests/pull/1621)).
 - ✨ Add `CONTRIBUTING.md` for execution-spec-tests and improve coding standards documentation ([#1604](https://github.com/ethereum/execution-spec-tests/pull/1604)).
+- ✨ Add `CLAUDE.md` to help developers work in @ethereum/execution-spec-tests with [Claude Code](https://docs.anthropic.com/en/docs/claude-code/overview) ([#1749](https://github.com/ethereum/execution-spec-tests/pull/1749)).
 - ✨ Use `codespell` instead of `pyspelling` to spell-check python and markdown sources ([#1715](https://github.com/ethereum/execution-spec-tests/pull/1715)).
 - 🔀 Updated from pytest 7 to [pytest 8](https://docs.pytest.org/en/stable/changelog.html#features-and-improvements), benefits include improved type hinting and hook typing, stricter mark handling, and clearer error messages for plugin and metadata development ([#1433](https://github.com/ethereum/execution-spec-tests/pull/1433)).
 - 🐞 Fix bug in ported-from plugin and coverage script that made PRs fail with modified tests that contained no ported tests ([#1661](https://github.com/ethereum/execution-spec-tests/pull/1661)).