vcs-python
diff --git a/‎notes/2025-03-08 - test-audit - pydantic-plan.md
Lines changed: 206 additions & 0 deletions b/‎notes/2025-03-08 - test-audit - pydantic-plan.md
Lines changed: 206 additions & 0 deletions
diff --git a/‎notes/2025-03-08 - test-audit - test plan.md
Lines changed: 12 additions & 16 deletions b/‎notes/2025-03-08 - test-audit - test plan.md
Lines changed: 12 additions & 16 deletions
diff --git a/‎notes/TODO.md
Lines changed: 63 additions & 27 deletions b/‎notes/TODO.md
Lines changed: 63 additions & 27 deletions
@@ -0,0 +1,206 @@
+# VCSPull Test Coverage Audit and Pydantic Integration Plan
+
+## Overview
+
+VCSPull has a good overall test coverage of 85%, but certain modules like the validator need improvement. This updated plan outlines how to enhance the codebase using Pydantic for better validation and type safety.
+
+## Coverage Metrics
+
+```
+Name                                     Stmts   Miss Branch BrPart  Cover   Missing
+------------------------------------------------------------------------------------
+conftest.py                                 39      8      4      1    79%   31-32, 91-98
+src/vcspull/_internal/config_reader.py      39      5     12      3    84%   50, 69, 114, 160, 189
+src/vcspull/cli/sync.py                     85     14     34     11    79%   29, 61, 76->78, 81, 89, 91, 109-111, 115, 129-130, 132-133, 142, 151->153, 153->155, 160
+src/vcspull/config.py                      148     10     88     13    89%   105, 107->110, 110->117, 121, 128-131, 151->153, 220->235, 266, 281, 307, 342->344, 344->347, 424
+src/vcspull/log.py                          55      8      4      1    85%   39, 67-96, 105-106
+src/vcspull/validator.py                    18      6     16      6    65%   17, 21, 24, 27, 31, 34
+------------------------------------------------------------------------------------
+TOTAL                                      414     51    170     35    85%
+```
+
+## Pydantic Integration Plan
+
+### 1. Core Model Definitions
+
+Replace the current TypedDict-based system with Pydantic models to achieve better validation and type safety:
+
+1. **Base Models**
+   - Create `RawConfigBaseModel` to replace `RawConfigDict`
+   - Create `ConfigBaseModel` to replace `ConfigDict`
+   - Implement field validations with descriptive error messages
+
+2. **Nested Models Structure**
+   - `Repository` model for repository configuration 
+   - `ConfigSection` model for config sections
+   - `Config` model for the complete configuration
+
+3. **Validator Replacement**
+   - Use Pydantic validators instead of manual function-based validation
+   - Implement field-level validators for URLs, paths, and VCS types
+   - Create model methods for complex validation scenarios
+
+### 2. Error Handling Integration
+
+Enhance the exception system to work seamlessly with Pydantic validation:
+
+1. **Exception Integration**
+   - Create adapters between Pydantic validation errors and VCSPull exceptions
+   - Enrich error messages with contextual information
+   - Provide suggestions for fixing validation errors
+
+2. **Error Reporting**
+   - Improve error messages with field-specific context
+   - Add schema validation details in error messages
+   - Include path information in nested validation errors
+
+### 3. Configuration Processing Updates
+
+Update the configuration processing to leverage Pydantic models:
+
+1. **Parsing and Loading**
+   - Update config reader to return Pydantic models
+   - Maintain backward compatibility for existing code
+   - Add serialization methods for different output formats
+
+2. **Path Handling**
+   - Implement path validators with environment variable expansion
+   - Add path normalization in model fields
+   - Handle relative and absolute paths correctly
+
+3. **URL Processing**
+   - Add URL validators for different VCS schemes
+   - Implement URL normalization in model fields
+   - Add protocol-specific validation
+
+## Testing Strategy
+
+### 1. Model Testing
+
+1. **Unit Tests for Models**
+   - Test model instantiation with valid data
+   - Test model validation errors with invalid data
+   - Test model serialization and deserialization
+   - Test backward compatibility with existing data structures
+
+2. **Validation Logic Tests**
+   - Test field validators individually
+   - Test model validators for complex validations
+   - Test conversion between different model types
+   - Test error message generation and context
+
+### 2. Integration Testing
+
+1. **Config Loading Tests**
+   - Test loading configurations from files with Pydantic models
+   - Test backward compatibility with existing files
+   - Test error scenarios and validation failures
+
+2. **End-to-End Flow Tests**
+   - Test CLI operations with Pydantic-based config handling
+   - Test sync operations with validated models
+   - Test error handling and recovery in full workflows
+
+### 3. Regression Testing
+
+1. **Migration Tests**
+   - Ensure existing tests pass with Pydantic models
+   - Verify that all edge cases are still handled correctly
+   - Test performance impact of model-based validation
+
+2. **Backward Compatibility Tests**
+   - Test with existing configuration files
+   - Ensure command-line behavior remains consistent
+   - Verify API compatibility for external consumers
+
+## Implementation Plan
+
+### Phase 1: Core Model Implementation
+
+1. **Create Base Pydantic Models**
+   - Implement `models.py` with core Pydantic models
+   - Add field validators and descriptive error messages
+   - Implement serialization and deserialization methods
+
+2. **Update Types Module**
+   - Update type aliases to use Pydantic models
+   - Create Protocol interfaces for structural typing
+   - Maintain backward compatibility with TypedDict types
+
+3. **Validator Integration**
+   - Replace manual validators with Pydantic validators
+   - Integrate with existing exception system
+   - Improve error messages with context and suggestions
+
+### Phase 2: Config Processing Updates
+
+1. **Update Config Reader**
+   - Modify config reader to use Pydantic parsing
+   - Update config loading functions to return models
+   - Add path normalization and environment variable expansion
+
+2. **Sync Operations Integration**
+   - Update sync operations to use validated models
+   - Improve error handling with model validation
+   - Add type safety to repository operations
+
+3. **CLI Updates**
+   - Update CLI modules to work with Pydantic models
+   - Improve error reporting with validation details
+   - Add schema validation to command line options
+
+### Phase 3: Testing and Documentation
+
+1. **Update Test Suite**
+   - Update existing tests to work with Pydantic models
+   - Add tests for model validation and error scenarios
+   - Implement property-based testing for validation
+
+2. **Documentation**
+   - Document model schemas and field constraints
+   - Add examples of model usage in docstrings
+   - Create API documentation for Pydantic models
+
+3. **Performance Optimization**
+   - Profile model validation performance
+   - Optimize critical paths if needed
+   - Implement caching for repeated validations
+
+## Expected Benefits
+
+1. **Improved Type Safety**
+   - Runtime validation of configuration data
+   - Better IDE autocomplete and suggestions
+   - Clearer type hints for developers
+
+2. **Better Error Messages**
+   - Specific error messages for validation failures
+   - Context-rich error information
+   - Helpful suggestions for fixing issues
+
+3. **Reduced Boilerplate**
+   - Less manual validation code
+   - Automatic serialization and deserialization
+   - Built-in schema validation
+
+4. **Enhanced Maintainability**
+   - Self-documenting data models
+   - Centralized validation logic
+   - Easier to extend and modify
+
+## Metrics for Success
+
+1. **Type Safety**
+   - Pass mypy in strict mode with zero warnings
+   - 100% of functions have type annotations
+   - All configuration types defined as Pydantic models
+
+2. **Test Coverage**
+   - Overall test coverage > 90%
+   - Core modules coverage > 95%
+   - All public APIs have tests
+
+3. **Documentation**
+   - All public APIs documented
+   - All Pydantic models documented
+   - Examples for all major features 
@@ -15,24 +15,20 @@ Throughout this plan, we'll ensure all code follows these standards:
    - Always import typing as a namespace: `import typing as t`
 
 2. **Mypy Configuration**
-   - Use strict mode (`--strict`) for mypy checking
-   - Enable all error checks in the mypy configuration:
-     ```
-     [mypy]
+   - ✓ Strict mode is already enabled in `pyproject.toml` under `[tool.mypy]`
+   - ✓ The project uses the following mypy configuration:
+     ```toml
+     [tool.mypy]
      python_version = 3.9
-     warn_return_any = True
-     warn_unused_configs = True
-     disallow_untyped_defs = True
-     disallow_incomplete_defs = True
-     check_untyped_defs = True
-     disallow_untyped_decorators = True
-     no_implicit_optional = True
-     strict_optional = True
-     warn_redundant_casts = True
-     warn_unused_ignores = True
-     warn_no_return = True
-     warn_unreachable = True
+     warn_unused_configs = true
+     files = [
+       "src",
+       "tests",
+     ]
+     strict = true
      ```
+   - All necessary error checks are enabled via the `strict = true` setting
+   - Remaining task: Add CI checks for type validation
 
 3. **Python 3.9+ Features**
    - Use built-in generic types when possible (but always access typing via namespace)
 
@@ -2,18 +2,35 @@
 
 This document outlines the tasks needed to improve the test coverage, type safety, and overall quality of the VCSPull codebase based on the test audit plan.
 
+## Progress Update (2025-03-08)
+
+- ⬜ Initiated Pydantic integration for improved type safety and validation
+  - Plan to replace TypedDict with Pydantic models
+  - Will use Pydantic validators instead of manual validation functions
+  - Will leverage Pydantic's built-in error handling
+
+- ⬜ Enhanced test coverage for the validator module
+  - Will add tests for edge cases and complex configurations
+  - Will ensure all tests pass with mypy in strict mode
+  - Need to update tests to work with Pydantic models
+
 ## 1. Type Safety Improvements
 
+- [ ] **Implement Pydantic Models**
+  - [ ] Replace TypedDict definitions with Pydantic models
+  - [ ] Add field validators with meaningful error messages
+  - [ ] Use Pydantic's built-in error handling
+  - [ ] Create model hierarchies for nested configurations
+
 - [ ] **Enhance Exception Hierarchy**
   - [ ] Expand `exc.py` with specific exception types for different error scenarios
   - [ ] Add rich exception metadata (path, url, suggestions, risk level)
-  - [ ] Add proper typing to all exception classes
+  - [ ] Integrate exceptions with Pydantic validation errors
 
 - [ ] **Improve Type Definitions**
-  - [ ] Revise `types.py` to use more specific types (avoid Any)
-  - [ ] Create type aliases for complex types to improve readability
-  - [ ] Add Protocols for structural typing where appropriate
-  - [ ] Ensure all TypedDict definitions are complete and accurate
+  - [ ] Revise `types.py` to use Pydantic models instead of TypedDict
+  - [ ] Create model aliases for complex types to improve readability
+  - [ ] Add Protocol interfaces for structural typing where appropriate
 
 - [ ] **Type Annotation Completeness**
   - [ ] Audit all functions for missing type annotations
@@ -22,8 +39,8 @@ This document outlines the tasks needed to improve the test coverage, type safet
   - [ ] Properly annotate all class methods
 
 - [ ] **Configure Strict Type Checking**
-  - [ ] Set up `mypy.ini` with strict mode enabled
-  - [ ] Enable all recommended type checking flags
+  - [ ] Strict mode enabled in `pyproject.toml` under `[tool.mypy]`
+  - [ ] Recommended type checking flags enabled
   - [ ] Add CI checks for type validation
 
 ## 2. Test Coverage Improvements
@@ -47,15 +64,16 @@ This document outlines the tasks needed to improve the test coverage, type safet
   - [ ] Test recovery mechanisms
 
 - [ ] **Validator Module**
-  - [ ] Add tests for each validation function
+  - [ ] Update validator tests to use Pydantic models
+  - [ ] Add tests for each validation function and validator
   - [ ] Test validation of malformed configurations
   - [ ] Ensure all validators throw appropriate exceptions
 
 ## 3. Test Infrastructure
 
 - [ ] **Improve Test Fixtures**
   - [ ] Create reusable fixtures for common test scenarios
-  - [ ] Implement typed fixtures using Protocols
+  - [ ] Implement typed fixtures using Protocols and Pydantic models
   - [ ] Add fixtures for different repository types (git, svn, etc.)
 
 - [ ] **Add Property-Based Testing**
@@ -76,10 +94,11 @@ This document outlines the tasks needed to improve the test coverage, type safet
   - [ ] Document possible exceptions and error conditions
   - [ ] Add type information to docstrings (NumPy format)
 
-- [ ] **Add Type Documentation**
-  - [ ] Document complex type behavior
-  - [ ] Add clear explanations for TypedDict usage
-  - [ ] Document Protocol implementations
+- [ ] **Add Pydantic Model Documentation**
+  - [ ] Document model schemas and field constraints
+  - [ ] Add examples of model usage
+  - [ ] Document validation logic and error messages
+  - [ ] Create API documentation for Pydantic models
 
 ## 5. Refactoring for Testability
 
@@ -108,29 +127,46 @@ This document outlines the tasks needed to improve the test coverage, type safet
 ## Prioritized Tasks
 
 1. **Immediate Priorities**
-   - Enhance exception hierarchy
-   - Complete type annotations
-   - Configure strict type checking
-   - Add tests for core configuration functionality
+   - [ ] Implement base Pydantic models for configuration
+   - [ ] Integrate Pydantic validation with existing validation logic
+   - [ ] Configure strict type checking
+   - [ ] Update validator tests to work with Pydantic models
 
 2. **Medium-term Goals**
-   - Improve test fixtures
-   - Add tests for CLI operations
-   - Improve docstrings
-   - Refactor for better testability
+   - [ ] Improve test fixtures
+   - [ ] Add tests for CLI operations
+   - [ ] Improve docstrings
+   - [ ] Refactor for better testability
 
 3. **Long-term Objectives**
-   - Implement property-based testing
-   - Achieve 90%+ test coverage
-   - Complete documentation overhaul
-   - Integrate comprehensive CI checks
+   - [ ] Implement property-based testing
+   - [ ] Achieve 90%+ test coverage
+   - [ ] Complete documentation overhaul
+   - [ ] Integrate comprehensive CI checks
+
+## Next Steps
+
+1. **Create Pydantic Models**
+   - Create base models for RawConfigDict and ConfigDict
+   - Add validators for required fields and constraints
+   - Implement serialization and deserialization methods
+
+2. **Update Validation Logic**
+   - Replace manual validators with Pydantic validators
+   - Integrate Pydantic error handling with existing exceptions
+   - Update validation tests to use Pydantic models
+
+3. **Update Config Processing**
+   - Update config processing to use Pydantic models
+   - Ensure backward compatibility with existing code
+   - Add tests for model-based config processing
 
 ## Metrics and Success Criteria
 
 - [ ] **Type Safety**
   - [ ] Pass mypy in strict mode with zero warnings
   - [ ] 100% of functions have type annotations
-  - [ ] No usage of `Any` without explicit justification
+  - [ ] All configuration types defined as Pydantic models
 
 - [ ] **Test Coverage**
   - [ ] Overall test coverage > 90%
@@ -139,5 +175,5 @@ This document outlines the tasks needed to improve the test coverage, type safet
 
 - [ ] **Documentation**
   - [ ] All public APIs documented
-  - [ ] All complex types documented
+  - [ ] All Pydantic models documented
   - [ ] Examples for all major features