@barneydobson (Collaborator) commented Oct 30, 2025

Efficient Data Loading
Adds a faster, unified Parquet save/load path while keeping backward compatibility with CSV. This wasn't done originally so that people could use the package without pandas, but that seems a poor reason to inconvenience everyone.
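For readers who have not seen the diff, a backward-compatible load path could look roughly like the sketch below. It is not the code in this PR: `save_table`, `load_table`, and the file-naming scheme are hypothetical, and it assumes pandas with a Parquet engine such as pyarrow is installed. The loader prefers a Parquet file and falls back to a legacy CSV with the same base name.

```python
from pathlib import Path

import pandas as pd


def save_table(df: pd.DataFrame, stem: Path) -> Path:
    """Write `df` next to `stem` as Parquet (hypothetical helper).

    Needs a Parquet engine (for example pyarrow) to be installed.
    """
    path = stem.with_name(stem.name + ".parquet")
    df.to_parquet(path, index=False)
    return path


def load_table(stem: Path) -> pd.DataFrame:
    """Load `<stem>.parquet` if present, otherwise fall back to `<stem>.csv`."""
    parquet_path = stem.with_name(stem.name + ".parquet")
    csv_path = stem.with_name(stem.name + ".csv")
    if parquet_path.exists():
        return pd.read_parquet(parquet_path)
    if csv_path.exists():
        # Legacy path: models saved before the Parquet format existed.
        return pd.read_csv(csv_path)
    raise FileNotFoundError(f"No Parquet or CSV file found for {stem}")
```

Checking for the Parquet file first means existing CSV-based models keep loading unchanged, while newly saved models take the faster path.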

Details

  • Unified Parquet format: store node and surface data in a single file for faster loading and easier model management.
  • Added a direct comparison test (including a zipped model file).
  • Added performance benchmarks to the tests.
  • Made date parsing more comprehensive to accommodate this.
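As an illustration of the date-parsing point, here is a minimal sketch of a more permissive parser. It assumes (this is a guess, not something stated above) that dates may arrive either as already-parsed datetime objects or as strings in several layouts. The function name `parse_date` and the `EXTRA_FORMATS` tuple are hypothetical and only the standard library is used.

```python
from datetime import datetime

# Hypothetical extra layouts to try after ISO 8601; adjust to the data at hand.
EXTRA_FORMATS = ("%d/%m/%Y", "%Y-%m")


def parse_date(value) -> datetime:
    """Return a datetime for `value`, accepting datetimes and common strings."""
    if isinstance(value, datetime):
        # Already parsed, e.g. read from a typed column.
        return value
    text = str(value).strip()
    try:
        # Handles "2000-01-01" and "2000-01-01 12:30:00".
        return datetime.fromisoformat(text)
    except ValueError:
        pass
    for fmt in EXTRA_FORMATS:
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    raise ValueError(f"Unrecognised date: {value!r}")
```

Trying the strict ISO parser first keeps the common case fast and unambiguous; the fallback formats are only consulted when it fails.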

@barneydobson barneydobson marked this pull request as ready for review October 30, 2025 15:33
@barneydobson barneydobson requested a review from dalonsoa October 30, 2025 15:39