This repository was archived by the owner on Jan 12, 2024. It is now read-only.
Add tests for pudl.sqlite and ferc1.sqlite #76
Open
Labels
- inframundo
- sqlite: A file based relational database that we use for distributing much of the PUDL data.
- testing: Automated software testing and data validation often done with CI / GitHub Actions
Description
Right now CI only tests whether the EPA CEMS parquet data is working, but we've included the `pudl.sqlite` and `ferc1.sqlite` databases in the manifest as well, so they also need to be tested.
Messing around with the `v2022.11.30` data, I found a variety of issues with some tables in the PUDL DB, and none of the data in the ferc1 DB was accessible, so there's work to be done here. I've implemented just the most basic tests as an example of some of these problems in #75 and marked the ones that aren't working with `xfail`.
Some potential tests to implement
- Check that `urlpath` to `pudl.sqlite` looks reasonable.
- Check that `urlpath` to `ferc1.sqlite` looks reasonable.
- Check that a few expected tables exist in `pudl.sqlite`.
- Check that a few expected tables exist in `ferc1.sqlite`.
- Check that the number of tables in `pudl.sqlite` is at least some minimum.
- Check that the number of tables in `ferc1.sqlite` is at least some minimum.
- Read a table from `pudl.sqlite` and check that it has a reasonable shape and contents.
- Read a table from `ferc1.sqlite` and check that it has a reasonable shape and contents.
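
As a starting point, the checks listed above could be sketched roughly as follows. This is only an illustration using the standard library `sqlite3` module against a local copy of a database file; it does not go through the catalog itself. The helper names (`urlpath_looks_reasonable`, `list_tables`, `check_tables`, `table_shape`), the set of allowed URL schemes, and any table names or thresholds passed in are all assumptions made for the example, not values taken from the actual catalog or databases.

```python
import sqlite3
from contextlib import closing
from urllib.parse import urlparse

# Hypothetical set of acceptable URL schemes ("" means a plain local path).
ALLOWED_SCHEMES = {"", "file", "https", "gs", "s3"}


def urlpath_looks_reasonable(urlpath: str, filename: str) -> bool:
    """Return True if urlpath uses an allowed scheme and ends with filename."""
    parsed = urlparse(urlpath)
    return parsed.scheme in ALLOWED_SCHEMES and parsed.path.endswith(filename)


def list_tables(db_path: str) -> list[str]:
    """Return the names of all tables in the SQLite file, sorted by name."""
    with closing(sqlite3.connect(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]


def check_tables(db_path: str, expected_tables: list[str], min_tables: int) -> None:
    """Assert that the expected tables exist and the table count meets a minimum."""
    tables = list_tables(db_path)
    missing = sorted(set(expected_tables) - set(tables))
    assert not missing, f"missing tables: {missing}"
    assert len(tables) >= min_tables, f"only {len(tables)} tables found"


def table_shape(db_path: str, table: str) -> tuple[int, int]:
    """Read a whole table and return (number of rows, number of columns)."""
    with closing(sqlite3.connect(db_path)) as conn:
        # The table name is quoted because it cannot be passed as a SQL parameter.
        cursor = conn.execute(f'SELECT * FROM "{table}"')
        n_cols = len(cursor.description)
        n_rows = len(cursor.fetchall())
    return n_rows, n_cols
```

In a pytest suite, each helper would presumably be called once per database, for example via `pytest.mark.parametrize` over `pudl.sqlite` and `ferc1.sqlite`, with the expected table names and minimum counts chosen by someone who knows the real schemas. Reading the full table in `table_shape` is fine for small tables but would likely need a row limit for large ones.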