Replies: 1 comment 3 replies
-
|
Is it the cell which constructs the DataTree object that is taking a while, or the cell which creates the |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm working with CMIP6 CORDEX data which has a bunch of different domains (
AR-44,EUR-11, etc), the usual experiments (historical,rcp85, etc) and a bunch of different models (CNRM-CERFACS-CNRM-CM5,ICHEC-EC-EARTH, etc).I've created virtual icechunk repos for each of these which reference all the netcdf files over time for all variables.
Then I'm opening xarray datasets from the icechunk repos and creating a datatree.
The construction of the datatree is taking a while (1.5 min on my machine) and I was thinking to speed it up using Dask, but running into pickling problems. That got me wondering whether I should have just used a single icechunk repo with groups for the CORDEX data.
As you call tell, I'm a bit confused. Maybe because it's Friday afternoon. 🤷
Any suggestions for how to construct this datatree more efficiently?
Beta Was this translation helpful? Give feedback.
All reactions