-
|
Hi again,

The purpose of the function is to read files sequentially and write their data to the store along the appropriate dimension (the files are too large to load into memory all at once and then write the group in one big piece). The old version of the function, which used pure Zarr v3 with xarray, works fine. In this version I commit once after the function has run, but I have also tried a version that commits on each loop iteration. Both have the same problem, and so does a version that calls to_icechunk on each iteration. Any chance you can see where I'm going wrong?
-
|
Thanks for opening an issue. Are you able to make this a minimal reproducible example please? Can you also tell us the output of At the very least it would be good to know actual values for |
In that case I'd append along some other dimension that is chunked.
In my tests, Icechunk worked well with a chunk size of 1. With the current setup, sets and gets are interleaved, and each one holds the GIL while sending data across the Python-Rust bridge. This GIL contention is the slowdown you see.
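To make the set/get interleaving concrete, here is a rough counting sketch. It is a toy model, not Icechunk's actual behaviour: it assumes that appending one element into a partially filled chunk costs one get (to read the chunk back) plus one set, while appending into a fresh chunk costs only a set. The function name `count_store_ops` is hypothetical and exists only for illustration.

```python
def count_store_ops(n_appends: int, chunk_size: int) -> dict:
    """Count chunk gets and sets when appending one element at a time.

    Model assumption (not taken from Icechunk itself): writing into a
    partially filled chunk needs a get followed by a set, while writing
    the first element of a new chunk needs only a set.
    """
    gets = 0
    sets = 0
    for i in range(n_appends):
        offset_in_chunk = i % chunk_size
        if offset_in_chunk != 0:
            # The chunk already holds data, so it is read before being rewritten.
            gets += 1
        # Every append writes exactly one chunk back to the store.
        sets += 1
    return {"gets": gets, "sets": sets}


# Chunk size 1 along the append dimension: every append creates a new chunk.
print(count_store_ops(100, 1))   # {'gets': 0, 'sets': 100}

# Chunk size 10 along the append dimension: nine out of ten appends re-read a chunk.
print(count_store_ops(100, 10))  # {'gets': 90, 'sets': 100}
```

Under this model, a chunk size of 1 along the append dimension removes the gets entirely, which would fit the observation that a chunk size of 1 worked well.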