Skip to content

Is it really recommended to save intermediate Dask results as netCDF? #8372

Answered by trexfeathers
maresb asked this question in Q&A
Discussion options

You must be logged in to vote

Intermediate compute() will achieve the same results, but depends on having enough memory available.

The advantage of saving/loading is if your array is larger than memory - it can be streamed chunk-by-chunk into a file, then streamed back out again, only using 1 chunk's worth of memory at a time.

I hadn't even considered that memory wouldn't be a concern 😂 It can be quite difficult to remember that some people only use Dask for parallelisation, and others only use it for larger-than-memory operations. Not everyone needs both. Makes it hard to think from each others' perspectives!

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@maresb
Comment options

@trexfeathers
Comment options

Answer selected by maresb
@dcherian
Comment options

@TomNicholas
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants