How can I optimize Z-ordering performance and resource usage for large DataFrames? #4648

Byunk · 2025-05-28T08:24:33Z

Byunk
May 28, 2025

I'm encountering OOM and "no space on disk" errors when running Z-ordering on large datasets because it appears to generate only a single task that utilizes just one executor. Is there a way to configure Z-ordering to run across multiple executors in parallel, or are there alternative approaches and best practices to distribute the workload and manage resource usage more effectively during Z-ordering operations?

Answered by Byunk

May 29, 2025

Sorry for dump question... The only need is repartition before optimization.

View full answer

Byunk · 2025-05-29T08:01:43Z

Byunk
May 29, 2025
Author

Sorry for dump question... The only need is repartition before optimization.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How can I optimize Z-ordering performance and resource usage for large DataFrames? #4648

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How can I optimize Z-ordering performance and resource usage for large DataFrames? #4648

Uh oh!

Byunk May 28, 2025

Replies: 1 comment

Uh oh!

Byunk May 29, 2025 Author

Byunk
May 28, 2025

Byunk
May 29, 2025
Author