Skip to content

Commit 91a118c

Browse files
committed
improve partition docs
1 parent e027cca commit 91a118c

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

v03_pipeline/lib/reference_datasets/splice_ai.py

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,8 +20,10 @@ def get_ht(
2020
ht, checkpoint_path = checkpoint(ht)
2121
# The default partitions are too big, leading to OOMs.
2222
ht = ht.repartition(
23-
compute_hail_n_partitions(file_size_bytes(checkpoint_path)),
24-
shuffle=False,
23+
compute_hail_n_partitions(file_size_bytes(checkpoint_path)) * 1.5,
24+
# Note that shuffle=True here, since this is one of the few
25+
# cases in the pipeline where we want to increase the number
26+
# of partititons.
2527
)
2628
ht, _ = checkpoint(ht)
2729

0 commit comments

Comments
 (0)