We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e027cca commit 91a118cCopy full SHA for 91a118c
v03_pipeline/lib/reference_datasets/splice_ai.py
@@ -20,8 +20,10 @@ def get_ht(
20
ht, checkpoint_path = checkpoint(ht)
21
# The default partitions are too big, leading to OOMs.
22
ht = ht.repartition(
23
- compute_hail_n_partitions(file_size_bytes(checkpoint_path)),
24
- shuffle=False,
+ compute_hail_n_partitions(file_size_bytes(checkpoint_path)) * 1.5,
+ # Note that shuffle=True here, since this is one of the few
25
+ # cases in the pipeline where we want to increase the number
26
+ # of partititons.
27
)
28
ht, _ = checkpoint(ht)
29
0 commit comments