Replies: 1 comment 1 reply
-
Hi, how many unique values are there for columnA, and what is the dtype of it? Regards, Maarten Breddels |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, sorry for all these questions. Really appreciate the help I've gotten here.
I was able to read multiple HDF5 files into a single vaex dataframe containing 1.12 billion rows. However, when I tried to do a
groupby
operation on the entire dataframe, the kernel dies.I tried decreasing the size of the dataframe and found that when I'm trying to access beyond the 4 million-th row of the dataframe, the kernel usually dies. But the operation is always successful for the first 4 million rows.
It does not seem related to the size of the dataframe, because the kernel fails when I'm trying to read 10 random rows beyond the 4 million-th row, e.g.
df[9001000:9001010]
, though there are a few times when reading some other 10 rows is successful. Basically, when reading beyond the 4 million-th row, the kernel dies about 70% of the time, but operation is always successful when reading up to the 4 million-th row. I'm not sure why this is happening.Thank you.
Beta Was this translation helpful? Give feedback.
All reactions