Simple question: I have a dataframe in dask containing about 300 mln records. I need to know the exact number of rows that the dataframe contains. Is there an easy way to do this?
When I try to run dataframe.x.count().compute()
it looks like it tries to load the entire data into RAM, for which there is no space and it crashes.