I just realized (after some empirical tests) that applying the limit
function on a Dataset
produces a new Dataset
with only 1 partition. How come ?
I can't find any related questions. And reading the source code on LocalLimit
and GlobalLimit
didn't provide any insight as I'm not familiar with the internals.
This can be problematic as one might want to use something like limit(1000000)
for whatever reason.