I have a list of about 100M+ elements (currently sorted) that I want to randomize (shuffle) AND chunk/split into smaller lists (about 50K buckets). What's the best approach to do this in terms of maximizing speed?
I'm open to any libraries or languages (currently using node and python for the project) if they have fast pre-built methods. Thanks!
P.S. This isn't just a theoretical exercise, I'm trying to figure this out for my internship since we'll be running another script in parallel using about 50K Digital Ocean nodes that takes the smaller lists as an input.