Collection.partition in Spark

Asked Mar 15 '16 at 01:28

Active Mar 15 '16 at 01:28

I often write this helper to split an RDD into two by a predicate, akin to the partition method on collections in the Scala standard library.

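import org.apache.spark.rdd.RDD

// Each filter is a separate pass over xs, so cache xs first if it is expensive to recompute.
def partition[T](xs: RDD[T], predicate: T => Boolean): (RDD[T], RDD[T]) = {
  (xs.filter(predicate), xs.filter(!predicate(_)))
}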

I was never able to find such a method in the Spark API. Does it exist?
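
For illustration, here is a minimal sketch of how such a helper might be used. It assumes a local SparkContext; the object name PartitionExample, the app name, and the sample data are hypothetical and only there to make the example self-contained. The source RDD is cached because the helper scans it twice, once per filter.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

// Hypothetical wrapper object, used only to make the sketch runnable.
object PartitionExample {
  // Same helper as in the question: two independent filter passes over xs.
  def partition[T](xs: RDD[T], predicate: T => Boolean): (RDD[T], RDD[T]) =
    (xs.filter(predicate), xs.filter(!predicate(_)))

  def main(args: Array[String]): Unit = {
    // Assumption: a local master is enough for demonstration purposes.
    val conf = new SparkConf().setMaster("local[*]").setAppName("partition-example")
    val sc = new SparkContext(conf)
    try {
      // cache() so the second filter pass reuses the data instead of recomputing it.
      val numbers = sc.parallelize(1 to 10).cache()
      val (evens, odds) = partition[Int](numbers, (n: Int) => n % 2 == 0)
      println(evens.collect().mkString(",")) // prints 2,4,6,8,10
      println(odds.collect().mkString(","))  // prints 1,3,5,7,9
    } finally {
      sc.stop()
    }
  }
}
```

The explicit type argument and the typed lambda avoid a "missing parameter type" error that Scala 2 can report when T has to be inferred from both arguments at once.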

Synesso

No, there's no such method in the Spark API. – Yuval Itzchakov Mar 15 '16 at 06:11

0 Answers