I am exploring operation fusion capabilities of Spark and am curios if Spark can fuse a filter followed by a map into a single operation, e.g.
val names = sc.parallelize(List("Subhrajit Bhattacharya", "John Doe"))
val longNames = names.filter( x => x.length > 10)
val splitLongNames = longNames.map(x => x.split(" ").toList)
If so, what will the code for that function be ? Also is there any way of knowing which operations Spark is fusing ?
Thanks.