I have some binary files that are images and I would want to go through themselves, distributing the pixels : each node of my cluster must get the RGB of a different group of pixel(s) than another node's ones, and store these RGB into a Scala collection.
I am using SparkContext::binaryFiles
but I don't know how to make Apache Spark "able to understand" that I am using an Image, that I would want to go through its pixels using distribution, and that I would want to get the RGB values. Could you help me to do that please ?