I want to do a simple k-means with hadoop map reduce and python.
The mapper gets points and maps each point to its nearest center.
The reducer gets center as key and points as value and calculate a new center to the points.
But now , i need to gather all new centers from the reducers and give them in some way to the mapper at the next round.
How can i do it? I need to have a global array of centers for each map tasks.
What is the right way of doing it?