0

During the mapreduce job, the individual tasks will be running on random nodes, Is there any way to restrict the nodes in which the tasks should run ?

SachinJose
  • 8,462
  • 4
  • 42
  • 63

1 Answers1

0

Hadoop doesn't pick the nodes to run the tasks on random. Data locality is considered or else there would be a lot of network overhead.

There is no affinity between the tasks and the nodes on which they run. Hadoop doesn't offer any such functionality.

Praveen Sripati
  • 32,799
  • 16
  • 80
  • 117