How to specify the individual TaskTrackers (nodes) on which a MapReduce job should run?

Question

During a MapReduce job, the individual tasks run on seemingly random nodes. Is there any way to restrict the nodes on which the tasks run?

score 0 · Accepted Answer · answered Apr 12 '13 at 14:05

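Hadoop doesn't pick the nodes on which tasks run at random. It takes data locality into account: the scheduler tries to run each task on a node that already holds the task's input data, because otherwise there would be a lot of network overhead.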

There is no affinity between tasks and the nodes on which they run, and Hadoop doesn't offer any functionality for restricting tasks to particular nodes.
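To picture the locality preference mentioned above, here is a toy sketch in Python. It is not Hadoop code and does not use any Hadoop API; the function `pick_task`, the task ids, and the node names are all hypothetical, and the model is deliberately simplified (it ignores rack locality and slot limits). It only shows the idea that a node with free capacity is preferably handed a task whose input data it already stores.

```python
from typing import Dict, List, Optional


def pick_task(free_node: str, pending: Dict[str, List[str]]) -> Optional[str]:
    """Return the id of a pending task for free_node, preferring local data.

    pending maps a (hypothetical) task id to the hosts that hold a replica
    of that task's input split.
    """
    # First pass: look for a task whose input data is stored on this node.
    for task_id, hosts in pending.items():
        if free_node in hosts:
            return task_id
    # Fallback: hand out any pending task; its input is read over the network.
    return next(iter(pending), None)


# Hypothetical pending map tasks and the nodes holding their input blocks.
pending = {
    "map_0": ["node-a", "node-b"],
    "map_1": ["node-c"],
}

print(pick_task("node-c", pending))  # map_1 (data-local choice)
print(pick_task("node-d", pending))  # map_0 (no local data, first pending task)
```

Note that in this model, as in the answer, the job submitter never names a node: placement follows from where the data lives and which node asks for work.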

Praveen Sripati
