First Question: I want to know if the Splits change the blocks in any means (i.e. change size, shift the block to another location, create new blocks, ...).
Second Question: I think the splits doesn't change the blocks but it specifies where each MapTask should exist and run on the cluster for locality of data or rack awareness because the DataNodes are already running and are having the blocks so the splits I think will tell Hadoop to run the MapTask beside the node that contains the data. Notice: Inside the InputSplit there is the Location/Host which I think for this purpose. Please correct me if I am wrong
Third Questions: Initially before actually executing the task, will the blocks move to where the MapTask is or the MapTask will move to where the blocks are (i.e. Location of the DataNode)?