I want to store some information of the files being processed from HDFS. What would be the most suitable way to read a file location and byte offset in a java program of a file stored in HDFS?
Is there concept of a unique file id being associated to each file stored in Hadoop 1? If yes, then how can it be fetched in a MapReduce program?