I currently am trying to modify the block size of the file splits in hadoop through the java file (no I don't want to modify it through the xml file) and the java file that I need to modify is called DFSOutputStream.java
(in the org.apache.hadoop.hdfs
package). In hadoop 2.7.1 this java file is located in a jar file called hadoop-hdfs-2.7.1.jar and when I extract it, it contains a class file called DFSOutputStream.class
. Once I decompile DFSOutputStream.class
I retrieve the DFSOutputStream.java
file and that is where I can modify the block sizes in the java code.
Because of using the eclipse maven plugin, the dependencies/imports for DFSOutputStream.java
works with no errors, all imports are detected and working. However, DFSOutputStream.java
can not resolve any of the variables with a variable declaration of a DataStreamer
type. Is there a class file I am missing so that this class variable (DataStreamer) work?
Once again, I don't think there is anything wrong with my dependencies because all my imports work without error. it's just that one variable, if I'm wrong, please do enlighten me.