I'm just starting out with Hive, and I have a question about input/output formats. I'm using
OpenCSVSerde, but I don't understand why, for text files, the input format is org.apache.hadoop.mapred.TextInputFormat
but the output format is org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat.
I've read this question, but it's still not clear to me why the input and output formats are different. Isn't that basically saying you're going to store data added to this table differently from the data that's read from the table?
Anyway, any help would be appreciated.