I am a beginner in Hadoop. Can anyone help me read JSON in a MapReduce job?

I have googled and found that Jaql is suitable for reading JSON, but I did not find any documentation on how it could be used in a MapReduce job.

Is there any other framework that supports reading JSON in MapReduce?

Any suggestions on this?

Thanks in advance.

user1585111

1 Answer

I would rather rely on the MapReduce framework itself to handle this. MapReduce allows us to write custom Input/Output Formats to handle data that it does not support OOTB, like JSON. See this question for an example. I prefer this approach because it doesn't require any third-party libraries; it's just a matter of extending the MapReduce API. (But that's just my choice. Others may find something else more suitable.)
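
To give a rough idea of what such a custom input format has to do, here is a minimal sketch of the record-splitting step only, written in plain Java so it runs without Hadoop on the classpath. The class name JsonRecordSplitter and its methods are hypothetical, not part of any Hadoop API. The assumption is that the input is a sequence of top-level JSON objects, possibly spanning several lines; in a real job this logic would sit inside a RecordReader, which would also have to deal with HDFS split boundaries (not handled here).

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not a Hadoop class): cuts a character stream into
// top-level JSON objects by tracking brace depth outside string literals.
class JsonRecordSplitter {
    private final Reader in;

    JsonRecordSplitter(Reader in) {
        this.in = in;
    }

    // Returns the next complete top-level {...} object, or null at end of input.
    String nextRecord() throws IOException {
        StringBuilder record = new StringBuilder();
        int depth = 0;            // current nesting level of braces
        boolean inString = false; // true while inside a JSON string literal
        boolean escaped = false;  // true right after a backslash inside a string
        int c;
        while ((c = in.read()) != -1) {
            char ch = (char) c;
            if (depth == 0 && ch != '{') {
                continue; // skip whitespace, commas, or brackets between records
            }
            record.append(ch);
            if (inString) {
                if (escaped) {
                    escaped = false;
                } else if (ch == '\\') {
                    escaped = true;
                } else if (ch == '"') {
                    inString = false;
                }
            } else if (ch == '"') {
                inString = true;
            } else if (ch == '{') {
                depth++;
            } else if (ch == '}') {
                depth--;
                if (depth == 0) {
                    return record.toString();
                }
            }
        }
        return null; // end of input; a trailing incomplete object is dropped
    }

    // Convenience wrapper: splits a whole string into its JSON object records.
    static List<String> splitAll(String text) {
        List<String> records = new ArrayList<>();
        JsonRecordSplitter splitter = new JsonRecordSplitter(new StringReader(text));
        try {
            String record;
            while ((record = splitter.nextRecord()) != null) {
                records.add(record);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return records;
    }

    public static void main(String[] args) {
        // Two records; the second contains a brace inside a string value.
        String input = "{\"a\":1}\n{\"b\":{\"c\":\"}\"}}";
        for (String record : splitAll(input)) {
            System.out.println(record);
        }
    }
}
```

Each string returned by nextRecord() would then be handed to the mapper as one value and parsed there with whatever JSON parser you choose. If your files already hold exactly one JSON object per line, this step is probably unnecessary, since the default line-based input already delivers one record per map() call.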

But the easiest way, IMHO, would be to use Hive or Pig to handle JSON data. You don't have to do much to make it work, as both of these projects have OOTB JSON support. See this for the Hive JSON SerDe and this for Pig's JsonLoader and JsonStorage.

HTH

Tariq