27

I would like to start learning about the big data technologies. I want to work in this area in the future. Does anyone know good books to start learning about it? Hadoop, HBase.

Beginner - intermediate - advanced -

Thanks in advance

Gunter Amorim
  • 77
  • 1
  • 5
  • 14
  • 1
    Welcome to StackOverflow! Asking about book recommendations are generally not considered good question material. You can read more about that [here](http://meta.stackexchange.com/questions/113678/how-to-request-book-recommendations), so I'm voting to close this question based on that. – Christofer Eliasson Nov 09 '12 at 14:32
  • 3
    @ChristoferEliasson it is quite surprising that this is closed , whereas this one http://stackoverflow.com/questions/388242/the-definitive-c-book-guide-and-list is not. – g24l Nov 16 '15 at 13:35

5 Answers5

12

I think a good start for beginner will be the Big Data course from Coursera

For example I've learnt the basics of MapReduce techonlology.

megas
  • 21,401
  • 12
  • 79
  • 130
12

How about Hadoop: The Definitive Guide, from O'Reilly Media. It covers everything to do with Hadoop, MapReduce, HDFS and more.

rsp
  • 23,135
  • 6
  • 55
  • 69
6

Besides the Cloudera resources I'd highly recommend you the reference books from O'Reilly :

  • Hadoop: The Definitive Guide
  • Programming Pig
  • Programming Hive
  • HBase: The Definitive Guide

You might also check it's data science kit as well.

Lorand Bendig
  • 10,630
  • 1
  • 38
  • 45
2

I would suggest to learn machine learning alongside the technology part https://www.coursera.org/course/ml. Learning statistics is also very important.

David Gruzman
  • 7,900
  • 1
  • 28
  • 30
2

If you are interested in Hive and Pig there are also more specialised books about these technologies:

cic
  • 7,310
  • 3
  • 23
  • 35
Samuel Kerrien
  • 6,965
  • 2
  • 29
  • 32