Interested in deriving insights from data and solving real-life big data challenges. Certified Hadoop Developer (HDPCD, http://bcert.me/sjjibqyq). Familiar with:
- Apache Hadoop (Pig, Hive, Sqoop, HCatalog)
- Apache Spark (PySpark, Java)
- Time Series Analysis (Python - statsmodels)
- Machine Learning (Python - H2O, scikit-learn)
- NoSQL (Apache Cassandra)
- Text Analytics (Python - spaCy, NLTK)