Murari Goswami

188
reputation
1
7

Holds degree in Computer Technology and experienced with 9 years in building Enterprise DW/BI and Big data application. Experienced in migrating classical DW/ BI application to Big data architecture.

I have operated in Germany (Berlin, Nuremberg) , Great Britain (London), Singapore, India area working for Tier 1 business organisations in building enterprise DW and Big data applications.

Experienced in Big Data eco systems (Hadoop, Map Reduce, Cloudera Impala, Pig, Hive, HBase, Oozie, Yarn, Spark, Storm, Spark Mlib, Spark SQL). Implementation of Big data Lake architecture in a multi node clusters in Hadoop 2.0.

Experienced with RDBMS like postgreSQL, Oracle, MySQL, SQL Server and NoSQL - MongoDB with document Query Language. Columnar Database - SAP HANA

Specialties: • BIG Data eco system : o Data Mgmt : YARN, Impala Admission Control. o Data Access: Java Map Reduce, Cloudra Impala, Pig, Hive, Solr, Spark, HBase, Storm. o Governance & Integration: Sqoop, Flume, Kafka, Apache Drill, Apache Ambari, Nagios o Operations: Oozie, Zookeeper o Libraries: Spark SQL, Spark mlib. o NoSQL: Mongo DB, HBase • Hadoop cluster administration and managing HDFS Federation. • Conceptual and Physical Data Modelling for Big Data • Functional Programming using Scala and implementation of Spark streaming. • ETL Integration: Pentaho Data Integrator (KETTLE, SPOON) and Python ETL, • Map Reduce Design Patterns. • Java - Spring, Hibernate, REST, Collections - List, Tree, Hash Map, Set, advanced Generics. • Advanced Analytics, Data visualization - Dashboard, Customer Scorecard, Data Modelling, KPI
Unification. • Reporting - Tableau reporting and administration, Micro Strategy BI suite. • Experienced in implementing Snowplow event analytic in Amazon S3 with Amazon EMR.