Phd Candidate, Data Scientist, subspecies Statistician. R, Scala, Python, Hadoop, Spark and whatever comes my way. Tends to write long detailed questions, here this is a plus.