Lead Data Scientist with experience in designing, building, and shipping diverse AI/ML and data engineering solutions which include Large Scale IoT streaming Analytics/Data Pipelines, Large Scale Machine Learning, ASR, Recommendation System, Image semantics, Text semantics, Information extraction, Information retrieval, ML and heuristics-based image segmentation as well as MLOps leveraging MLFlow and Kubeflow with massive scalability and optimizations.
Well-versed in designing cloud-native horizontally scalable distributed system architecture leveraging queuing, consumer groups, etc, detecting bottlenecks, optimizing infrastructure costs as well as compute, and creating high throughput deployment patterns for serving machine learning and deep learning models in production.
Skills:
- Formulating a business problem, discover key use cases, and do a feasibility study in terms of business requirements, creating RFPs based on discovered key use cases
- Well-versed in the AWS ecosystem, python, microservices, machine learning, and deep learning
- Implementing and building a horizontally scalable data pipeline for the ML platforms incorporating various best practices
- Creating highly optimized, and scalable SOTA Deep Neural Network architectures for various use cases using both TensorFlow and PyTorch.
- Creating a horizontally scalable distributed environment for training and deployment of big, small, or huge ML and Deep Neural Network architectures
- Writing production-grade python systems for data management, processing, and machine learning.
- Have sound knowledge of git, docker, and basic IaC with terraform along with good understanding of project life-cycle
Past Achievements:
Won few ML competitions in the past (as solo): -- AIM Identify the author Challenge by Machine Hack (rank 2) -- ZS Young data scientist challenge 2018 by HackerEarth (rank 3) -- World data science challenge by Bitgrit (rank 4)
Kaggle Competitions Expert
Competitive Programming: -- Won 5 medals (2 silver and 3 bronze) at Hackerrank
B.Tech Double Gold Medalist in Academics
Writer @ Medium (https://mayank-k-jha.medium.com/)
Speaker - PyData, Kaggle Days, ACM Student Chapter