My Projects
NLP - Emergence Prediction Analysis
Natural language processing (NLP) project to predict cosine similarity between SEC 10K and YouTube data to estimate growth from emerging technolgies.
- NLP
- Python
- NLTK
- R

Big Data - Medical No-Shows
Big Data ML project to predict patient no-shows at clinics by building a Data Science pipeline on Databricks.
- Python
- Spark SQL
- Databricks
- MLlib

Malware Classification
Multi-level Classification (Unbalanced Data) project to predict if a given file is a malware and of what type (Level of quarantine required).
- Python
- TensorFlow
- CNN
- GCP

NLP - Reddit Content Analysis
Natural language processing project to analyse Reddit posts data based on various data processing methods like BOW, TFIDF etc.
- NLP
- Python
- Gensim
- NLTK

NLP - Kickstarter Projects Analysis
Natural language processing project where we perform similarity analysis on Kickstarter text data in different time frames.
- NLP
- Python
- Gensim
- NLTK

Tools







About Me
Experienced Data Science professional with 4+ years hands-on experience in statistical modelling and machine learning techniques. Exceptional understanding of descriptive and predictive analytics. I have extensively worked on data processing, interpreting complex and multidimensional datasets, database management, programming, problem solving, visualizations and reporting, making a true impact across various domains and industries.
My Resume