My Projects
NLP - Emergence Prediction Analysis
Natural language processing (NLP) project to predict cosine similarity between SEC 10K and YouTube data to estimate growth from emerging technolgies.
- NLP
- Python
- NLTK
- R
Big Data - Medical No-Shows
Big Data ML project to predict patient no-shows at clinics by building a Data Science pipeline on Databricks.
- Python
- Spark SQL
- Databricks
- MLlib
Malware Classification
Multi-level Classification (Unbalanced Data) project to predict if a given file is a malware and of what type (Level of quarantine required).
- Python
- TensorFlow
- CNN
- GCP
NLP - Reddit Content Analysis
Natural language processing project to analyse Reddit posts data based on various data processing methods like BOW, TFIDF etc.
- NLP
- Python
- Gensim
- NLTK
NLP - Kickstarter Projects Analysis
Natural language processing project where we perform similarity analysis on Kickstarter text data in different time frames.
- NLP
- Python
- Gensim
- NLTK
Tools
About Me
Experienced Data Science professional with 4+ years hands-on experience in statistical modelling and machine learning techniques. Exceptional understanding of descriptive and predictive analytics. I have extensively worked on data processing, interpreting complex and multidimensional datasets, database management, programming, problem solving, visualizations and reporting, making a true impact across various domains and industries.
My Resume