My Projects

NLP - Emergence Prediction Analysis

Natural language processing (NLP) project to predict cosine similarity between SEC 10K and YouTube data to estimate growth from emerging technolgies.

  • NLP
  • Python
  • NLTK
  • R
Project 1

Big Data - Medical No-Shows

Big Data ML project to predict patient no-shows at clinics by building a Data Science pipeline on Databricks.

  • Python
  • Spark SQL
  • Databricks
  • MLlib
Project 2

Malware Classification

Multi-level Classification (Unbalanced Data) project to predict if a given file is a malware and of what type (Level of quarantine required).

  • Python
  • TensorFlow
  • CNN
  • GCP
Project 3

NLP - Reddit Content Analysis

Natural language processing project to analyse Reddit posts data based on various data processing methods like BOW, TFIDF etc.

  • NLP
  • Python
  • Gensim
  • NLTK
Project 4

NLP - Kickstarter Projects Analysis

Natural language processing project where we perform similarity analysis on Kickstarter text data in different time frames.

  • NLP
  • Python
  • Gensim
  • NLTK
Project 5

Tools

About Me

Experienced Data Science professional with 4+ years hands-on experience in statistical modelling and machine learning techniques. Exceptional understanding of descriptive and predictive analytics. I have extensively worked on data processing, interpreting complex and multidimensional datasets, database management, programming, problem solving, visualizations and reporting, making a true impact across various domains and industries.

My Resume
Suryateja