Suryateja Chalapati

Data Scientist | NLP | ML

Summary

Experienced data science and engineering professional with 7+ years of hands-on experience designing data-intensive applications using the Hadoop ecosystem and big data analytics, cloud data engineering (AWS, GCP), data visualization, data warehousing, reporting, data pipelines, and data quality solutions. Hands-on expertise with the data engineering stack, including Python and SQL, writing ETL jobs, and working with databases such as MySQL, SQL Server, Snowflake, MongoDB, and Cassandra. I have also built statistical and machine learning models and have an exceptional understanding of descriptive and predictive analytics. I have worked extensively on data preprocessing, interpreting complex and multidimensional datasets, database management, programming, problem solving, model deployment and maintenance, model optimization, and metrics, making an impact across various domains and industries.

Experience

Data Scientist @ XSELL Technologies (Jun 2022 - Present)

Summary: Developed Advanced Natural Language Processing Models, Pipelines and Metrics using Unstructured Data for Multiple High-Profile Clients

Data Scientist @ University of South Florida (Aug 2021 - May 2022)

Summary: Deployed Natural Language Processing Models based on Similarity and Semantic Analysis on Unstructured Data

Data Engineer @ University of South Florida (Jan 2021 - Jul 2021)

Summary: Built GCP Data Pipelines and Forecasted Company Growth from Emerging Technologies using Social Media and Financials - GitHub

Machine Learning Engineer @ Tampa General Hospital (Feb 2020 - Dec 2020)

Summary: Optimized Medical Clinic Operations by Building ML Models and Metrics

Junior Data Scientist @ Amazon (Sep 2017 - Jul 2019)

Data Engineer @ Mahindra & Mahindra (Aug 2016 - Aug 2017)

Summary: Built AWS Data Pipelines and Metrics for Production Forecasting and Optimized Process Flow to Achieve Significant Cost Savings

Data Analyst @ Mahindra & Mahindra (Jun 2015 - Jul 2016)

Summary: Built Tools and Methods for Optimizing Production Process Flow

Technical Skills

Programming Languages: Python, R, SQL, JavaScript, C#, HTML, CSS, ASP.NET Core MVC, SAS

Databases: Oracle, MySQL, SQL Server, MongoDB, Cassandra, Snowflake, ETL

Big Data & Deployment: Apache Hadoop, Hive, Impala, Spark, Spark SQL, MLlib, Databricks, AWS SageMaker, Docker, TensorFlow Serving

Visualization & Cloud Tools: Google Cloud (GCP), AWS, EC2, S3, Redshift, Microsoft Azure, Tableau, Power BI, Excel, VBA

Domain Experience: IT, E-Commerce, Healthcare & Automotive

Machine Learning: Supervised and Unsupervised Learning, Classification, Regularization, CNN, RNN, Anomaly Detection, K-NN, SVM, Naïve Bayes, Decision Tree, Random Forest, Keras, TensorFlow, Text Mining, Natural Language Processing (NLP)

Statistics: Descriptive and Inferential Statistics, Linear (OLS, GLM), Logistic and Poisson Regression, Hypothesis Testing, ANOVA, Survival Analysis, Mixed Models, Linear Mixed-Effects Models (LMER), A/B Testing, Data Science Pipelines, Time Series, APIs, Excel, Git

Education

Master of Science in Business Analytics and Information Systems (MS-BAIS) @ University of South Florida (Aug 2020 - May 2022)

Bachelor of Technology in Mechanical Engineering (MECH) @ GITAM University (Jul 2012 - May 2016)

Projects

Applied Data Scientist @ University of South Florida (Jan 2021 - May 2022)

Patient Turn-Up Rate at Clinics Analysis: Forecasted Patient Show-Up Rate at Medical Clinics - GitHub

(Data Science Pipelines, Spark SQL, Python, Spark, Spark MLlib, Databricks, Classification, Azure, ML Pipeline)

Multi-Class Malware Classification: Flagged Malicious Software into Multilevel Categories - GitHub

(Classification, Python, CNN, TensorFlow, Keras, Neural Networks, Unbalanced Data, GCP)

Multi-Level House Price Prediction: Matched Customers with Estimated Average House Pricing based on various KPIs - GitHub

(R, Tableau, Python, Multi-Level Regression, Linear Mixed-Effects Models (LMER), Statistical Modelling)

Let’s Connect

Thanks for stopping by. You can find my portfolio and other projects at the following links.

Email | Portfolio | LinkedIn | GitHub | Twitter