Data Science Portfolio

Project 1: Breast Cancer Analysis: Project Overview

Objective

Classifying the breast cancer type: Malignant and Benign using real-valued features such as radius, texture, perimeter etc.

Methodology

PART 1

Data cleaning and exploration is done before applying the machine learning models. KNN (1NN, 3NN) is used to classify the two class for the dataset and their accuracy has been recorded. Futher, Linear discriminant classification and FDA is applied and compared with the KNN model and the detailed observations are noted in the report.

PART 2

Unsupervised machine learning algorithms such as PCA and K-Means (k=2, 3, 5) clustering is applied to the dataset. Davies Bouldin Index is calculated for the clusters to select the appropriate model. The results are compared and detailed observations are recorded in the report.

Learning Outcome

Understanding the mechanics of behind supervised and un-supervised algorithms for a simple two-classed dataset.

Project 2: Cryptocurreny time-series analysis: Project Overview

Dataset Obtained from the year 2018 to MAY 2021:

Objective

To do EDA of the time series and to find out the correlation between several cryptocurrencies. Finding out whether Tesla stock price has an effect on cryptocurrencies. Trying to find a way to predict the stock price of cryptocurrencies in he future.

Results are yet to be documented

Project 3: PUBG’s Downfall and E-Sports Industry

Abstact

This project focus on widening the active gamers count for an e-Sports title. This study gives strategies to the game developers to make the game more interesting and to the new gamers entering an E-Sports title by giving them an introductory guide. The main objective is to address the skill gap between players and giving satisfaction to the gamers while they are gaming.

Proposed Idea

Interactive Visualizations

Future Scope and Conclusion

Results are documented in GitHub.

Project 4: Finanical Serives: Analysis on CAPEX and OPEX of a company’s software investment

Objective

Report

Project 5: New York Airbnb Data Exploration: Project Overview

Since 2008, guests and hosts have used Airbnb to expand on traveling possibilities and present more personalized ways of experiencing the world. This dataset contains information on 2019 listings in New York and its geographical information, prices, number of reviews, and more.

Project 6: Electric Load Forecasting: Project Overview

Objective

We are very much dependent to the Electricity. It is our reponsibility to prevent from a catastrophic power failure. To prevent this we need to do some precautionary measures.

Solution

Clustering the Household with respect to their Electric load consumption

Forecasting Electric Load in order to be ready for the worst case scenario