GitHub - foofx88/Machine_Learning_Models: Demonstrating different Machine Learning Model

Machine Learning Demo - Exoplanet Exploration

Upon exploring the dataset - exoplanet_data.csv. There are various columns in the CSV file that might not make a lot of sense at first, after understanding what each data column means through NESI Archive

In this challenge, we will be utilizing 4 different models (Logistic Regression, K Nearest Neighbour, Random Forest and Deep Learning). The data is initially cleaned before input to the models by dropping any N/A values. Then the required X and Y are identified.

Data preprocessing

Train Test Split is created first, then data preprosessing starts from scaling the data using MinMaxScaler.

The X values use all columns except for 'koi_disposition' which is reserved for Y. 'koi_disposition' is used as the decider for the dataset as to confirm if an exoplanet can be categorized as 'Confirmed','Candidate','Falsed Positive'.

Because the data in column 'koi_disposition' are of a string value and is classified as categorical data, it needs to be converted into meaningful numbers in order to be used by Machine learning algorithms.

Models Comparison

Looking at the results from the 4 models. Random Forest had the best score with the testing data score at 91%.Deep learning followed with a close 90% whereas the other 2 models LR and KNN did alright but still not as good. It made sense that Random Forest performed the best here as there are 40 columns worth of data and each column could be the decider to predict a new Exoplanet Other than that, as the end result is to classify a newly discovered exoplanet, Random Forest works best with classification. By using all the features (X) in the dataset, it randomly creates decision trees to predict results from each decision trees and only the prediction with the maximum votes are presented as the final prediction.

Hence, that is why with this dataset and its required results, Random Forest would be the best model to do the predictions

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.ipynb_checkpoints		.ipynb_checkpoints
saved models		saved models
snips		snips
LogisticRegression.ipynb		LogisticRegression.ipynb
README.md		README.md
deep_learning.ipynb		deep_learning.ipynb
deep_learning_model.h5		deep_learning_model.h5
exoplanet_data.csv		exoplanet_data.csv
knn.ipynb		knn.ipynb
randomforest.ipynb		randomforest.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Demo - Exoplanet Exploration

Data preprocessing

Models Comparison

About

Releases

Packages

Languages

foofx88/Machine_Learning_Models

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Demo - Exoplanet Exploration

Data preprocessing

Models Comparison

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages