Playground Series - Season 4, Episode 3: EDA/Modelling for Multi-Class Prediction of Steel Plate Defects

This repository contains a Jupyter notebook that walks through exploratory data analysis (EDA) and modeling for multi-class prediction of steel plate defects. The notebook was written for Kaggle's Playground Series, Season 4, Episode 3.

Kaggle Notebook

Overview

The notebook is structured into six main parts:

Data loading and first exploration
Target analysis
EDA and data preparation
Modeling
Explainability
Preparation of the submission

Part 1: Data loading and first exploration

This part loads the data and takes a first look at the dataset's structure and features.
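
A minimal sketch of what such a first look might involve, assuming pandas. The helper name `first_look` and the column names are hypothetical, and a tiny in-memory frame stands in for train.csv so the snippet runs on its own; the notebook itself may do this differently.

```python
import pandas as pd


def first_look(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column: dtype, missing count, and unique count."""
    return pd.DataFrame(
        {
            "dtype": df.dtypes.astype(str),
            "n_missing": df.isna().sum(),
            "n_unique": df.nunique(),
        }
    )


# Stand-in data (hypothetical columns); with the real file this would be
# something like pd.read_csv("train.csv").
demo = pd.DataFrame({"id": [0, 1, 2], "feature_a": [1.5, None, 3.0]})
summary = first_look(demo)
print(demo.shape)
print(summary)
```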

Part 2: Target analysis

This part examines the distribution and characteristics of the target variables.
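
The README does not say how the targets are encoded. The sketch below assumes one binary indicator column per defect type (an assumption, with placeholder column names) and shows two checks that are commonly useful in that setting: how often each defect occurs, and how many defect labels each row carries.

```python
import pandas as pd

# Hypothetical stand-in target columns; the names are placeholders.
targets = pd.DataFrame(
    {
        "defect_a": [1, 0, 0, 0],
        "defect_b": [0, 1, 0, 0],
        "defect_c": [0, 0, 0, 1],
    }
)

# Share of rows flagged for each defect type.
prevalence = targets.mean().sort_values(ascending=False)

# Number of defect labels per row (0 = none flagged, 2+ = several flagged).
labels_per_row = targets.sum(axis=1).value_counts().sort_index()

print(prevalence)
print(labels_per_row)
```

If some rows carry zero or several labels, the problem is not strictly single-label multi-class, which affects the choice of objective and metric.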

Part 3: EDA and data preparation

This part explores the relationships between features and prepares the data for modeling.
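
One common way to look at relationships between features is a correlation scan. The following is a sketch only, assuming pandas and NumPy; the helper `highly_correlated_pairs`, the 0.9 threshold, and the synthetic columns are all hypothetical and not taken from the notebook.

```python
import numpy as np
import pandas as pd


def highly_correlated_pairs(
    df: pd.DataFrame, threshold: float = 0.9
) -> list[tuple[str, str, float]]:
    """List feature pairs whose absolute Pearson correlation exceeds threshold."""
    corr = df.corr().abs()
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            if corr.loc[a, b] > threshold:
                pairs.append((a, b, float(corr.loc[a, b])))
    return pairs


# Synthetic stand-in: x_scaled is a linear function of x, noise is independent.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
demo = pd.DataFrame({"x": x, "x_scaled": 2 * x + 1, "noise": rng.normal(size=200)})
pairs = highly_correlated_pairs(demo)
print(pairs)
```

Tree-based models such as XGBoost tolerate correlated features, so a scan like this mostly helps with interpreting importance scores later on.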

Part 4: Modeling

The model is XGBoost. This part focuses on optimizing its hyperparameters and evaluating its performance.

Part 5: Explainability

SHAP values are used to explain the model's predictions and to assess feature importance.

Part 6: Preparation of the submission

The final model predictions are prepared for submission, with ensembling used to improve performance.
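
The README does not name the ensembling strategies used. One simple option is a weighted average of predicted probabilities, sketched below under that assumption. The helper `blend`, the weights, the prediction arrays, and the column names are all hypothetical.

```python
import numpy as np
import pandas as pd


def blend(predictions: list[np.ndarray], weights: list[float]) -> np.ndarray:
    """Weighted average of same-shaped probability arrays."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalise so the blend stays within [0, 1]
    stacked = np.stack(predictions)  # (n_models, n_rows, n_targets)
    return np.tensordot(w, stacked, axes=1)


# Hypothetical predictions from two models for 2 rows and 2 target columns.
model_a = np.array([[0.2, 0.8], [0.6, 0.4]])
model_b = np.array([[0.4, 0.6], [0.2, 0.8]])
blended = blend([model_a, model_b], weights=[3, 1])

# Assemble a submission-style frame with an id column first.
submission = pd.DataFrame(blended, columns=["target_1", "target_2"])
submission.insert(0, "id", [100, 101])
# submission.to_csv("submission.csv", index=False)
print(submission)
```

Weights are usually chosen from validation scores; equal weights are a reasonable default when the models perform similarly.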

For the full details and code implementation, please refer to the notebook in this repository.

Repository files

__results___files
LICENSE
README.md
steel-plate-eda-xgboost-is-all-you-need.ipynb
target.csv
test.csv
train.csv
xgb_submission.csv

hardikjp7/SteelPlate-Multiclass-EDA-Modeling

Folders and files

Latest commit

History

Repository files navigation

Playground Series - Season 4, Episode 3: EDA/Modelling for Multi-Class Prediction of Steel Plate Defects

Kaggle Notebook

Overview

Part 1: Data loading and first exploration

Part 2: Target analysis

Part 3: EDA and data preparation

Part 4: Modeling

Part 5: Explainability

Part 6: Preparation of the submission

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages