Skip to content

nurkholiqaganihafid/Kalbe_Nutritionals_Data_Scientist_Project

Repository files navigation

📌Kalbe_Nutritionals_Data_Scientist_Project

  • In this project, I use the Machine Learning Regression (Time Series) method to predict the number of daily sales of Kalbe products
    • The main purpose of this analysis is to help the inventory team make sufficient daily inventory
    • Helps find out every daily product sold
  • Apart from that, we also use the Machine Learning Clustering method to segment customers based on several criteria
    • The purpose of Clustering is to assist the marketing team in providing promotions and personal sales treatment to customers.

The data link contains each column's information: Case Study - Legend

🎯This is the Link to the Results of My Presentation

Presentation

My Youtube Channel: Presentation

🎯Let's Begin with the Results

📊Dashboard

My Tableau Profile: My Tableau

  • Exploratory Data Sales Analysis Dashboard

Exploratory Data Sales Analysis Dashboard

  • Daily and Monthly Sales Analysis Dashboard

Daily and Monthly Sales Analysis Dashboard

📈My Prediction Results

Data Preparation

  • Forecasting Data
df_forecast = df.groupby('date')[['qty']].sum()

Forecasting Data

  • Total Data from the Merge Results

Data Frame

Time Series Regression

  • Decomposition Analysis of Product Forecast

Decomposition Analysis of Product Forecast

  • Kalbe Daily Sales Data (Qty) within 1 Year

Rolling Statistics

  • Model ARIMA Prediction from Quantity Sold

Forecast Quantity Sold

  • Forecast of Product Quantity Sold in the Next 2 Months

Forecast of Product Quantity Sold in the Next 2 Months

  • Average Daily Sales Amount for Each Product in the Next 2 Months

Average Daily Sales Amount for Each Product in the Next 2 Months

  • 📚Conclusion and Recommendation
    • Conclusion
      • Based on the time series prediction results, the average number of products sold per day is 51.
      • The product with the highest sales is Thai Tea, with an average quantity of 8 per day, followed by Choco Bar and Ginger Candy, with an average quantity of 7 per day.
      • The product with the lowest sales is cashew, with an average of 2 per day.
    • Recommendation
      • Increasing the daily stock of Thai Tea, Choco Bar and Ginger Candy products to meet high customer demand.
      • Additionally, the lowest-selling products, such as Cashew, can be evaluated to determine whether there are opportunities to increase sales with marketing strategies or changes in product packaging or presentation.

K-Means Clustering

  • Data Frame Cluster
df_cluster = df.groupby(['customer_id']).agg({ 'transaction_id': 'count',
                                              'qty': 'sum',
                                               'total_amount': 'sum'
                                               }).reset_index()

Data Frame Cluster

  • Elbow Method

Elbow Method

In the plot above, the best cluster is 3

  • Customer Clustering based on Quantity and Total Amount

Customer Clustering

  • Summary of Clusters

Summary of Clusters

  • 📚Conclusion and Recommendation
    • Conclusion
      • Based on the clustering prediction results, there are 3 customer segments.
      • Segment 0 has 135 customers with an average of 8 transactions per customer, 27 average items per transaction, and an average total purchase of Rp 229.389.
      • Segment 1 has 202 customers with an average of 11 transactions per customer, 41 average items per transaction, and an average total purchase of Rp 363.267.
      • Segment 2 has 107 customers with an average of 15 transactions per customer, 58 average items per transaction, and an average total purchase of Rp 525.432.
    • Recommendation
      • Segment 0: Customers in this segment have a low transaction frequency, a moderate number of items per transaction, and relatively low total purchases. The recommendation for this segment is to provide special promotions or discounts to encourage customers to make more transactions.
      • Segment 1: Customers in this segment have a moderate frequency of transactions, a high number of items per transaction, and a moderate total purchase. The recommendation for this segment is to provide promotions that focus on increasing the number of items per transaction, such as product bundling or special offers for purchases of a certain amount.
      • Segment 2: Customers in this segment have high transaction frequency, a high number of items per transaction, and high total purchases. The recommendation for this segment is to provide promotions that focus on customer rewards and recognition, such as loyalty programs or special discounts for loyal customers.

Back to top

About

Machine Learning Project: Creating a Regression Model and Clustering

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published