This project involves an in-depth analysis of a supply chain dataset focused on beauty products to uncover insights, identify trends, and visualize various aspects of the data. The primary objectives are to understand customer preferences, supplier performance, and key factors impacting different metrics within the supply chain.
- Overview
- Data Preparation
- Correlation Analysis
- Data Visualization
- Feature Importance Analysis
- Key Insights
- Usage
The project focuses on the analysis of a beauty products supply chain dataset to extract meaningful insights that can help in decision-making and optimization within supply chain management. Various visualizations are used to represent the findings, and a Decision Tree Regressor model is employed to identify the most impactful features on product cost and manufacturing lead time.
- Loading the Data: The dataset is loaded using
pandas
. - Initial Exploration:
- Display the first few rows and descriptive statistics.
- Check for missing values and confirm their absence.
- Encode categorical variables using one-hot encoding for further analysis.
- Pearson Correlation:
- Calculated to check for linear relationships between features.
- Filtered values to identify significant correlations.
- Spearman Correlation:
- Calculated to check for monotonic relationships between features.
- Filtered values to identify significant monotonic relationships.
-
Product Quantities by Location:
- Bar chart showing product quantities sold by different locations.
-
Distribution of Product Types by Customer Demographics:
- Pie charts displaying the distribution of product types among various customer demographics.
-
Defect Rates by Supplier:
- Bar chart comparing average defect rates among different suppliers.
-
Order Quantities by Routes and Transportation Modes:
- Stacked bar chart visualizing the distribution of order quantities across different routes segmented by transportation modes.
-
Average Lead Time by Supplier and Product Type:
- Bar chart analyzing the average lead time for each supplier by product type.
-
Manufacturing Costs by Product Type:
- Box plots showing the distribution of manufacturing costs for different product types.
-
Product Costs by Product Type:
- Box plots showing the distribution of product costs for different product types.
-
Shipping Costs vs. Order Quantities:
- Scatter plots exploring the relationship between shipping costs and order quantities for different carriers.
-
Revenue Generated by Customer Demographics:
- Pie chart showing the revenue generated by each customer demographic.
Using a Decision Tree Regressor to identify the top features that impact:
-
Product Costs:
- Determined the top 5 features affecting the cost of products.
-
Manufacturing Lead Time:
- Determined the top 5 features affecting the manufacturing lead time.
- Customer Preferences: Female customers primarily prefer skincare and cosmetic products, while male customers favor haircare products.
- Supplier Performance: Supplier 5 has the highest average defect rates, whereas Supplier 1 has the lowest.
- Order Distribution: Different transportation modes affect order quantities across various routes.
- Lead Time Analysis: Average lead time varies significantly by supplier and product type.
- Cost Analysis: Key features impacting product costs and manufacturing lead time identified using the Decision Tree Regressor.
Clone the repository:
git clone https://github.com/mishra-krishna/Analysis-and-Optimization-of-Supply-Chain-Operations
cd Analysis-and-Optimization-of-Supply-Chain-Operations
pip install -r requirements.txt
Beauty_Products_Supply_Chain_Analysis.ipynb