PROBLEM STATEMENT

Participants are expected to use ML models to predict welding defects in the materials by developing algorithms using the provided parameters. Participants are free to use any technique provided it is suited for the variety and volume of data provided. The end goal of this competition is to enable the Godrej Aerospace team to produce defect-free products every time.

PROJECT AIM :

Our project's aim is to do advanced analytics on welder’s details provided in the dataset and to build a model to predict defects before it occurs as per the input parameters.

Brief Walkthrough of work done :

● Cleaning and Preprocessing the data using necessary techniques.
● Getting insight from data by doing necessary EDA.
● Methods to determine welder’s performances:
○ Visualisation Techniques – plots like histplot, countplot ,box-plots ,sub-plots ,pair-plots, frequency distribution etc are used .
○ Statistical Techniques : pearson correlation , kendall correlation , p-values and heatmap for statistical visualisation like redundancy , confusion matrix .
● SWEET SPOT ANALYSIS :
○ Feature visualisation using various methods like U-Map and t-SNE for insight into how whole data is distributed over features .
○ We have also thought of an idea , sorting and dividing each feature column into 10-20 equal subsets and doing separate analysis to find correlations and zooming/analysing over subsets having higher proportion of defect and non defect in them and going iteratively to find the sweet spot of that feature .
● Using different technique to prepare data for training
○ Splitting data for training and testing .
○ Feature scaling (Standardisation like standard , quantile , robust scaler)the data to enable machine learning models to converge faster and also fit to distance based algorithms like svms , knns .
○ Dealing with an imbalanced dataset to prevent overfitting using techniques like gaussian noise upsampling , SMOTE .
● Training dataset and evaluating accuracy :
○ Fitting both original data and SMOTE generated data on various ML models like Decision Trees , Random Forest , Logistic Regression ,XGBoost , Gradient Boosting and SVMs also using cross validation techniques.
○ Obtaining Classification report from both data to get insight of overfitting .

CLASSIFICATION REPORT OF FEW ML MODELS :-

DATA PROVIDED

We have identified multiple parameters affecting the welding process and recorded the dataset from past activities. Process parameters such as ambient temperature, weld job temperature, humidity, voltage current, welding travel speed, shielding gas flow, and metal composition. For advanced analytics machine data, welder details are provided.

link to the data:-

https://docs.google.com/spreadsheets/d/1PifuLam1mMntrI0dPTWJosTYUWt8elwi/edit?usp=share_link&ouid=107098656021209623719&rtpof=true&sd=true

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
HA-225636_WeldRight_Abstract1.docx		HA-225636_WeldRight_Abstract1.docx
README.md		README.md
tim(5).ipynb		tim(5).ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PROBLEM STATEMENT

PROJECT AIM :

Brief Walkthrough of work done :

CLASSIFICATION REPORT OF FEW ML MODELS :-

DATA PROVIDED

link to the data:-

About

Releases

Packages

Languages

jha04amartya/WeldRight

Folders and files

Latest commit

History

Repository files navigation

PROBLEM STATEMENT

PROJECT AIM :

Brief Walkthrough of work done :

CLASSIFICATION REPORT OF FEW ML MODELS :-

DATA PROVIDED

link to the data:-

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages