Telco Default Prediction

Here, I explore some machine learning techniques for handling tabular data and deploy the resulting models with Flask.

Process

Literature

My experience lies mostly in computer vision and NLP, so I was not very familiar with machine learning and deep learning techniques for structured tabular data, although I was somewhat familiar with tree-based techniques. I therefore looked through the relevant literature and later attempted to implement some of these models.

The papers are collected in the "literature" folder.

Exploration and development of models

Using a Jupyter Notebook to document my findings and progress, I clean and explore the dataset, develop several models, and attempt to engineer features to improve performance. Specifically, I have implemented an XGBoost model, a model based on ResNet, and another based on the Transformer architecture.

All three models perform reasonably well, with classification accuracy, precision, recall and F1-score each at about 80%. I chose these metrics to give a holistic picture of classification performance.
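As a reminder of how these four metrics relate to each other, here is a minimal sketch in plain Python for the binary case. The function name `classification_metrics` and the toy labels are made up for illustration; they are not taken from the notebook, which may well use a library implementation instead.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for binary labels."""
    pairs = list(zip(y_true, y_pred))
    # Count the confusion-matrix cells with respect to the positive class.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Toy labels, invented for illustration only (1 = default, 0 = no default).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Reporting all four together matters when the classes are imbalanced: accuracy alone can look good even if the model rarely identifies the positive class.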

These are documented in the "Development" folder.

Deployment

I have deployed the models using Flask.
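For readers unfamiliar with this kind of deployment, the sketch below shows the general shape of serving a classifier behind a Flask endpoint. It is not the code in main.py: the `/predict` route, the `records` field, and the `PlaceholderModel` class are all assumptions made for illustration, and the placeholder simply predicts class 0 where a trained model would be loaded.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


class PlaceholderModel:
    """Hypothetical stand-in for a trained classifier; always predicts 0."""

    def predict(self, records):
        return [0 for _ in records]


# In a real deployment, a trained model would be loaded here instead.
model = PlaceholderModel()


@app.route("/predict", methods=["POST"])
def predict():
    # Expect a JSON body of the form {"records": [{...}, {...}]}.
    payload = request.get_json(silent=True)
    if not isinstance(payload, dict) or not isinstance(
        payload.get("records"), list
    ):
        return jsonify(error="expected a JSON body with a 'records' list"), 400
    return jsonify(predictions=model.predict(payload["records"]))
```

Calling `app.run()` starts Flask's development server, after which a client can POST a JSON list of records to `/predict` and receive one prediction per record.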

How to

Using Docker

To get this up and running, download the repo and go to the docker folder. Then, in your terminal, run:

sudo docker build -t build .

This should install the relevant dependencies. The main program can be run from the main.py file in the "Deployment" folder.

Manual (in case Docker fails)

Download all the files in the repo. You can find the necessary dependencies under the Docker folder, in requirements.txt. The entire application is implemented in Python. I recommend installing Anaconda and running it from a virtual environment.

Install each of the dependencies with:

pip3 install <package>

You can then run the application from the "Deployment" folder by calling

python3 main.py
