This project is made using Django framework and machine learning classification algorithm (Naive bayes algo). This project detects whether mail is spam or not.
- Data gathering
- Data preprocessing
- Machine Learning
- Model evaluation
- Predictive Modelin
- Python
- Numpy
- Pandas
- jupyter
- joblib
- HTML
- CSS
- JavaScript
- Django
This is project based on classification ml algorithm.
Dataset used by this project is 'spam.csv' downloaded from kaggle. After downloading and importing dataset(in jupyter Notebook) i did data mapping like ham=0 and spam=1 and created one column 'sapm'.
After that i just divided my dataset into dependent and independent features. (Independent features=Message | Dependent feature=Spam) and split into training and testing data.
After that i did text preprocessing for that i used 'CountVectorizer'.
After that i created 'Multinomial naive bayes' model and used pipline.
I got 97% testing accuracy.
- frontend developers
- data exploration/descriptive statistics
- data processing/cleaning
- statistical modeling
- writeup/reporting
- Feel free to contact me any questions or if you are interested in contributing!