This GitHub repository contains code for predicting breast cancer using the Breast Cancer Wisconsin (Diagnostic) Data Set from Kaggle. The prediction is done using a logistic regression model, and the analysis includes data visualization using Seaborn and data manipulation using Pandas.
The Breast Cancer Wisconsin (Diagnostic) Data Set contains features of the breast of the Patient and their corresponding diagnosis. The dataset includes a total of 569 instances, each described by 30 numeric attributes. The target variable indicates whether the tumor is malignant (M) or benign (B).
- Logistic Regression
- Decision Tree
- Random Forest
- Linear Regression
- SVM (Support Vector Machine)
To run the code in this repository, you need to have Python installed along with the following libraries:
- Pandas
- Seaborn
- Scikit-learn
- Matplotlib