- In this project, we analyzed the Breast Cancer dataset and built machine learning models to predict the survival status of patients based on various features.
- We preprocessed the data by handling missing values, encoding categorical variables, identifying and removing outliers, and selecting relevant features.
- We implemented the KNN algorithm from scratch and evaluated the performance of the models using accuracy, precision, recall, and F1 score metrics.
- We compared the performance of different classifiers, including Linear Regression, Naive Bayes, Decision Tree, Random Forest, and Gradient Boosting, using hyperparameter tuning with GridSearchCV.
- Random Forest and Gradient Boosting classifiers achieved the highest recall, and F1 score on the test set, indicating that they are suitable for predicting the survival status of breast cancer patients.
-
Notifications
You must be signed in to change notification settings - Fork 0
s-haider10/predicting_breast_cancer_survivability_ML
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published