Skip to content

Detecting fraudulent car insurance claims using classification models and machine-learning techniques with resampling methods to handle imbalanced data. Built for a student project.

License

Notifications You must be signed in to change notification settings

stat-by-tish/car-insurance-fraud-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸš— Car Insurance Fraud Detection

This project aims to detect fraudulent car insurance claims using machine learning classification techniques. The dataset consists of various categorical and numerical variables representing customer, vehicle, and claim characteristics.

🧠 Objectives

  • Identify potential fraudulent claims in a binary classification setup
  • Handle class imbalance using SMOTE and alternative resampling techniques
  • Compare model performance

πŸ› οΈ Tools & Techniques

  • R
  • Handling Imbalance: SMOTE, ROSE, Undersampling, Oversampling
  • Models: Logistic Regression, SVM, Classification Tree, XG Boost, Random Forest
  • Hyperparameter Tuning Methods: GridSearch with Cross Validation, Bayesian Optimization
  • Evaluation metrics: Accuracy, Precision, Recall, F1 Score, ROC-AUC
  • Model Interpretation: Partial Dependence Plots

πŸ“ Project Structure

  • /code/ – All R codes for preprocessing, modeling, and evaluation
  • /report/ – PDF report and presentation slides
  • /data/ – Includes the dataset

πŸ“„ License

Distributed under the MIT License. See LICENSE for more information.

About

Detecting fraudulent car insurance claims using classification models and machine-learning techniques with resampling methods to handle imbalanced data. Built for a student project.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages