This project uses machine learning to detect fraudulent credit card transactions. The dataset used is highly imbalanced and contains only a small fraction of fraudulent transactions. This system is designed to help financial institutions identify suspicious activity effectively and minimize potential financial losses.
- Goal: Accurately classify credit card transactions as fraudulent or legitimate.
- Dataset: European credit card transactions (from Kaggle).
- Tech Stack: Python, Pandas, Scikit-Learn, Matplotlib, Seaborn
- Data preprocessing and cleaning
- Exploratory Data Analysis (EDA) and visualization
- Feature selection and scaling
- Model training using multiple ML algorithms
- Evaluation using accuracy, precision, recall, F1-score, and ROC-AUC
- Handling imbalanced data using SMOTE and undersampling techniques
- Confusion matrix and ROC curve visualization
Credit_Card_Fraud_Detection_System/
β
βββ data/ # Dataset files (CSV)
βββ models/ # Saved model files
βββ notebooks/ # Jupyter notebooks
βββ visuals/ # Plots and graphs
βββ fraud_detection.py # Main Python script
βββ requirements.txt # Dependencies
βββ README.md # Project documentation
- Confusion Matrix
- Accuracy
- Precision
- Recall
- F1 Score
- ROC-AUC Curve
- Logistic Regression
- Random Forest Classifier
- Decision Trees
- XGBoost
- Support Vector Machine (SVM)
- K-Nearest Neighbors (KNN)
- Integrate deep learning (LSTM or autoencoders)
- Real-time fraud detection with stream data
- Web interface or dashboard using Flask/Streamlit
- Kaggle - Credit Card Fraud Detection Dataset
- Scikit-learn, Matplotlib, Seaborn, Imbalanced-learn