This project aims to predict customer churn using various machine learning models to help telecom providers proactively retain their customers. Based on real-world data and methodologies from published research, we evaluated model performance and enhanced churn prediction with advanced techniques.
- Predict customer churn using models like Decision Trees, Random Forest, SVM, KNN, Naive Bayes, SGD, and ADAM.
- Address class imbalance using SMOTE.
- Perform Survival Analysis to estimate customer lifetime.
- Evaluate models using metrics: Accuracy, Precision, Recall, F1-Score.
- Identify key features influencing churn for better decision-making.
- Source: Telco Customer Churn Dataset - Kaggle
- Contains customer demographics, account info, and service usage.
- Decision Tree
- Random Forest
- K-Nearest Neighbors (KNN)
- Naive Bayes
- Support Vector Machine (SVM)
- Stochastic Gradient Descent (SGD)
- Multi-layer Perceptron (MLP) with ADAM Optimizer
- Data Preprocessing: Feature engineering, one-hot encoding
- Class Balancing: SMOTE
- Model Optimization: ADAM & SGD
- Evaluation: Confusion Matrix, ROC, Survival Plots
- Random Forest achieved ~98% accuracy and strong precision/recall.
- SMOTE significantly improved model fairness across classes.
- Survival analysis (Kaplan-Meier) helped estimate customer retention trends.
- Week 1β2: Literature Review and Base Paper Analysis
- Week 3β4: Dataset preprocessing & Exploration
- Week 5β6: Model Development & Base Paper Implementation
- Week 7β8: Optimization, Testing, and Final Evaluation
- Sharmila K. Wagh et al., Customer churn prediction in telecom sector using machine learning techniques, Results in Control and Optimization, 2024.
- Abhishek Gaur, Ratnesh Dubey, IEEE Conference 2018, DOI: 978-1-5386-5367-8/18.
- Telco Customer Churn Dataset - Kaggle
- Lifelines - Survival Analysis in Python
This project showcases the application of machine learning to real-world problems like customer churn in telecom. Using ensemble methods and data balancing techniques, we built a robust and reliable churn prediction model with actionable business insights.