RFM Segmentation & Churn Prediction

Summary

This project analyzes transactional data from Pizza Hut Vietnam covering 2021 to 2023. Each record includes the transaction date, sales amount, customer ID, customer gender, voucher status, and province. The goal is to understand customer behavior, identify sales trends, segment customers, predict churn, and evaluate the effectiveness of the sales channels and strategies in use.

The project is divided into five main parts:

  1. Import Data, EDA and RFM Segmentation: The data is imported, cleaned, and explored. The exploration includes checking for missing values, duplicates, and outliers. The data is then segmented using RFM (Recency, Frequency, Monetary) analysis.

  2. Churn and CLV Prediction: Models are built to predict customer churn and Customer Lifetime Value (CLV). These models indicate which customers are likely to churn and how much each customer is potentially worth to the business.

  3. Final Model for CLV and Churn Probability: The models from the previous step are finalized and applied to different customer segments.

  4. Additional Insights: Further metrics are derived from the data, such as the retention rate and the number of new customers per month.

  5. Recommendations: Based on the analysis, recommendations are made to improve business performance.
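
As a rough illustration of the RFM step in part 1, the sketch below computes recency, frequency, and monetary values per customer and turns them into quartile scores. The column names (customer_id, transaction_date, sales_amount), the sample rows, and the compute_rfm helper are all hypothetical, and quartile scoring is one common convention rather than necessarily the scheme used in the notebook.

```python
import pandas as pd

# Made-up sample rows; column names are assumptions, not the real schema.
transactions = pd.DataFrame({
    "customer_id": [1, 1, 2, 3, 3, 3, 4, 5],
    "transaction_date": pd.to_datetime([
        "2023-01-05", "2023-11-20", "2021-03-14", "2022-06-01",
        "2023-09-09", "2023-12-15", "2022-01-10", "2023-12-30",
    ]),
    "sales_amount": [120.0, 80.0, 45.0, 200.0, 150.0, 90.0, 60.0, 300.0],
})


def compute_rfm(df, snapshot_date=None, n_bins=4):
    """Return one row per customer with R, F, M values and 1..n_bins scores."""
    if snapshot_date is None:
        # Measure recency from the day after the last observed transaction.
        snapshot_date = df["transaction_date"].max() + pd.Timedelta(days=1)

    rfm = df.groupby("customer_id").agg(
        recency=("transaction_date", lambda d: (snapshot_date - d.max()).days),
        frequency=("transaction_date", "count"),
        monetary=("sales_amount", "sum"),
    )

    labels = list(range(1, n_bins + 1))
    # Rank first so that tied values do not produce duplicate bin edges.
    # Recency is reversed: a more recent purchase earns a higher score.
    rfm["r_score"] = pd.qcut(
        rfm["recency"].rank(method="first"), n_bins, labels=labels[::-1]
    ).astype(int)
    rfm["f_score"] = pd.qcut(
        rfm["frequency"].rank(method="first"), n_bins, labels=labels
    ).astype(int)
    rfm["m_score"] = pd.qcut(
        rfm["monetary"].rank(method="first"), n_bins, labels=labels
    ).astype(int)
    rfm["rfm_score"] = rfm[["r_score", "f_score", "m_score"]].sum(axis=1)
    return rfm


rfm = compute_rfm(transactions)
print(rfm.sort_values("rfm_score", ascending=False))
```

Segments can then be assigned by thresholding rfm_score or by mapping combinations of the three scores to named groups; the thresholds are a business choice and are not specified here.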

Techniques

The analysis is implemented in Python and combines data analysis with machine learning: pandas for data manipulation, matplotlib and seaborn for visualization, and scikit-learn and XGBoost for modeling.
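
To make the churn-modeling step more concrete, here is a minimal sketch of fitting a classifier on RFM-style features and reading off churn probabilities. Everything in it is an assumption for illustration: the data is synthetic, the churn label is simulated rather than derived from real inactivity, and scikit-learn's GradientBoostingClassifier stands in for XGBoost so that the example depends only on scikit-learn.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_customers = 500

# Synthetic RFM-style features (hypothetical, not the project's data).
features = pd.DataFrame({
    "recency": rng.integers(1, 365, n_customers),
    "frequency": rng.integers(1, 30, n_customers),
    "monetary": rng.gamma(2.0, 150.0, n_customers),
})

# Simulated label: long-inactive, infrequent customers churn more often.
logit = 0.015 * (features["recency"] - 180) - 0.1 * (features["frequency"] - 10)
churned = (rng.random(n_customers) < 1 / (1 + np.exp(-logit))).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    features, churned, test_size=0.25, stratify=churned, random_state=0
)

# Gradient-boosted trees, used here as a stand-in for XGBoost.
model = GradientBoostingClassifier(random_state=0)
model.fit(X_train, y_train)

# Probability of the positive (churn) class for each held-out customer.
churn_probability = model.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, churn_probability)
print(f"Hold-out ROC AUC: {auc:.3f}")
```

On real data, the label would need an explicit churn definition (for example, no purchase within a chosen window after a cutoff date), with features computed only from transactions before that cutoff to avoid leakage.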