This project is an end-to-end Customer Behavior Analytics case study designed to demonstrate practical data analysis skills across Python, SQL, and Power BI. It replicates a real-world business analytics workflow where raw customer data is transformed into meaningful insights for decision-making.
As a student pursuing a career in Data Analytics, this project helped me practice the complete process — from understanding the business problem to delivering the final analytical report and dashboard.
The goal of this project is to analyze customer purchasing behavior and identify the key factors that influence sales, customer loyalty, and product performance. This project highlights my ability to:
Framed a clear business problem statement focusing on customer behavior, revenue trends, discounts, and loyalty patterns.
- Cleaned and transformed the dataset
- Resolved missing values and inconsistencies
- Performed exploratory data analysis (EDA)
- Engineered new features such as Age Groups
Executed SQL queries to analyze:
- Revenue contribution
- Product performance
- Customer segmentation
- Discount impact
- Subscription trends
- Repeat customer behavior
Designed a visually interactive dashboard that presents business insights clearly for stakeholders.
Prepared PDF reports and a presentation that summarize insights and business recommendations.
📄 Business Problem Statement.pdf
📄 Customer Shopping Behavior Analysis.pdf
📊 Customer-Shopping-Behavior-Analysis.pptx
🐍 Customer_Shopping_Behavior_Analysis.ipynb
📘 README.md
📈 customer_behavior_dashbord.pbix
🗄 customer_behavior_sql_queries.sql
📁 customer_shopping_behavior.csv
Each file plays a specific role in the project:
Defines the business understanding, objectives, and key questions.
A detailed summary of findings and business insights.
A visual walkthrough of insights suitable for interviews or academic evaluations.
Contains Python-based data cleaning, feature engineering, and exploratory analysis.
Interactive dashboard revealing patterns in customer behavior.
All SQL scripts used to analyze the PostgreSQL database.
The original Customer Shopping Behavior dataset used in the project.
- Python: Pandas, NumPy, Data Cleaning & EDA
- SQL: Aggregations, Joins, CTEs, Window Functions
- Power BI: DAX, Power Query, Dashboard Design
- PostgreSQL Database Management
- Customer segmentation
- Revenue analysis
- Product & discount analysis
- Insight generation
- Storytelling with data
- RFM segmentation (Recency, Frequency, Monetary)
- Predictive modeling for customer churn
- Additional DAX measures
- Automated ETL workflow
