This dataset represents customer purchase behavior and preferences for an e-commerce or retail clothing and footwear platform. Here’s a column-wise breakdown of the dataset:
- Customer ID (Column A): A unique identifier for each customer (not shown completely but inferred from context).
- Age (Column B): The customer’s age.
- Gender (Column C): The gender of the customer (all shown here are “Male”).
- Item Purchased (Column D): The specific item bought by the customer (e.g., Blouse, Sweater, Jeans).
- Category (Column E): The broad category of the item (e.g., Clothing, Footwear, Outerwear).
- Purchase Location (Column F): Numerical code, possibly representing store ID or purchase region.
- Location (Column G): The U.S. state where the customer is located.
- Size (Column H): The size of the purchased item (e.g., L, M, S).
- Color (Column I): The color of the item.
- Season (Column J): The season associated with the purchase, possibly indicating seasonal demand or use.
- Review Rating (Column K): Customer rating of the product (out of 5).
- Subscription Status (Column L): Indicates if the customer has a subscription (all shown are “Yes”).
- Shipping Type (Column M): Method of shipping (e.g., Express, Free Shipping, Next Day Air).
- Discount Applied (Column N): Indicates if a discount was applied (“Yes” in all shown).
- Promo Code Used (Column O): Indicates if a promo code was used (“Yes” in all shown).
- Previous Purchases (Column P): The number of past purchases made by the customer.
- Payment Method (Column Q): How the customer paid (e.g., Venmo, PayPal, Credit Card).
- Frequency of Purchase (Column R): How often the customer makes purchases (e.g., Weekly, Annually, Fortnightly).
Dataset Link: https://github.com/slidescope/data/blob/master/shopping_trends_updated.csv
This type of dataset is valuable for customer segmentation, recommendation systems, personalized marketing, and analyzing purchase behavior trends.
Here are KPIs and visualization ideas for building a customer purchase behavior dashboard in Power BI using this dataset:
🔑 Key Performance Indicators (KPIs)
- Total Customers
- Unique count of customer IDs.
- Total Sales / Purchases
- Sum of purchase counts (if monetary value is available, then total revenue).
- Average Review Rating
- Average of the “Review Rating” column.
- Repeat Purchase Rate
- Percentage of customers with “Previous Purchases” > 1.
- Subscription Rate
- % of customers with “Subscription Status” = Yes.
- Average Purchase Frequency
- Mode or weighted frequency (Weekly, Monthly, etc.).
- Top Payment Methods
- Distribution of payment methods used.
- Promo Code Utilization Rate
- % of purchases where “Promo Code Used” = Yes.
- Discount Usage Rate
- % of transactions with a discount applied.
- Top Performing Categories
- Category-wise purchase counts or revenue.
- Most Popular Shipping Type
- Count or % of each shipping type used.
- Purchase Season Trend
- Purchases broken down by season (Winter, Summer, etc.).
📊 Visual Ideas
1. Customer Demographics Panel
- Pie Chart: Gender Distribution
- Histogram: Age Distribution
- Map: Purchase distribution by state (Location)
2. Sales & Reviews Overview
- KPI Cards: Total Sales, Avg. Review Rating, Subscription Rate
- Line Chart: Review Rating trends over time or age
3. Category Performance
- Stacked Bar Chart: Items Purchased by Category
- Treemap: Category vs. Item Purchased counts
4. Customer Behavior
- Bar Chart: Frequency of Purchases by Payment Method
- Funnel Chart: Subscription → Promo Use → Discount Use → Repeat Purchases
5. Shipping & Seasonal Trends
- Donut Chart: Shipping Type breakdown
- Stacked Column Chart: Purchase counts by Season and Category
6. Promo & Discount Analytics
- Gauge: Promo Code Usage %
- Clustered Column Chart: Promo Code Used vs Not Used by Payment Method
🧠 Advanced Ideas
- Churn Prediction (using Previous Purchases and Frequency)
- Customer Segmentation (age, purchase pattern, subscription status)
