Skip to content

Analyze Walmart Black Friday purchase data to understand customer spending patterns by gender, marital status, and age using Confidence Intervals and the Central Limit Theorem (CLT). Includes EDA, visualization, hypothesis testing, and actionable business insights.

License

Notifications You must be signed in to change notification settings

SamarthKolge-Analyst/Walmart-Purchase-Analysis-Confidence-Interval-CLT

Repository files navigation

Walmart Customer Purchase Behavior Analysis: Confidence Interval & Central Limit Theorem (CLT)

Walmart Company


🎯 Objective

To analyze and compare customer purchase amounts during Walmart's Black Friday sales based on gender, marital status, and age groups. Using statistical methods like confidence intervals and the Central Limit Theorem, the project aims to uncover spending patterns and provide Walmart with actionable recommendations to optimize marketing and sales strategies.


📚 About Walmart

Walmart is a global retail giant operating a vast network of supercenters, discount department stores, and grocery stores. Serving millions of customers worldwide, Walmart continually seeks data-driven insights to enhance customer experience and business decisions.


Dataset

The dataset includes transaction data of Walmart customers during Black Friday with the following key columns:

Column Name Description
User_ID Unique identifier for each customer
Product_ID Unique identifier for each product
Gender Gender of the customer (M/F)
Age Age group (binned)
Occupation Masked occupation category
City_Category City category (A, B, C)
StayInCurrentCityYears Years lived in current city
Marital_Status Married or Unmarried
ProductCategory Masked product category
Purchase Purchase amount in currency units

Dataset link: Walmart_data.csv


Methodology

  • Data Import and Cleaning
  • Exploratory Data Analysis (EDA) with visualizations
  • Detect and handle missing values and outliers
  • Calculate average purchase amounts by gender, marital status, and age
  • Use Central Limit Theorem to build confidence intervals for population means
  • Compare confidence intervals to check for significant differences
  • Provide business insights and recommendations

Key Concepts Used

  • Confidence Interval Calculation
  • Central Limit Theorem (CLT)
  • Exploratory Data Analysis (EDA)
  • Data Visualization (Boxplots, Histograms, Heatmaps)
  • Statistical Inference

Results Summary

  • Insights into whether women spend more than men on Black Friday
  • Confidence intervals showing spending behavior by gender, marital status, and age
  • Recommendations on how Walmart can tailor marketing and inventory decisions
  • Observations on overlap of confidence intervals and their business implications

📂 Files in this Repository

File Type Description
Jupyter Notebook Walmart_Purchase_Analysis.ipynb - Full code, analysis, and visualizations
PDF Report Walmart_Purchase_Analysis.pdf - PDF export of the notebook with detailed explanations
Dataset Walmart_data.csv - Dataset used for the analysis

Feel free to reach out for questions and suggestions !


💻Compiled by Samarth

About

Analyze Walmart Black Friday purchase data to understand customer spending patterns by gender, marital status, and age using Confidence Intervals and the Central Limit Theorem (CLT). Includes EDA, visualization, hypothesis testing, and actionable business insights.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published