This project presents a complete data preparation and visualization workflow using Microsoft Excel. The dataset comprises customer information related to bike purchases, including demographic, behavioral, and regional attributes. All data cleaning, transformation, and visual analysis were conducted in Excel, making this a comprehensive no-code data analytics project.
- Clean and standardize a raw dataset for business analysis
- Transform encoded categorical data into human-readable formats
- Derive new features (e.g., age group segmentation)
- Generate pivot tables for analytical insights
- Design an interactive Excel dashboard for stakeholder presentation
The dataset contains 1026 customer records and 13 attributes:
Column | Description |
---|---|
ID |
Unique customer identifier |
Marital Status |
Encoded as M (Married) or S (Single) |
Gender |
Encoded as M (Male) or F (Female) |
Income |
Annual income in USD |
Children |
Number of children |
Education |
Highest academic qualification |
Occupation |
Profession type (e.g., Professional, Clerical) |
Home Owner |
Homeownership status (Yes /No ) |
Cars |
Number of cars owned |
Commute Distance |
Distance between home and workplace (e.g., 0-1 Miles ) |
Region |
Geographic region (e.g., Europe, Pacific) |
Age |
Age of the individual |
Purchased Bike |
Target variable – whether the customer purchased a bike |
The Working Sheet
contains the cleaned and enhanced version of the original dataset with 1000 rows and 14 columns. The following operations were performed:
- Marital Status: Replaced codes with readable labels
M
→Married
,S
→Single
- Gender: Replaced gender codes
M
→Male
,F
→Female
- Applied Consistent Casing using Excel's
PROPER()
andTRIM()
functions across:Education
Occupation
Commute Distance
Region
- Removed any extra whitespace or inconsistent formatting in all categorical columns
- Verified consistent capitalization in values like:
- Education: e.g.,
Partial High School
,Graduate Degree
- Occupation: e.g.,
Skilled Manual
,Clerical
,Management
- Commute Distance: e.g.,
0-1 Miles
,More than 10 Miles
- Region: Normalized to
Europe
,Pacific
,North America
- Education: e.g.,
A new column Age-Brackets
was derived using conditional logic:
Adolescent
: Age < 36Middle Age
: Age between 36 and 55Old
: Age > 55
This binning facilitates demographic segmentation and enhances dashboard filtering.
- Confirmed that all records are complete (no missing values)
- Ensured numeric columns (
Income
,Children
,Cars
,Age
) are valid and within expected ranges
Three detailed pivot tables were developed to uncover insights into bike buying behavior:
- Row Labels: Gender (Male, Female)
- Column Labels: Bike Purchase (
Yes
,No
) - Values: Average of Income
- Insight: Compare income levels across genders and their likelihood to purchase bikes
- Row Labels: Commute Distance (e.g.,
0-1 Miles
,More than 10 Miles
) - Column Labels: Purchased Bike (
Yes
,No
) - Values: Count of records
- Insight: Evaluate how commuting range affects purchase behavior
- Row Labels: Age Brackets (
Adolescent
,Middle Age
,Old
) - Column Labels: Purchased Bike (
Yes
,No
) - Values: Count of records
- Insight: Identify the most likely customer age group to purchase bikes
Each pivot includes grand totals for high-level comparison.
The Dashboard
sheet hosts a fully functional Excel dashboard titled "Bike Sales Dashboard". It visually represents insights using charts and filters for real-time exploration.
-
📊 Pivot Charts:
- Bike purchases by gender
- Bike purchases by age bracket
- Commute distance vs. purchase decision
-
🧩 Slicers:
- Marital Status
- Education
- Region
- Children
These tools allow users to filter data dynamically and gain actionable insights through simple clicks — no code or formula writing required.
- Microsoft Excel
IF()
,PROPER()
,TRIM()
for categorical transformation- Custom column creation (Age Brackets)
- Pivot tables for aggregation and analysis
- Pivot charts and slicers for dashboard interactivity
- Sheet formatting and layout design for clarity and usability
- Cleaned and standardized a customer dataset using Excel
- Transformed encoded fields into meaningful labels
- Developed demographic-based segmentation features
- Designed three pivot tables for data-driven insights
- Built an interactive dashboard to support decision-making and reporting