ALOJIPAN Assessment_Task_1_Sampling_Data_Visualization

The document outlines an assessment task for students in a Quantitative Methods course at Emilio Aguinaldo College, focusing on exploratory data analysis and visualization of an iPhone purchase dataset. Students will utilize Python libraries like Matplotlib and Seaborn to analyze purchasing patterns based on demographics and visualize trends over time. The task includes specific instructions for data manipulation and visualization techniques, with expected outputs and interpretations for various graphical representations.

Uploaded by

shieldawnalojipan51


EMILIO AGUINALDO COLLEGE

SCHOOL OF ENGINEERING AND TECHNOLOGY

QUANTITATIVE METHODS
(INCLUDING MODELING AND SIMULATION)

ASSESSMENT TASK 1
(Exploratory Data Analysis and Visualization)

Name (Student):
SHIEL DAWN AMON ALOJIPAN

Instructor / Professor:
ALEX HERNANDEZ

Submission Date:
12/02/2025
Objective:
The objective of this data visualization practice exercise is to analyze and interpret the iPhone
purchase dataset using various graphical representations. Through this exercise, students will:

● Gain hands-on experience with data visualization techniques in Python.

● Learn to use libraries such as Matplotlib and Seaborn for effective data representation.

● Identify patterns and trends in the dataset, including purchase behavior by age, gender, education
level, and location.
● Understand how different types of visualizations help convey meaningful insights from data.

Instructions:

● Please download the iPhone purchase dataset

(https://classroom.google.com/c/NzQ4ODU3NDAzNDMx/m/NzM3NjYwNjM2ODg5/details)
● Upload the dataset to the Files panel of Google Colab (the code below expects the file name iphone_purchases.csv).

● Use Google Colab to run the Python code.

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
# Load the dataset
df = pd.read_csv("iphone_purchases.csv")
# Convert Date of Purchase to datetime format
df["Date of Purchase"] = pd.to_datetime(df["Date of Purchase"])
# 1. Distribution of Age
df["Age"].plot(kind='hist', bins=20, edgecolor='black', title='Age Distribution')
plt.xlabel("Age")
plt.show()

Output

After observing the generated histogram, you might describe the findings like this:
"The histogram shows that the customer ages are mostly concentrated between [age range], with
a peak around [age]. The distribution appears to be slightly skewed to the [left/right], indicating a
higher proportion of [younger/older] customers. There are a few outliers above [age], representing
a small segment of older customers."
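
The bracketed values in the description above can be read off the data rather than estimated by eye. The following is a minimal sketch, assuming a small synthetic stand-in for df["Age"] (the ages below are made up for illustration and are not taken from iphone_purchases.csv):

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for df["Age"]; replace with the real column in Colab
ages = pd.Series([22, 24, 25, 25, 27, 28, 30, 31, 33, 35, 41, 58], name="Age")

# Count values per bin, as a histogram does, and locate the tallest bin
counts, edges = np.histogram(ages, bins=6)
peak = counts.argmax()
peak_range = (edges[peak], edges[peak + 1])

# Positive skewness suggests a longer right tail (a few older customers);
# negative skewness suggests a longer left tail
skewness = ages.skew()

print(f"Peak bin: {peak_range[0]:.0f} to {peak_range[1]:.0f}")
print(f"Skewness: {skewness:.2f}")
```

With the real dataset, passing bins=20 would match the histogram call above.
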
# 2. Count of Purchases by Gender
sns.countplot(x="Gender", data=df, palette="coolwarm")
plt.title("Purchases by Gender")
plt.show()

Output

After observing the generated countplot, you might describe the findings like this:
"The countplot reveals that [Gender] made significantly more purchases compared to [Gender],
accounting for [Percentage]% of total purchases. This suggests a potential preference for
[product/brand] among [Gender] customers. The difference in purchase counts is substantial,
indicating a clear distinction in purchasing behavior between the two genders."
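
The [Percentage] placeholder above can be filled from the counts themselves. A minimal sketch, assuming a tiny illustrative DataFrame in place of the real data (df_demo and its rows are hypothetical):

```python
import pandas as pd

# Hypothetical stand-in for the "Gender" column of the dataset
df_demo = pd.DataFrame({"Gender": ["Female", "Male", "Female", "Female", "Male"]})

# Absolute counts (what the countplot draws) and percentage shares
counts = df_demo["Gender"].value_counts()
shares = df_demo["Gender"].value_counts(normalize=True).mul(100).round(1)

print(pd.DataFrame({"count": counts, "percent": shares}))
```

The same two calls on df["Gender"] should give the figures for the actual dataset.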

# 3. Count of Purchases by Education Level

sns.countplot(y="Education", data=df, palette="viridis")
plt.title("Purchases by Education Level")
plt.show()

Output
"The countplot reveals that customers with [Education Level] have the highest number of
purchases, followed by [Education Level] and [Education Level]. This suggests a potential
correlation between education level and purchasing behavior for this product/service. There is a
noticeable trend of [increasing/decreasing] purchases with higher education levels."

# 4. Most Purchased iPhone Models

sns.countplot(y="Item Purchased", data=df, order=df["Item Purchased"].value_counts().index,
palette="pastel")
plt.title("Most Purchased iPhone Models")
plt.show()

Output

"The iPhone 15 is the most purchased iPhone model, followed by [second most purchased model]
and [third most purchased model]. There is a noticeable difference in purchase frequency
between the iPhone 15 and other models, indicating its strong popularity among customers. The
remaining iPhone models have relatively similar purchase counts, suggesting a more balanced
distribution for those models."

# 5. Total Sales by Location

df.groupby("Purchase Location")["Total Amount"].sum().sort_values().plot(kind='barh', color='skyblue',
title='Total Sales by Location')
plt.xlabel("Total Sales ($)")
plt.show()

Output
"The bar chart highlights that [Location 1] and [Location 2] are the top-performing locations,
generating the highest total sales and contributing [Percentage 1]% and [Percentage 2]% of overall
revenue, respectively. [Location 3] follows with a significantly lower sales figure, indicating a
potential need for further investigation or marketing efforts in that region. The remaining locations
show a relatively even distribution of sales, suggesting a balanced market presence across those
areas."
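
The [Percentage 1] and [Percentage 2] values above are each location's share of overall revenue. A minimal sketch of that calculation, assuming placeholder location names and amounts (not values from the dataset):

```python
import pandas as pd

# Hypothetical rows; the column names match the dataset, the values do not
df_demo = pd.DataFrame({
    "Purchase Location": ["Store A", "Store B", "Store A", "Store C", "Store B", "Store A"],
    "Total Amount": [1000.0, 800.0, 1200.0, 500.0, 700.0, 800.0],
})

# Total sales per location, largest first
totals = (
    df_demo.groupby("Purchase Location")["Total Amount"]
    .sum()
    .sort_values(ascending=False)
)

# Each location's share of overall revenue, in percent
share = (totals / totals.sum() * 100).round(1)

print(pd.DataFrame({"total": totals, "share_pct": share}))
```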

# 6. Purchase Trends Over Time

df.groupby("Date of Purchase")["Quantity"].sum().plot(figsize=(12,5), title='Purchase Trends Over Time',
color='purple')
plt.ylabel("Quantity Sold")
plt.show()
Output

"The plot reveals an overall [increasing/decreasing/stable] trend in purchase quantity over time.
There appears to be a seasonal pattern with sales peaking around [month/quarter/year] and
declining during [month/quarter/year]. A significant increase in sales occurred in [month/year], as
indicated by a sharp rise in the plotted line. This could be attributed to [potential
reason, e.g., new product launch, marketing campaign]. Overall, the purchase trends show a
[positive/negative/mixed] outlook for sales performance."
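
Daily totals can be noisy, so statements about peaks and sharp increases are easier to support with monthly totals and their month-over-month percentage change. A minimal sketch, assuming made-up dates and quantities (df_demo is hypothetical):

```python
import pandas as pd

# Hypothetical purchases; the column names match the dataset
df_demo = pd.DataFrame({
    "Date of Purchase": pd.to_datetime(
        ["2024-01-05", "2024-01-20", "2024-02-10",
         "2024-03-03", "2024-03-15", "2024-03-28"]
    ),
    "Quantity": [1, 1, 3, 2, 2, 2],
})

# Sum quantities per calendar month ("MS" labels each bucket by month start)
monthly = df_demo.set_index("Date of Purchase")["Quantity"].resample("MS").sum()

# Month-over-month change in percent; the first month has no predecessor
pct_change = monthly.pct_change().mul(100).round(1)

print(pd.DataFrame({"quantity": monthly, "pct_change": pct_change}))
print("Largest increase:", pct_change.idxmax().strftime("%B %Y"))
```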

# 7. Boxplot of Total Amount by Gender

sns.boxplot(x="Gender", y="Total Amount", data=df, palette="coolwarm")
plt.title("Total Amount Spent by Gender")
plt.show()
Output

"The boxplot reveals that [Gender] tend to spend more on average, as indicated by a higher median
spending amount. The spread of spending is wider for [Gender], suggesting greater variability in
their purchase amounts. There are a few outliers for [Gender], representing unusually high
spending instances. Overall, [Gender] contribute a larger percentage ([Percentage]%) to the total
spending compared to [Gender] ([Percentage]%)."
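
The medians, outliers, and spending shares named above can be computed directly. The sketch below assumes illustrative amounts and uses the 1.5 × IQR rule, which is the default whisker rule in seaborn's boxplot; iqr_outliers is a helper introduced here for illustration only:

```python
import pandas as pd

# Hypothetical spending data; the column names match the dataset
df_demo = pd.DataFrame({
    "Gender": ["Female"] * 5 + ["Male"] * 5,
    "Total Amount": [900, 1000, 1100, 1200, 3000, 800, 850, 900, 950, 1000],
})

def iqr_outliers(s):
    """Return values lying beyond 1.5 * IQR from the quartiles."""
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    iqr = q3 - q1
    return s[(s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)]

# Median, mean, and total spending per gender, plus share of overall spending
summary = df_demo.groupby("Gender")["Total Amount"].agg(["median", "mean", "sum"])
summary["share_pct"] = (summary["sum"] / summary["sum"].sum() * 100).round(1)

# Outlying amounts per gender
outliers = {
    gender: iqr_outliers(amounts).tolist()
    for gender, amounts in df_demo.groupby("Gender")["Total Amount"]
}

print(summary)
print(outliers)
```
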

# 8. Correlation Heatmap
plt.figure(figsize=(8,6))
sns.heatmap(df.corr(numeric_only=True), annot=True, cmap="coolwarm", linewidths=0.5)
plt.title("Correlation Heatmap")
plt.show()

Output
"The correlation heatmap reveals a strong positive correlation between [Variable 1] and [Variable
2], suggesting that they tend to move together in the same direction. There is a moderate negative
correlation between [Variable 3] and [Variable 4], indicating an inverse relationship. [Variable 5]
shows weak or no correlation with other variables. These findings provide insights into the
relationships between different features in the dataset and can guide further analysis or
modeling."
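
To decide which pairs belong in the [Variable] placeholders, the correlation matrix can be flattened into a ranked list of pairs. A minimal sketch, assuming three numeric columns with made-up values:

```python
from itertools import combinations

import pandas as pd

# Hypothetical numeric columns; the values are made up
df_demo = pd.DataFrame({
    "Quantity": [1, 2, 3, 4, 5],
    "Total Amount": [1000, 2000, 3000, 4000, 5000],
    "Age": [40, 22, 35, 28, 31],
})

corr = df_demo.corr(numeric_only=True)

# One entry per unordered pair of columns (the diagonal is skipped)
pairs = pd.Series(
    {(a, b): corr.loc[a, b] for a, b in combinations(corr.columns, 2)}
)

# Strongest relationships first, regardless of sign
pairs = pairs.sort_values(key=abs, ascending=False)

print(pairs)
```
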

# 9. Please provide the mean, median, mode of the purchase data.
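
One possible approach to item 9, shown as a sketch rather than the expected answer: compute the three statistics for every numeric column. The DataFrame below is a hypothetical stand-in; in Colab the same steps would be applied to df.

```python
import pandas as pd

# Hypothetical stand-in for the numeric columns of the dataset
df_demo = pd.DataFrame({
    "Age": [22, 25, 25, 31, 47],
    "Quantity": [1, 1, 2, 1, 3],
    "Total Amount": [999.0, 999.0, 2198.0, 1099.0, 3297.0],
})

numeric = df_demo.select_dtypes("number")

summary = pd.DataFrame({
    "mean": numeric.mean(),
    "median": numeric.median(),
    # mode() can return several rows when there are ties; keep the first
    "mode": numeric.mode().iloc[0],
})

print(summary)
```

For categorical columns such as "Gender" or "Item Purchased", only the mode is meaningful.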

Output
