0% found this document useful (0 votes)
13 views

Practical Assignment ( Data Analytics )

This document presents a practical file for a Data Analytics course, detailing an analysis of sales data from an e-commerce platform. The analysis involved data cleaning, exploratory data analysis, and visualizations, revealing key insights such as Classic Cars being the primary revenue driver and a peak sales year in 2004. The findings emphasize the importance of strategic planning to address potential challenges in revenue growth.

Uploaded by

Lawrence mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views

Practical Assignment ( Data Analytics )

This document presents a practical file for a Data Analytics course, detailing an analysis of sales data from an e-commerce platform. The analysis involved data cleaning, exploratory data analysis, and visualizations, revealing key insights such as Classic Cars being the primary revenue driver and a peak sales year in 2004. The findings emphasize the importance of strategic planning to address potential challenges in revenue growth.

Uploaded by

Lawrence mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Session: 2022-2025

Branch: BCA (Honours) - 5th Semester


Subject: Data Analytics (BCAH-503)

Practical File

Submitted To: Submitted By:


Mr. Ratnesh Luv Jain (BCAH1CA22019)
Assistant Professor
Dept of CSE

Content
Introduction……………………………………………….3
Objectives………………………………………………….4
Steps of Implementation…………………………………….5
Key Code Snippets…………………………………………11
Conclusion……………………………………………………12
References……………………………………………………13

Introduction
The dataset used in this investigation includes detailed sales
transaction data from an e-commerce platform. It contains 2,823
records with 25 attributes that capture critical information such
as order details, client information, product categories, and sales
figures.

The key elements of the dataset include:


a) Temporal Data: ORDERDATE provides insights into sales
trends over time.
b) Geographic Data: Attributes such as STATE, COUNTRY, and
TERRITORY enable regional sales analysis.
c) Product Details: PRODUCTLINE and PRODUCTCODE are
useful for understanding category performance.
d) Sales metrics: Attributes such as SALES, PRICEEACH, and
QUANTITYORDERED offer specific financial transaction
information.

Objective
The purpose of this assignment is to analyze sales data from an
e-commerce platform to uncover key insights. Using Python, the
dataset was cleaned, analyzed, and visualized to ensure data
quality and reveal trends, outliers, and revenue-driving factors.
The dataset contained columns such as order details, customer
and product information, and sales figures.
Steps of Implementation
1) Data Cleaning:
 Missing values in STATE and TERRITORY were replaced
with "Unknown."
 Empty ADDRESSLINE2 fields were filled with blank
strings.
 Missing POSTALCODE values were standardized as
"00000."
 Converted ORDERDATE to datetime format to enable date-
based analysis.

(Uncleaned Data) (Cleaned Data)


2) Exploratory Data Analysis (EDA):
 Sales Distribution: I visualized the distribution of sales
figures, which confirmed significant fluctuation across
orders.

 Deal Size Trends: We analyzed the number of orders by


DEALSIZE and discovered that medium transactions
were the most common.

 Outliers: We identified extraordinary sales numbers that


could suggest high-value orders or errors.
3) Data Aggregation and Grouping
 Total Sales: $10,032,628.85.
 Average Sales Per Customer (examples):
 Alpha Cognac: $3,524.42
 Amica Models: $3,619.89
 Product Line Performance:
 Classic Cars: $3,919,615.66.
 Motorcycles: $1,166,388.34.
 Yearly Sales Trends:
 2004 was the peak year, contributing $4,724,162.60
4) Visualizations
a) Bar Charts:
Sales by PRODUCTLINE and
DEALSIZE
revealed top-
performing
categories and
deal sizes.

b) Line Chart:
Yearly sales
trends showed performance over time, peaking in 2004.

c) Histograms:
Analyzed sales distribution and identified frequent order ranges.
5) Key Insights
 Revenue Drivers:
o Classic Cars and Vintage Cars are the primary revenue-
generating product lines.
 Customer Preferences:
o Medium-sized deals contribute the majority of revenue.
 Trends Over Time:
o Sales growth peaked in 2004 but showed a decline in
2005, indicating potential operational or market
challenges.
Key Code Snippets
Conclusion
The analysis of the e-commerce sales data revealed key insights
critical to understanding revenue drivers, customer behaviour,
and performance trends over time. Classic Cars and Vintage
Cars emerged as the primary contributors to overall revenue,
showcasing their dominance in customer preferences. The
prevalence of medium-sized deals highlights a balanced
approach between value and volume, suggesting these deals as a
sweet spot for maximizing revenue. A temporal analysis revealed
that 2004 was the peak sales year, indicating operational
efficiency or favourable market conditions during that period.
However, the subsequent decline in 2005 raises concerns about
potential challenges such as market saturation, reduced demand,
or inefficiencies. These insights underscore the importance of
strategic planning and targeted efforts to sustain and grow
revenue streams in underperforming years.

Overall, the findings provide actionable guidance for optimizing


product focus, deal structuring, and operational strategies to
enhance business performance.
References
1)https://www.kaggle.com/datasets/kyanyoga/sample-
sales-data
2) https://docs.python.org/3/library/index.html
3)https://guides.lib.uoguelph.ca/c.php?
g=700755&p=4976233
4) https://www.sisense.com/glossary/data-
standardization/
5) https://www.spyder-ide.org/

SalesDataSet

You might also like