Customer_Segmentation_Using_Hierarchical_Clustering (1)
Customer_Segmentation_Using_Hierarchical_Clustering (1)
zohaibkhanmcitp@gmail.com, 10Arslanravian97@gmail.com
Abstract— In the dynamic landscape of retail, the utilization of hierarchical clustering, an advanced data-
understanding customer behavior is paramount for businesses driven approach, this study aims to unveil valuable insights
seeking to optimize marketing strategies and enhance the that can revolutionize how businesses interact with their mall
shopping experience. This study explores the utilization of customers.
hierarchical clustering techniques for mall customer
segmentation, with a focus on the paper titled ’MALL The essence of this research lies in its potential to uncover
CUSTOMER SEGMENTATION USING MACHINE hidden patterns and preferences governing customer choices
LEARNING’ as the benchmark. Our dataset encompasses a within the mall environment. As businesses grapple with
diverse range of mall customers, spanning demographics and fierce competition in the retail arena, the ability to segment
behavioral attributes. Hierarchical clustering systematically customers effectively and tailor marketing strategies to
groups customers into clusters, revealing distinct segments specific segments holds the promise of a substantial
within the mall’s customer base. A comprehensive analysis of advantage. This paper not only introduces the concept of
these clusters unveils profound insights into customer hierarchical clustering as a potent tool for customer
tendencies, preferences, and purchasing habits. These insights segmentation but also explores its adaptability to diverse mall
form a solid foundation for tailored marketing campaigns, customer datasets.
personalized recommendations, and resource allocation within
the mall. The study contributes significantly to customer In the pages that follow, we delve into the methodology,
analytics, providing retailers with a powerful tool to gain a results, and implications of applying hierarchical clustering to
competitive edge in the retail sector. By leveraging hierarchical mall customer data. By doing so, we aim to provide businesses
clustering for mall customer segmentation, businesses can in the retail industry with a practical resource to enhance
enhance customer satisfaction, drive sales, and foster lasting customer satisfaction, drive sales, and foster enduring
customer relationships. This paper underscores the importance customer relationships in an ever-evolving retail landscape.
of data-driven method- ologies in understanding customer
behavior and offers a practical framework for businesses to II. BACKGROUND AND RELATED WORK
harness the potential of hierarchical clustering for strategic
decision-making in the retail industry. A. Background
Understanding customer behavior through segmentation is
Keywords: Hierarchical Clustering, Customer Segmentation, a fundamental strategy across various industries. This process
Data Mining Techniques, Market Segmentation, Cluster Analysis, involves categorizing customers into distinct groups based on
Agglomerative Clustering Dendrogram, Machine Learning in shared characteristics, enabling businesses to tailor their
Marketing, Consumer Behavior Analysis, Multivariate Data
approaches to meet specific needs and preferences. Malls and
Analysis, Customer Data Clustering, Behavioral Segmentation,
shopping centers serve as vibrant hubs attracting a diverse
Marketing Strategy, Customer Profiling, Predictive Analytics in
CRM
range of customers, each with their unique demographics,
shopping habits, and preferences. Yet, the application of
I. INTRODUCTION hierarchical clustering to segment mall customers remains a
relatively unexplored area. This study aims to address this gap
Understanding customer behavior is a fundamental pursuit by evaluating the suitability and effectiveness of hierarchical
for businesses in the dynamic retail sector. The ability to clustering in segmenting mall customers. The insights gained
dissect and categorize customers into distinct groups, known will have implications not only for retail but also for various
as customer segmentation, plays a pivotal role in shaping sectors seeking to enhance their understanding of diverse
marketing strategies and enhancing shopping experiences. customer bases and optimize their strategies accordingly.
This research project ventures into the application of
hierarchical clustering techniques within the context of mall B. Related Work
customers, a diverse and multifaceted consumer demographic. Customer segmentation and clustering techniques have
Malls attract a wide range of individuals, each with their been extensively studied across various domains, providing
unique prefer- ences, demographics, and behaviors. Through
2
Authorized licensed use limited to: Zhejiang University. Downloaded on November 15,2024 at 15:06:04 UTC from IEEE Xplore. Restrictions apply.
Regularly update and improve the resources based on user
input to enhance ease of use continually.
V. SYSTEM DIAGRAM AND FLOWCHART
Here Fig.1 demonstrates the Flowchart. This flow
represents the process of customer segmentation using various
Hierarchical Clustering methods. It starts by preparing the
data and then employs different clustering techniques
(univariate, bivariate, and multivariate) to group customers
based on different features. The flow then visualizes these
clusters and evaluates their quality using the Silhouette Score.
Finally, it presents various visual representations to
understand and analyze the segmented clusters before
concluding the segmentation pro- cess. The system
architecture Fig.2 starts with user interaction through the User
Interface, followed by data preprocessing for cleaning and
structuring data. The chosen distance metric in the Distance
Metric Selector influences the Linkage Matrix generation
utilized by the Hierarchical Clustering Engine to form
clusters. Evaluation, visualization, labeling, and the final
output, then summarize and present the clustered data for user
interpretation and further analysis.
3
Authorized licensed use limited to: Zhejiang University. Downloaded on November 15,2024 at 15:06:04 UTC from IEEE Xplore. Restrictions apply.
Fig. 3. . Univariate Clustering Dendogram
4
Authorized licensed use limited to: Zhejiang University. Downloaded on November 15,2024 at 15:06:04 UTC from IEEE Xplore. Restrictions apply.
F. Dataset Augmentation and Export represents the clustering patterns, offering insights into the
The cluster labels derived from the multivariate analysis relationships among customers in the context of the selected
features (see Fig.9).
were appended to the original dataset, resulting in a new
dataset named ’cluster.csv.’Additional details, such as the
maximum customer ID, were also extracted and
documented.
Fig. 9. 2D PCA
5
Authorized licensed use limited to: Zhejiang University. Downloaded on November 15,2024 at 15:06:04 UTC from IEEE Xplore. Restrictions apply.
REFERENCES
[1] Chongkolnee Rungruang, Pakwan Riyapan, Arthit Intarasit, Khan- chit
Chuarkham, Jirapond Muangprathub,RFM model customer seg-
mentation based on hierarchical approach using CA,Expert Sys- tems
with Applications,Volume 237, Part B,2024,121449,ISSN 0957-
174,https://doi.org/10.1016/j.eswa.2023.121449.
[2] Sukanlaya Sawang, Chia-Chi Lee, Cindy Yunhsin Chou, Nanjangud
Vishwanath Vighnesh, Deepak Chandrashekar, Understanding post-
pandemic market segmentation through perceived risk, behavioural
intention, and emotional wellbeing of consumers, Journal of Re- tailing
and Consumer Services, Volume 75,2023,103482,ISSN 0969-
6989,https://doi.org/10.1016/j.jretconser.2023.103482.
[3] Hidenori Komatsu, Osamu Kimura,Customer segmentation
based on smart meter data analytics: Behavioral similarities
with manual categorization for building types,Energy and
Buildings, Volume 283,2023,112831,ISSN 0378-
7788,https://doi.org/10.1016/j.enbuild.2023.112831.
[4] D. Teslenko, A. Sorokina, K. Smelyakov and O. Filipov, ”Comparative
Analysis of the Applicability of Five Clustering Algorithms for Market
Segmentation,” 2023 IEEE Open Conference of Electrical, Electronic
and Information Sciences (eStream), Vilnius, Lithuania, 2023, pp. 1-6,
doi: 10.1109/eStream59056.2023.10134796.
[5] Luo, L., Li, B., Fan, X. et al. Dynamic customer segmentation via
hierar- chical fragmentation-coagulation processes. Mach Learn 112,
281–310 (2023). https://doi.org/10.1007/s10994-022-06276-8
[6] S. Jeena, A. Chaudhary and A. Thakur, ”Implementation & Anal- ysis
of Online Retail Dataset Using Clustering Algorithms,” 2023 4th
International Conference on Intelligent Engineering and Man- agement
(ICIEM), London, United Kingdom, 2023, pp. 16, doi:
10.1109/ICIEM59379.2023.10166552.
[7] Jie Yu and Xikun Zhang. 2023. Research on Online Learning User
Classification Based on Hierarchical Clustering. In Proceedings of the
2023 6th International Conference on Big Data and Education
(ICBDE ’23). Association for Computing Machinery, New York, NY,
USA, 100–105. https://doi.org/10.1145/3608218.3608222
[8] Phan Duy Hung, Nguyen Thi Thuy Lien, and Nguyen Duc Ngoc. 2019.
Customer Segmentation Using Hierarchical Agglomerative Clustering.
In Proceedings of the 2nd International Conference on Information Sci-
ence and Systems (ICISS ’19). Association for Computing Machinery,
New York, NY, USA, 33–37.
https://doi.org/10.1145/3322645.3322677
6
Authorized licensed use limited to: Zhejiang University. Downloaded on November 15,2024 at 15:06:04 UTC from IEEE Xplore. Restrictions apply.