Evaluating Customer Segmentation Techniques in the Retail Sector.
DOI:
https://doi.org/10.9781/ijimai.2025.05.001Keywords:
Clustering Algorithms, Customer Segmentation, Machine Learning, Retail Analysis, Unsupervised LearningAbstract
In the current competitive corporate landscape, understanding client preferences and adapting marketing strategies accordingly has become crucial. This study evaluates the effectiveness of four machine learning algorithms (K-Means, Density-Based Spatial Clustering of Applications with Noise (DBSCAN), Gaussian Mixture Models (GMM), and Self-Organizing Maps (SOM)) for customer segmentation in the Turkish retail market. Two datasets were analyzed: a large-scale Turkish market sales dataset and a focused marketing campaign dataset. The research employed a comprehensive methodology encompassing data preparation, algorithm application, and performance evaluation using metrics such as the Calinski-Harabasz Index and Davies- Bouldin score. Results indicate that K-Means demonstrated superior performance in terms of interpretability and statistical validity. DBSCAN showed strengths in identifying non-spherical clusters, while GMM and SOM provided more granular segmentation. The findings offer actionable insights for Turkish retailers to optimize marketing strategies and enhance customer relationship management. This study contributes to the field of retail analytics by providing a methodological framework for evaluating customer segmentation techniques in specific market contexts.
Downloads
References
P. Sharma, The ultimate guide to K-means clustering: Definition, methods and applications. Retrieved from https://www.analyticsvidhya.com/blog/2019/08/comprehensive-guide-k-means-clustering/. Accessed on March, 2024.
S. P. Nguyen, “Deep customer segmentation with applications to a Vietnamese supermarkets' data,” Soft Computtng, vol. 25, no. 12, pp. 7785-7793, 2021.
İ.Kabasakal, “Customer segmentation based on recency frequency monetary model: A case study in E-retailing,“ Bilişim Teknolojileri Dergisi, vol. 13, no. 1, pp. 47-56, 2020.
G. Armstrong and P. Kotler, Marketing: an introduction, Pearson Educación, 2003.
N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, A. Galstyan, “A survey on bias and fairness in machine learning,” ACM computing surveys (CSUR), vol. 54, no. 6, pp. 1-35, 2021.
T. Kansal, S. Bahuguna, V. Singh, and T. Choudhury, “Customer segmentation using K-means clustering,” 2018 international conference on computational techniques, electronics and mechanical systems (CTEMS), pp. 135-139, IEEE, 2018.
G. Mohit, Customer Segmentation using Machine Learning applied to Banking Industry (Doctoral dissertation, Hochschule Neu Ulm), 2023.
D. Zakrzewska and J. Murlewski, “Clustering algorithms for bank customer segmentation,” In 5th International Conference on Intelligent Systems Design and Applications (ISDA’05), pp. 197-202, IEEE, 2005.
C. Rungruang, P. Riyapan, A. Intarasit, K. Chuarkham, andJ.Muangprathub, “RFM model customer segmentation based on hierarchical approach using FCA,” Expert Systems with Applications, vol. 237, 121449, 2024.
K. Tabianan, S. Velu, and V. Ravi, “K-means clustering approach for intelligent customer segmentation using customer purchase behavior data,” Sustainability, vol. 14, no. 12, 7243, 2022.
A. Ashabi, S. B. Sahibuddin, and M. Salkhordeh Haghighi, “The systematic review of K-means clustering algorithm,” In Proceedings of the 2020 9th International Conference on Networks, Communication and Computing, pp. 13-18, 2020.
M. Ester, H. P. Kriegel, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters,” in large spatial databases with noise. In kdd, Vol. 96, No. 34, pp. 226-231, 1996.
V. Kachroo, “Customer segmentation and profiling for e-commerce using DBSCAN and fuzzy C-means,” Proceedings on Engineering, vol. 5, no. 3, pp. 539-544, 2023.
E. A. Laksana, and M. M. Fahrezi, “Customer segmentation and analysisbased on Gaussian mixture model algorithm,” In Widyatama International Conference on Engineering 2024 (WICOENG 2024), pp. 67-75, Atlantis Press, 2024.
D. Birant and A. Kut, “ST-DBSCAN: An algorithm for clustering spatial-temporal data,” Data & knowledge engineering, vol 60, no. 1, pp. 208-221, 2007.
E. Schubert, J. Sander, M. Ester, H. P. Kriegel, X. Xu, “DBSCAN revisited, revisited: why and how you should (still) use DBSCAN,” ACM Transactions on Database Systems (TODS), vol. 42, no. 3, pp. 1-21, 2017.
L. L. Scientific, “Data segmentation using mixture regression models with generalized Gaussian distribution and K-means,” Journal of Theoretical and Applied Information Technology, vol. 103, no. 8, 2025.
E. A. Laksana and M. M. Fahrezi, “Customer segmentation and analysisbased on Gaussian mixture model algorithm,” In Widyatama International Conference on Engineering 2024 (WICOENG 2024), pp. 67-75, Atlantis Press, 2024.
M. A. Camilleri and, M. A. Camilleri, Market segmentation, targeting and positioning, pp. 69-83, Springer International Publishing, 2018.
S. M. Lundberg and S. I. Lee, “A unified approach to interpreting model predictions,” Advances in neural information processing systems, 30, 2017.
G. J. McLachlan, S. X. Lee, and S. I. Rathnayake, Finite mixture models. Annual review of statistics and its application, vol. 6, no. 1, pp. 355-378, 2019.
F. Saadi, B. Atmani, and F. Henni, “Improving retrieval performance of case based reasoning systems by fuzzy clustering,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 9, no. 1, pp. 84-91, 2024.
Y. C. Liu, M. Liu, and X. L. Wang, “Application of Self- Organizing Maps in text clustering,” Applications of Self- Organizing Maps, 205, 2012.
D. Barman and N. Chowdhury, “A novel approach for the customer segmentation using clustering through self-organizing map,” International Journal of Business Analytics (IJBAN), vol. 6, no. 2, pp. 23-45, 2019.
R. Vohra, J. Pahareeya, A. Hussain, F. Ghali, and A. Lui, “Using self organizing maps and K means clustering based on RFM model for customer segmentation in the online retail business,” In Intelligent Computing Methodologies: 16th International Conference, ICIC, Bari, Italy, Proceedings, Part III 16, pp. 484-497, Springer International Publishing, 2020.
S. Üstebey, İ. Yelmen, and M. Zontul, “Customer segmentation based on self-organizing maps: a case study on airline passengers,” Havacılık Ve Uzay Teknolojileri Dergisi, 2020.
T. Kohonen, “The self-organizing map,” Proceedings of the IEEE, vol. 78, no. 9, pp. 1464-1480, 1990.
I. Valova, G. Georgiev, N. Gueorguieva, and J. Olson, “Initialization issues in self-organizing maps,” Procedia Computer Science, vol. 20, pp. 52-57, 2013.
A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” Journal of the royal statistical society: series B (methodological), vol. 39, no. 1, pp. 1-22, 1977.
K. Kumar Sharma, A. Seal, A. Yazidi, and O. Krejcar, “S- divergence-based internal clustering validation index,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 8, no. 4, pp. 127-139, 2023.
O. Colakoglu, 10 Million Rows Turkish Market Sales Dataset (MSSQL). Retrieved from https://www.kaggle.com/datasets/omercolakoglu/10million-rows-turkish-market-sales-dataset. Accessed: March 12, 2024.
R. Saldanha, Marketing Campaign. Retrieved from https://www.kaggle.com/datasets/rodsaldanha/arketing-campaign. Accessed on March, 2024.
Downloads
Published
-
Abstract223
-
PDF156