Abstract
Classical clustering algorithms use all features to partition a dataset, making it difficult for users to understand the clustering results. Some scholars have proposed interpretable clustering algorithms that use a few understandable features to explain clustering results. However, the existing algorithms can only generate one interpretation and fail to satisfy the diverse needs of different users. To address this challenge, the Clustering Customized Explanation Tree (CCET), a visual analytics system, was constructed in this paper. The system helps users modify existing explanations to obtain customized explanations. Firstly, a variety of views are designed to visualize the explanations and help users judge whether the existing explanations meet the requirements. Then, an explanations modification strategy based on cluster centroids splitting is proposed making it easy for users to revise explanations according to the requirement. We demonstrate the CCET using a case study and a user study. The results show that the system can deepen users’ understanding of clustering results and make it easy for them to conduct further decision analysis.













Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Madhulatha, T.S.: “An overview on clustering methods,” arXiv preprint arXiv:1205.1117, (2012)
Chen, J., Chang, Y., Hobbs, B., Castaldi, P., Cho, M., Silverman, E., Dy, J.: Interpretable clustering via discriminative rectangle mixture model. In: 2016 IEEE 16th international conference on data mining (ICDM), pp. 823–828, IEEE, (2016)
Zhao, Y., Liang, S., Ren, Z., Ma, J., Yilmaz, E., de Rijke, M.: Explainable user clustering in short text streams. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 155–164, (2016)
Blockeel, H., De Raedt, L., Ramon, J.: Top-down induction of clustering trees. arXiv preprint cs/0011032, (2000)
Loyola-Gonzalez, O., Gutierrez-Rodríguez, A.E., Medina-Pérez, M.A., Monroy, R., Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., Garcia-Borroto, M.: An explainable artificial intelligence model for clustering numerical databases. IEEE Access 8, 52370–52384 (2020)
Liu, B., Xia, Y., Yu, P.S.: Clustering through decision tree construction. In: Proceedings of the ninth international conference on Information and knowledge management, pp. 20–29, (2000)
Plant, C., Böhm, C.: Inconco: interpretable clustering of numerical and categorical objects. in Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1127–1135, (2011)
Fraiman, R., Ghattas, B., Svarc, M.: Interpretable clustering using unsupervised binary trees. Adv. Data Anal. Classif. 7(2), 125–145 (2013)
Bertsimas, D., Orfanoudaki, A., Wiberg, H.: Interpretable clustering: an optimization approach. Mach. Learn. 110(1), 89–138 (2021)
Sokol, K., Flach, P.: One explanation does not fit all. KI-Künstliche Intelligenz 34(2), 235–250 (2020)
Henin, C., Le Métayer, D.: A multi-layered approach for tailored black-box explanations. In: International Conference on Pattern Recognition, pp. 5–19, Springer, (2021)
Rezaee, M.R., Lelieveldt, B.P., Reiber, J.H.: A new cluster validity index for the fuzzy c-mean. Pattern Recogn. Lett. 19(3–4), 237–246 (1998)
Ester, M., Kriegel, H.-P., Sander, J., Xu, X. ,et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: kdd, vol. 96, pp. 226–231, (1996)
Richardson, S., Green, P.J.: On bayesian analysis of mixtures with an unknown number of components (with discussion). J. R. Stat. Soc. Ser. B (Stat. Methodol.) 59(4), 731–792 (1997)
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)
Moshkovitz, M., Dasgupta, S., Rashtchian, C., Frost, N.: Explainable k-means and k-medians clustering. In: International conference on machine learning, pp. 7055–7065, PMLR, (2020)
Basak, J., Krishnapuram, R.: Interpretable hierarchical clustering by constructing an unsupervised decision tree. IEEE Trans. Knowl. Data Eng. 17(1), 121–132 (2005)
Yang, W., Wang, X., Lu, J., Dou, W., Liu, S.: Interactive steering of hierarchical clustering. IEEE Trans. Visual Comput. Graphics 27(10), 3953–3967 (2020)
Adadi, A., Berrada, M.: Peeking inside the black-box: a survey on explainable artificial intelligence (xai). IEEE access 6, 52138–52160 (2018)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Routledge (2017)
Clancey, W.J.: The epistemology of a rule-based expert system-a framework for explanation. Artif. Intell. 20(3), 215–251 (1983)
Kohavi, R.: The power of decision tables. In: European conference on machine learning, pp. 174–189, Springer, (1995)
Rivest, R.L.: Learning decision lists. Mach. Learn. 2(3), 229–246 (1987)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
Schulz, H.-J.: Treevis. net: a tree visualization reference. IEEE Comput. Graph Appl. 31(6), 11–15 (2011)
Van Den Elzen, S., Van Wijk, J.J.: Baobabview: Interactive construction and analysis of decision trees. In: 2011 IEEE conference on visual analytics science and technology (VAST), pp. 151–160, IEEE, (2011)
Ming, Y., Qu, H., Bertini, E.: Rulematrix: visualizing and understanding classifiers with rules. IEEE Trans. Visual Comput. Graphics 25(1), 342–352 (2018)
Acknowledgements
This work was supported by the Hefei University of Technology Academic Newcomer Enhancement A Program (JZ2021HGTA0140), the National Natural Science Foundation of China (62277014) and the National Key Research and Development Program of China (2020YFC1523100).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, Y., Luo, Y., Liu, Y. et al. CCET: towards customized explanation of clustering. Vis Comput 39, 3169–3181 (2023). https://doi.org/10.1007/s00371-023-02958-z
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-023-02958-z