Skip to main content

Advertisement

Log in

CCET: towards customized explanation of clustering

  • Original article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

Classical clustering algorithms use all features to partition a dataset, making it difficult for users to understand the clustering results. Some scholars have proposed interpretable clustering algorithms that use a few understandable features to explain clustering results. However, the existing algorithms can only generate one interpretation and fail to satisfy the diverse needs of different users. To address this challenge, the Clustering Customized Explanation Tree (CCET), a visual analytics system, was constructed in this paper. The system helps users modify existing explanations to obtain customized explanations. Firstly, a variety of views are designed to visualize the explanations and help users judge whether the existing explanations meet the requirements. Then, an explanations modification strategy based on cluster centroids splitting is proposed making it easy for users to revise explanations according to the requirement. We demonstrate the CCET using a case study and a user study. The results show that the system can deepen users’ understanding of clustering results and make it easy for them to conduct further decision analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
€32.70 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Madhulatha, T.S.: “An overview on clustering methods,” arXiv preprint arXiv:1205.1117, (2012)

  2. Chen, J., Chang, Y., Hobbs, B., Castaldi, P., Cho, M., Silverman, E., Dy, J.: Interpretable clustering via discriminative rectangle mixture model. In: 2016 IEEE 16th international conference on data mining (ICDM), pp. 823–828, IEEE, (2016)

  3. Zhao, Y., Liang, S., Ren, Z., Ma, J., Yilmaz, E., de Rijke, M.: Explainable user clustering in short text streams. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 155–164, (2016)

  4. Blockeel, H., De Raedt, L., Ramon, J.: Top-down induction of clustering trees. arXiv preprint cs/0011032, (2000)

  5. Loyola-Gonzalez, O., Gutierrez-Rodríguez, A.E., Medina-Pérez, M.A., Monroy, R., Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., Garcia-Borroto, M.: An explainable artificial intelligence model for clustering numerical databases. IEEE Access 8, 52370–52384 (2020)

    Article  Google Scholar 

  6. Liu, B., Xia, Y., Yu, P.S.: Clustering through decision tree construction. In: Proceedings of the ninth international conference on Information and knowledge management, pp. 20–29, (2000)

  7. Plant, C., Böhm, C.: Inconco: interpretable clustering of numerical and categorical objects. in Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1127–1135, (2011)

  8. Fraiman, R., Ghattas, B., Svarc, M.: Interpretable clustering using unsupervised binary trees. Adv. Data Anal. Classif. 7(2), 125–145 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  9. Bertsimas, D., Orfanoudaki, A., Wiberg, H.: Interpretable clustering: an optimization approach. Mach. Learn. 110(1), 89–138 (2021)

    Article  MathSciNet  MATH  Google Scholar 

  10. Sokol, K., Flach, P.: One explanation does not fit all. KI-Künstliche Intelligenz 34(2), 235–250 (2020)

    Article  Google Scholar 

  11. Henin, C., Le Métayer, D.: A multi-layered approach for tailored black-box explanations. In: International Conference on Pattern Recognition, pp. 5–19, Springer, (2021)

  12. Rezaee, M.R., Lelieveldt, B.P., Reiber, J.H.: A new cluster validity index for the fuzzy c-mean. Pattern Recogn. Lett. 19(3–4), 237–246 (1998)

    Article  MATH  Google Scholar 

  13. Ester, M., Kriegel, H.-P., Sander, J., Xu, X. ,et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: kdd, vol. 96, pp. 226–231, (1996)

  14. Richardson, S., Green, P.J.: On bayesian analysis of mixtures with an unknown number of components (with discussion). J. R. Stat. Soc. Ser. B (Stat. Methodol.) 59(4), 731–792 (1997)

    Article  MATH  Google Scholar 

  15. Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)

    Article  Google Scholar 

  16. Moshkovitz, M., Dasgupta, S., Rashtchian, C., Frost, N.: Explainable k-means and k-medians clustering. In: International conference on machine learning, pp. 7055–7065, PMLR, (2020)

  17. Basak, J., Krishnapuram, R.: Interpretable hierarchical clustering by constructing an unsupervised decision tree. IEEE Trans. Knowl. Data Eng. 17(1), 121–132 (2005)

    Article  Google Scholar 

  18. Yang, W., Wang, X., Lu, J., Dou, W., Liu, S.: Interactive steering of hierarchical clustering. IEEE Trans. Visual Comput. Graphics 27(10), 3953–3967 (2020)

    Article  Google Scholar 

  19. Adadi, A., Berrada, M.: Peeking inside the black-box: a survey on explainable artificial intelligence (xai). IEEE access 6, 52138–52160 (2018)

    Article  Google Scholar 

  20. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Routledge (2017)

    Book  MATH  Google Scholar 

  21. Clancey, W.J.: The epistemology of a rule-based expert system-a framework for explanation. Artif. Intell. 20(3), 215–251 (1983)

    Article  Google Scholar 

  22. Kohavi, R.: The power of decision tables. In: European conference on machine learning, pp. 174–189, Springer, (1995)

  23. Rivest, R.L.: Learning decision lists. Mach. Learn. 2(3), 229–246 (1987)

    Article  Google Scholar 

  24. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)

    Article  Google Scholar 

  25. Schulz, H.-J.: Treevis. net: a tree visualization reference. IEEE Comput. Graph Appl. 31(6), 11–15 (2011)

    Article  Google Scholar 

  26. Van Den Elzen, S., Van Wijk, J.J.: Baobabview: Interactive construction and analysis of decision trees. In: 2011 IEEE conference on visual analytics science and technology (VAST), pp. 151–160, IEEE, (2011)

  27. Ming, Y., Qu, H., Bertini, E.: Rulematrix: visualizing and understanding classifiers with rules. IEEE Trans. Visual Comput. Graphics 25(1), 342–352 (2018)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the Hefei University of Technology Academic Newcomer Enhancement A Program (JZ2021HGTA0140), the National Natural Science Foundation of China (62277014) and the National Key Research and Development Program of China (2020YFC1523100).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yankong Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Luo, Y., Liu, Y. et al. CCET: towards customized explanation of clustering. Vis Comput 39, 3169–3181 (2023). https://doi.org/10.1007/s00371-023-02958-z

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-023-02958-z

Keywords

Navigation