Abstract
Clustering is an important and challenging task in data mining. DENCLUE, a generalized density-based clustering method, has many remarkable properties, but the quality of its clustering results depends strongly on an adequate choice of two parameters: the density parameter σ and the noise threshold ξ. In this paper, by investigating the influence of these two parameters on the clustering results, we first show that an optimal σ should be chosen to obtain good clustering results. We then propose an entropy-based method for the optimal choice of σ. Further, the noise threshold ξ is estimated to produce a reasonable clustering pattern. Finally, experiments illustrate the effectiveness of our methods.
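The abstract names an entropy-based choice of σ without stating the criterion. As a minimal sketch only, assume a Gaussian kernel density evaluated at each data point, normalize these densities into a distribution, and take the candidate σ with the lowest Shannon entropy. The formula, the candidate grid, and the function names (`gaussian_density`, `density_entropy`, `pick_sigma`) are illustrative assumptions, not taken from the paper.

```python
import math


def gaussian_density(points, sigma):
    """Unnormalized Gaussian kernel density at each data point (assumed kernel)."""
    densities = []
    for x in points:
        total = 0.0
        for y in points:
            sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
            total += math.exp(-sq_dist / (2.0 * sigma ** 2))
        densities.append(total)
    return densities


def density_entropy(points, sigma):
    """Shannon entropy of the densities after normalizing them to sum to 1."""
    dens = gaussian_density(points, sigma)
    z = sum(dens)
    # Each density includes the point's own kernel term (1.0), so d / z > 0.
    return -sum((d / z) * math.log(d / z) for d in dens)


def pick_sigma(points, candidates):
    """Return the candidate sigma with the lowest density entropy (assumed criterion)."""
    return min(candidates, key=lambda s: density_entropy(points, s))


if __name__ == "__main__":
    # Hypothetical toy data: a group of four, a group of two, one isolated point.
    data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
            (5.0, 5.0), (5.1, 5.0), (9.0, 0.0)]
    grid = [0.001, 0.1, 1.0, 1000.0]
    for s in grid:
        print(s, density_entropy(data, s))
    print("selected sigma:", pick_sigma(data, grid))
```

Under this assumed criterion the entropy approaches its maximum, log n, when σ is very small (every point is its own peak) and when σ is very large (the density is nearly flat), so the minimum falls at an intermediate σ. That matches the abstract's claim that an optimal σ exists, but the paper's actual criterion may differ in detail.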
Supported by the National Natural Science Foundation of China under Grant No. 69975024.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gan, W., Li, D. (2003). Optimal Choice of Parameters for a Density-Based Clustering Algorithm. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2003. Lecture Notes in Computer Science, vol 2639. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-39205-X_98
DOI: https://doi.org/10.1007/3-540-39205-X_98
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-14040-5
Online ISBN: 978-3-540-39205-7
eBook Packages: Springer Book Archive