Abstract
Affinity propagation (AP) is a clustering method that takes as input measures of similarity between pairs of data points. As the oscillations and preference value need to be preset, the algorithm precision could not be controlled exactly. To improve the performance of AP, this study utilizes priori pairwise constraints to obtain the reliable similarity matrix named semi-supervised affinity propagation (SAP). To find the best solution in domain of preference value, this study also proposes an improved fruit fly optimization (IFO) to optimize the unknown parameters of the SAP model. The IFO algorithm has introduced the fuzzy density mechanism to enhance the searching capacities of fruit fly individuals. The benchmark functions experiments indicate that the IFO algorithm has better precision and convergence speed than other compared swarm intelligence algorithms. We used SAP that based on IFO to identify UCI datasets and synthetic datasets. The simulation results show that proposed clustering algorithm produces significantly better clustering quality and accuracy results. In addition, we utilized the improved model to analyse the seismic data. The clustering results indicated that the proposed model had the better research potential and the good application value.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Zhu X, Zhu Y, Zheng W (2020) Spectral rotation for deep one-step clustering. Pattern Recognit 105:107175
Peng X, Feng J, Zhou JT, Lei Y, Yan S (2020) Deep subspace clustering. In: IEEE Transactions on Neural Networks and Learning Systems
Aggarwal CC, Reddy CK (2014) Data clustering. Algorithms and applications Chapman&Hall/CRC Data mining and Knowledge Discovery series, Londra
Wu J, Chen L, Zheng Z, Lyu MR, Wu Z (2014) Clustering web services to facilitate service discovery. Knowl Inf Syst 38(1):207–229
Ongus RW (2019) Collecting development practices in using information technology: a comparative study. J Logist Inf Serv Sci 6(2):1–22
Gong M, Liang Y, Shi J, Ma W, Ma J (2012) Fuzzy c-means clustering with local information and kernel metric for image segmentation. IEEE Trans Image Process 22(2):573–584
Chen Y, Hu X, Fan W, Shen L, Zhang Z, Liu X, Du J, Li H, Chen Y, Li H (2020) Fast density peak clustering for large scale data based on knn. Knowl Based Syst 187:104824
Bishop CM (2006) Pattern recognition and machine learning. Springer, Berlin
Getis A, Aldstadt J (2004) Constructing the spatial weights matrix using a local statistic. Geograph Anal 36(2):90–104
Cutting DR, Karger DR, Pedersen JO, Tukey JW (2017) Scatter/gather: a cluster-based approach to browsing large document collections. ACM SIGIR Forum, ACM New York, NY, USA 51:148–159
Jiyoung Yoon SJ (2020) A big data based cosmetic recommendation algorithm. J Syst Manage Sci 10(2):40–52
Harvey RC, Mullighan CG, Wang X, Dobbin KK, Davidson GS, Bedrick EJ, Chen IM, Atlas SR, Kang H, Ar K et al (2010) Identification of novel cluster groups in pediatric high-risk b-precursor acute lymphoblastic leukemia with gene expression profiling: correlation with genome-wide dna copy number alterations, clinical characteristics, and outcome. Blood J Am Soc Hematol 116(23):4874–4884
He Y, Yuan H, Wu C, Xie Z (2020) Disc: a highly scalable and accurate inference of gene expression and structure for single-cell transcriptomes using semi-supervised deep learning. Genome Biol 21(1):1–28
Zou Q, Lin G, Jiang X, Liu X, Zeng X (2020) Sequence clustering in bioinformatics: an empirical study. Brief Bioinf 21(1):1–10
Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recognit Lett 31(8):651–666
Likas A, Vlassis N, Verbeek JJ (2003) The global k-means clustering algorithm. Pattern Recognit 36(2):451–461
Wang WT, Wu YL, Tang CY, Hor MK (2015) Adaptive density-based spatial clustering of applications with noise (dbscan) according to data. In: 2015 International Conference on Machine Learning and Cybernetics (ICMLC), IEEE, vol 1, pp 445–451
Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678
Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976
Zhu X, Zhang S, Zhu Y, Zheng W, Yang Y (2020) Self-weighted multi-view fuzzy clustering. ACM Trans Knowl Dis Data (TKDD) 14(4):1–17
Wang KJ, Li J, Zhang JY, Tu CY (2007) Semi-supervised affinity propagation clustering. Jisuanji Gongcheng/ Comp Eng 33(23):197–198
Xiao Y, Yu J (2008) Semi-supervised clustering based on affinity propagation algorithm. J Softw 19(11):2803–2813
Jia B, Yu B, Wu Q, Wei C, Law R (2016) Adaptive affinity propagation method based on improved cuckoo search. Knowl Based Syst 111:27–35
Wang L, Wang X, Liu G, Zhao L (2010) Improved adaptive affinity propagation clustering based on semi-supervised learning [j]. Appl Res Comput 12:013
Wang L, Ji Q, Han X (2016) Adaptive semi-supervised affinity propagation clustering algorithm based on structural similarity. Tehnicki vjesnik/Technical Gazette 23(2):425–435
Han X, Liu Q, Wang H, Wang L (2018) Novel fruit fly optimization algorithm with trend search and co-evolution. Knowl Based Syst 141:1–17
Shi Y, Chen Z, Qi Z, Meng F, Cui L (2017) A novel clustering-based image segmentation via density peaks algorithm with mid-level feature. Neural Comput Appl 28(1):29–39
Lam YK, Tsang PWM, Leung CS (2013) Pso-based k-means clustering with enhanced cluster matching for gene expression data. Neural Comput Appl 22(7–8):1349–1355
Faris H, Aljarah I, Al-Betar MA, Mirjalili S (2018) Grey wolf optimizer: a review of recent variants and applications. Neural Comput Appl 30(2):413–435
Sahoo G et al (2017) A two-step artificial bee colony algorithm for clustering. Neural Comput Appl 28(3):537–551
Abasi AK, Khader AT, Al-Betar MA, Naim S, Alyasseri ZAA, Makhadmeh SN (2020) A novel hybrid multi-verse optimizer with k-means for text documents clustering. Neural Computing and Applications
Aljarah I, Mafarja M, Heidari AA, Faris H, Mirjalili S (2020) Clustering analysis using a novel locality-informed grey wolf-inspired clustering approach. Knowl Inf Syst 62(2):507–539
Apipie FM, Georgescu V (2019) Assessing and comparing by specific metrics the performance of 15 multiobjective optimization metaheuristics when solving the portfolio optimization problem. Econ Comput Econ Cybern Stud Res 53(3):39–58
Agharezaei S, Falamarzi M (2019) Particle swarm optimization algorithm for the prepack optimization problem. Econ Comput Econ Cybern Stud Res 53(2):289–307
Shouheng T, Hong H (2019) Deaf-mops/d: an improved differential evolution algorithm for solving complex multi-objective portfolio selection problems based on decomposition. Econ Comput Econ Cybernet Stud Res 53(3):151–167
Chakraborty A, Kar AK (2017) Swarm intelligence: a review of algorithms. In: Nature-Inspired Computing and Optimization. Springer, Cham, pp 475–494
Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of ICNN’95-International Conference on Neural Networks, IEEE, vol 4, pp 1942–1948
Zhang ZG, Zhang JXX (2019) A novel job-shop scheduling strategy based on particle swarm optimization and neural network. Int J Simulat Modell 18(4):699–707
Pan WT (2011) Fruit fly optimization algorithm. Tsang Hai Book Publishing Co, Taipei, pp 10–12
Hu H, Lei W, Gao X, Zhang Y (2018) Job-shop scheduling problem based on improved cuckoo search algorithm. Int J Simulat Modell 17(2):337–346
Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw 69:46–61
Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67
Shen L, Chen H, Yu Z, Kang W, Zhang B, Li H, Yang B, Liu D (2016) Evolving support vector machines using fruit fly optimization for medical data classification. Knowl Based Syst 96:61–75
Li HZ, Guo S, Li CJ, Sun JQ (2013) A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowl Based Syst 37:378–387
Pan WT (2012) A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowl Based Syst 26:69–74
Wang L, Xl Zheng, Sy Wang (2013) A novel binary fruit fly optimization algorithm for solving the multidimensional knapsack problem. Knowl Based Syst 48:17–23
Pan QK, Sang HY, Duan JH, Gao L (2014) An improved fruit fly optimization algorithm for continuous function optimization problems. Knowl Based Syst 62:69–83
Eberhart R, Kennedy J (1995) Particle swarm optimization. In: Proceedings of the IEEE international conference on neural networks, Citeseer 4:1942–1948
Pan WT (2013) Using modified fruit fly optimisation algorithm to perform the function test and case studies. Connect Sci 25(2–3):151–160
Wagstaff K, Cardie C, Rogers S, Schrödl S et al (2001) Constrained k-means clustering with background knowledge. Icml 1:577–584
Amigó E, Gonzalo J, Artiles J, Verdejo F (2009) A comparison of extrinsic clustering evaluation metrics based on formal constraints. Inf Retriev 12(4):461–486
Acknowledgements
The work was supported by the National Science Foundation of China under Grant 61472049, 61572225 and 61202309; the key scientific research projects of colleges and universities of Henan Province (No. 21A520012), the Jilin province social science fund project (No. 2019B69), the 2018 Jilin province higher education teaching reform research project, and the 2018 Jilin university of finance and economics key project.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Ruihong Zhou and Qiaoming Liu have contributed equally to this work.
Rights and permissions
About this article
Cite this article
Zhou, R., Liu, Q., Wang, J. et al. Modified semi-supervised affinity propagation clustering with fuzzy density fruit fly optimization. Neural Comput & Applic 33, 4695–4712 (2021). https://doi.org/10.1007/s00521-020-05431-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-05431-3