Abstract
Feature selection for multilabel data is a challenging and meaningful task. Multilabel data contain richer information, which can help mine knowledge and support decision-making in various real-life applications. However, multilabel data are also harder to handle, because the relations between labels and features must be considered simultaneously. In addition, missing labels and noise may be present in multilabel data and can affect the feature selection process. To address these problems, this study constructs a robust feature selection approach under the sparse learning framework, based on the least squares regression model. First, a novel objective function is built that accounts for both the robustness of the method and the manifold information. Nonnegative matrix factorization (NMF) is used to compress the label matrix, reducing false label information that could mislead the feature selection process. The l2,1-norm is adopted to constrain the least squares regression term. Manifold regularizers are used to construct low-dimensional manifold embeddings of the original feature and label spaces, retaining the local manifold structure of the data. Furthermore, the correlations between features and labels are explored, and an improved weight matrix is designed. Then, an iterative algorithm is proposed to solve the objective function. Extensive experiments compare the proposed approach with state-of-the-art algorithms on public multilabel datasets, and the results verify its effectiveness.
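To make the sparse learning idea in the abstract concrete, the following is a minimal sketch of l2,1-regularized least squares regression used for feature ranking. It is not the objective function of this paper: the NMF label compression, the manifold regularizers, and the improved weight matrix are all omitted, and a plain squared (Frobenius) loss is assumed instead of an l2,1-norm loss. The function names, the regularization weight `lam`, the iteratively reweighted solver, and the synthetic data are assumptions introduced only for illustration.

```python
import numpy as np


def l21_norm(W):
    """l2,1-norm: the sum of the Euclidean norms of the rows of W."""
    return float(np.sum(np.linalg.norm(W, axis=1)))


def sparse_ls_feature_scores(X, Y, lam=1.0, n_iter=50, eps=1e-8):
    """Approximately minimise ||XW - Y||_F^2 + lam * ||W||_{2,1}.

    Uses iteratively reweighted least squares: with D fixed, W has the
    closed form (X^T X + lam * D)^{-1} X^T Y, and D is then refreshed as
    diag(1 / (2 * ||w_i||_2)). Returns W and one score per feature
    (the Euclidean norm of the corresponding row of W).
    """
    n_features = X.shape[1]
    D = np.eye(n_features)
    XtX = X.T @ X
    XtY = X.T @ Y
    for _ in range(n_iter):
        W = np.linalg.solve(XtX + lam * D, XtY)
        row_norms = np.linalg.norm(W, axis=1)
        # eps avoids division by zero when a row shrinks to zero
        D = np.diag(1.0 / (2.0 * row_norms + eps))
    return W, np.linalg.norm(W, axis=1)


# Hypothetical synthetic data: 500 samples, 10 features, 3 binary labels.
# Only features 0 and 3 influence the labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))
W_true = np.zeros((10, 3))
W_true[0] = [2.0, -1.0, 1.5]
W_true[3] = [-1.5, 2.0, 1.0]
Y = (X @ W_true > 0).astype(float)

# Centre both matrices so that no intercept term is needed.
Xc = X - X.mean(axis=0)
Yc = Y - Y.mean(axis=0)

W, scores = sparse_ls_feature_scores(Xc, Yc, lam=1.0)
top2 = sorted(int(i) for i in np.argsort(scores)[-2:])
print("feature scores:", np.round(scores, 3))
print("two highest-scoring features:", top2)
```

Because the l2,1-norm penalizes whole rows of the weight matrix, a feature is either kept or suppressed jointly for all labels, which is why the row norms can serve as feature scores in sketches of this kind.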
Acknowledgements
This work is supported by the National Natural Science Foundation of China (Nos. 61976182, 62076171, 61876157, 61976245), Key program for International S&T Cooperation of Sichuan Province (2019YFH0097), and Sichuan Key R&D project (2020YFG0035).
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Liu, Y., Chen, H., Li, T. et al. A robust graph based multi-label feature selection considering feature-label dependency. Appl Intell 53, 837–863 (2023). https://doi.org/10.1007/s10489-022-03425-3
DOI: https://doi.org/10.1007/s10489-022-03425-3