FSKNN: Multi-label text categorization based on fuzzy similarity and k nearest neighbors

被引:59
|
作者
Jiang, Jung-Yi [1 ]
Tsai, Shian-Chi [1 ]
Lee, Shie-Jue [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
关键词
Document classification; Multi-label classification; Fuzzy similarity measure; k-nearest neighbor algorithm; Maximum a posteriori estimate; LEARNING APPROACH; KNN;
D O I
10.1016/j.eswa.2011.08.141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an efficient approach. FSKNN, which employs fuzzy similarity measure (FSM) and k nearest neighbors (KNN), for multi-label text classification. One of the problems associated with KNN-like approaches is its demanding computational cost in finding the k nearest neighbors from all the training patterns. For FSKNN, FSM is used to group the training patterns into clusters. Then only the training documents in those clusters whose fuzzy similarities to the document exceed a predesignated threshold are considered in finding the k nearest neighbors for the document. An unseen document is labeled based on its k nearest neighbors using the maximum a posteriori estimate. Experimental results show that our proposed method can work more effectively than other methods. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2813 / 2821
页数:9
相关论文
共 50 条
  • [41] TREEBOOST.MH: A boosting algorithm for multi-label hierarchical text categorization
    Esuli, Andrea
    Fagni, Tiziano
    Sebastiani, Fabrizio
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2006, 4209 : 13 - 24
  • [42] Multi-label learning with missing features and labels and its application to text categorization
    Hao, Xiuyan
    Huang, Jun
    Qin, Feng
    Zheng, Xiao
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
  • [43] Irrelevant attributes and imbalanced classes in multi-label text-categorization domains
    Dendamrongvit, Sareewan
    Vateekul, Peerapon
    Kubat, Miroslav
    INTELLIGENT DATA ANALYSIS, 2011, 15 (06) : 843 - 859
  • [44] Ensemble Application of Convolutional and Recurrent Neural Networks for Multi-label Text Categorization
    Chen, Guibin
    Ye, Deheng
    Xing, Zhenchang
    Chen, Jieshan
    Cambria, Erik
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2377 - 2383
  • [45] Automatic text categorization based on K-nearest neighbor
    Sun, J.
    Wang, W.
    Zhong, Y.-X.
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 2001, 24 (01): : 42 - 46
  • [46] Multi-Label Text Classification Based on DistilBERT and Label Correlation
    Wang, Xuyang
    Geng, Liuqing
    Zhang, Xin
    Computer Engineering and Applications, 2024, 60 (23) : 168 - 175
  • [47] Multi-label Feature Selection Based on Fuzzy Neighborhood Similarity Relations in Double Spaces
    Xu, Jiucheng
    Shen, Kaili
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (09): : 805 - 815
  • [48] A Locally Adaptive Multi-Label k-Nearest Neighbor Algorithm
    Wang, Dengbao
    Wang, Jingyuan
    Hu, Fei
    Li, Li
    Zhang, Xiuzhen
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 81 - 93
  • [49] A Coupled k-Nearest Neighbor Algorithm for Multi-label Classification
    Liu, Chunming
    Cao, Longbing
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 176 - 187
  • [50] NkEL: nearest k-labelsets ensemble for multi-label learning
    Zhong, Xi-Yan
    Zhang, Yu-Li
    Wang, Dan-Dong
    Min, Fan
    Applied Intelligence, 2025, 55 (01)