An Efficient Framework by Topic Model for Multi-label Text Classification

Cited by: 0
Authors
Sun, Wei [1]
Ran, Xiangying [1]
Luo, Xiangyang [1]
Wang, Chongjun [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Dept Comp Sci & Technol, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multi-label text classification; topic model; label correlations;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most existing multi-label text classification (MLTC) approaches exploit label correlations only through label pairs or label chains. In practice, however, the features of instances also matter for classification. In this paper, we propose a simple but efficient framework for MLTC called Hybrid Latent Dirichlet Allocation Multi-Label (HLDAML). Specifically, a topic model is applied to the training data to obtain the topics of text features (i.e., a concrete description of documents) and the topics of label sets (i.e., a summarization of documents) before the multi-label classifiers are built. These hybrid topics can then be used in existing approaches to improve MLTC performance. Experiments on several benchmark datasets demonstrate that the proposed framework is general and effective when text features and label sets are considered simultaneously. We also construct a new multi-label dataset, called Parkinson, on diagnosing Parkinson's disease with Traditional Chinese Medicine.
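The topic-extraction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes scikit-learn's `LatentDirichletAllocation` as the topic model, and the toy documents, the label matrix `Y`, and all variable names are hypothetical. The abstract does not say how label-set topics are obtained for unseen documents, so the sketch covers training data only.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Hypothetical toy training documents (not from the paper's datasets).
docs = [
    "tremor and stiffness in the left hand",
    "poor sleep and constipation reported",
    "tremor with poor sleep",
    "stiffness and slow gait",
]

# Hypothetical binary label matrix: one row per document, one column per label.
Y = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [1, 1, 0],
    [1, 0, 1],
])

# Bag-of-words counts for the text features.
X_counts = CountVectorizer().fit_transform(docs)

# Topics of text features: each document becomes a distribution over topics.
text_lda = LatentDirichletAllocation(n_components=2, random_state=0)
text_topics = text_lda.fit_transform(X_counts)

# Topics of label sets: each label set is treated as a tiny "document"
# whose "words" are the labels it contains.
label_lda = LatentDirichletAllocation(n_components=2, random_state=0)
label_topics = label_lda.fit_transform(Y)

# Hybrid topics: both topic representations side by side, one row per document.
hybrid_topics = np.hstack([text_topics, label_topics])
print(hybrid_topics.shape)
```

In this sketch, `hybrid_topics` could be appended to the original features before training any existing multi-label classifier; the number of topics (2) is an arbitrary choice for the toy data.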
Pages: 7