HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization

被引:0
|
作者
Deng, Zhongfen [1 ]
Peng, Hao [2 ,3 ]
He, Dongxiao [4 ]
Li, Jianxin [2 ]
Yu, Philip S. [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[2] Beihang Univ, BDBC, Beijing, Peoples R China
[3] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China
[4] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current state-of-the-art model HiAGM for hierarchical text classification has two limitations. First, it correlates each text sample with all labels in the dataset which contains irrelevant information. Second, it does not consider any statistical constraint on the label representations learned by the structure encoder, while constraints for representation learning are proved to be helpful in previous work. In this paper, we propose HTCInfoMax to address these issues by introducing information maximization which includes two modules: text-label mutual information maximization and label prior matching. The first module can model the interaction between each text sample and its ground truth labels explicitly which filters out irrelevant information. The second one encourages the structure encoder to learn better representations with desired characteristics for all labels which can better handle label imbalance in hierarchical text classification. Experimental results on two benchmark datasets demonstrate the effectiveness of the proposed HTCInfoMax.
引用
收藏
页码:3259 / 3265
页数:7
相关论文
共 50 条
  • [1] Utilizing global and path information with language modelling for hierarchical text classification
    Oh, Heung-Seon
    Myaeng, Sung-Hyon
    [J]. JOURNAL OF INFORMATION SCIENCE, 2014, 40 (02) : 127 - 145
  • [2] Hierarchy-Aware Global Model for Hierarchical Text Classification
    Zhou, Jie
    Ma, Chunping
    Long, Dingkun
    Xu, Guangwei
    Ding, Ning
    Zhang, Haoyu
    Xie, Pengjun
    Liu, Gongshen
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1106 - 1117
  • [3] Feature selection via maximizing global information gain for text classification
    Shang, Changxing
    Li, Min
    Feng, Shengzhong
    Jiang, Qingshan
    Fan, Jianping
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 54 : 298 - 309
  • [4] Heterogeneous information integration in hierarchical text classification
    Yang, Huai-Yuan
    Liu, Tie-Yan
    Gao, Li
    Ma, Wei-Ying
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 240 - 249
  • [5] HScodeNet: Combining Hierarchical Sequential and Global Spatial Information of Text for Commodity HS Code Classification
    Du, Shaohua
    Wu, Zhihao
    Wan, Huaiyu
    Lin, YouFang
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 676 - 689
  • [6] Integration of global and local information for text classification
    Xianghua Li
    Xinyu Wu
    Zheng Luo
    Zhanwei Du
    Zhen Wang
    Chao Gao
    [J]. Neural Computing and Applications, 2023, 35 : 2471 - 2486
  • [7] Integration of global and local information for text classification
    Li, Xianghua
    Wu, Xinyu
    Luo, Zheng
    Du, Zhanwei
    Wang, Zhen
    Gao, Chao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2471 - 2486
  • [8] Hierarchical Multilabel Text Classification via Multitask Learning
    Yu, Yipeng
    Sun, Zixun
    Sun, Chi
    Liu, Wenqiang
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1138 - 1143
  • [9] Image Retrieval with Text Feedback by Deep Hierarchical Attention Mutual Information Maximization
    Gu, Chunbin
    Bu, Jiajun
    Zhang, Zhen
    Yu, Zhi
    Ma, Dongfang
    Wang, Wei
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4600 - 4609
  • [10] Hierarchical text classification
    Pulijala, AK
    Gauch, S
    [J]. ISAS/CITSA 2004: International Conference on Cybernetics and Information Technologies, Systems and Applications and 10th International Conference on Information Systems Analysis and Synthesis, Vol 1, Proceedings: COMMUNICATIONS, INFORMATION TECHNOLOGIES AND COMPUTING, 2004, : 257 - 262