Heterogeneous information integration in hierarchical text classification

被引:0
|
作者
Yang, Huai-Yuan
Liu, Tie-Yan
Gao, Li
Ma, Wei-Ying
机构
[1] Microsoft Res Asia, Sigma Ctr 5F, Beijing 100080, Peoples R China
[2] Peking Univ, Sch Math Sci, Dept Sci & Engn Comp, Beijing 100871, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous work has shown that considering the category distance in the taxonomy tree can improve the performance of text classifiers. In this paper, we propose a new approach to further integrate more categorical information in the text corpus using the principle of multi-objective programming (MOP). That is, we not only consider the distance between categories defined by the branching of the taxonomy tree, but also consider the similarity between categories defined by the document/term distributions in the feature space. Consequently, we get a refined category distance by using MOP to leverage these two kinds of information. Experiments on both synthetic and real-world datasets demonstrated the effectiveness of the proposed algorithm in hierarchical text classification.
引用
收藏
页码:240 / 249
页数:10
相关论文
共 50 条
  • [1] Text Classification with Heterogeneous Information Network Kernels
    Wang, Chenguang
    Song, Yangqiu
    Li, Haoran
    Zhang, Ming
    Han, Jiawei
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2130 - 2136
  • [2] Integration of global and local information for text classification
    Xianghua Li
    Xinyu Wu
    Zheng Luo
    Zhanwei Du
    Zhen Wang
    Chao Gao
    [J]. Neural Computing and Applications, 2023, 35 : 2471 - 2486
  • [3] Integration of global and local information for text classification
    Li, Xianghua
    Wu, Xinyu
    Luo, Zheng
    Du, Zhanwei
    Wang, Zhen
    Gao, Chao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2471 - 2486
  • [4] Hierarchical Heterogeneous Graph Representation Learning for Short Text Classification
    Wang, Yaqing
    Wang, Song
    Yao, Quanming
    Dou, Dejing
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3091 - 3101
  • [5] Short Text Classification Based on Hierarchical Heterogeneous Graph and LDA Fusion
    Xu, Xinlan
    Li, Bo
    Shen, Yuhao
    Luo, Bing
    Zhang, Chao
    Hao, Fei
    [J]. ELECTRONICS, 2023, 12 (12)
  • [6] Hierarchical text classification
    Pulijala, AK
    Gauch, S
    [J]. ISAS/CITSA 2004: International Conference on Cybernetics and Information Technologies, Systems and Applications and 10th International Conference on Information Systems Analysis and Synthesis, Vol 1, Proceedings: COMMUNICATIONS, INFORMATION TECHNOLOGIES AND COMPUTING, 2004, : 257 - 262
  • [7] Bayesian Hierarchical Modeling and the Integration of Heterogeneous Information on the Effectiveness of Cardiovascular Therapies
    Kwok, Heemun
    Lewis, Roger J.
    [J]. CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2011, 4 (06): : 657 - 666
  • [8] Utilizing global and path information with language modelling for hierarchical text classification
    Oh, Heung-Seon
    Myaeng, Sung-Hyon
    [J]. JOURNAL OF INFORMATION SCIENCE, 2014, 40 (02) : 127 - 145
  • [9] HTCInfoMax: A Global Model for Hierarchical Text Classification via Information Maximization
    Deng, Zhongfen
    Peng, Hao
    He, Dongxiao
    Li, Jianxin
    Yu, Philip S.
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3259 - 3265
  • [10] Text classification on heterogeneous information network via enhanced GCN and knowledge
    Hui Li
    Yan Yan
    Shuo Wang
    Juan Liu
    Yunpeng Cui
    [J]. Neural Computing and Applications, 2023, 35 : 14911 - 14927