Weakly-Supervised Hierarchical Text Classification

被引:0
|
作者
Meng, Yu [1 ]
Shen, Jiaming [1 ]
Zhang, Chao [1 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical text classification, which aims to classify text documents into a given hierarchy, is an important task in many real-world applications. Recently, deep neural models are gaining increasing popularity for text classification due to their expressive power and minimum requirement for feature engineering. However, applying deep neural networks for hierarchical text classification remains challenging, because they heavily rely on a large amount of training data and meanwhile cannot easily determine appropriate levels of documents in the hierarchical setting. In this paper, we propose a weakly-supervised neural method for hierarchical text classification. Our method does not require a large amount of training data but requires only easy-to-provide weak supervision signals such as a few class-related documents or keywords. Our method effectively leverages such weak supervision signals to generate pseudo documents for model pre-training, and then performs self-training on real unlabeled data to iteratively refine the model. During the training process, our model features a hierarchical neural structure, which mimics the given hierarchy and is capable of determining the proper levels for documents with a blocking mechanism. Experiments on three datasets from different domains demonstrate the efficacy of our method compared with a comprehensive set of baselines.
引用
收藏
页码:6826 / 6833
页数:8
相关论文
共 50 条
  • [1] Weakly-Supervised Neural Text Classification
    Meng, Yu
    Shen, Jiaming
    Zhang, Chao
    Han, Jiawei
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 983 - 992
  • [2] Weakly-supervised Text Classification Based on Keyword Graph
    Zhang, Lu
    Ding, Jiandong
    Xu, Yi
    Liu, Yingyao
    Zhou, Shuigeng
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2803 - 2813
  • [3] Weakly-Supervised Text Instance Segmentation
    Zu, Xinyan
    Yu, Haiyang
    Li, Bin
    Xue, Xiangyang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1915 - 1923
  • [4] Weakly-Supervised Alignment of Video With Text
    Bojanowski, P.
    Lajugie, R.
    Grave, E.
    Bach, F.
    Laptev, I.
    Ponce, J.
    Schmid, C.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4462 - 4470
  • [5] WeStcoin: Weakly-Supervised Contextualized Text Classification with Imbalance and Noisy Labels
    Zhang, Yupei
    Zhou, Yaya
    Liu, Shuhui
    Zhang, Wenxin
    Xiao, Min
    Shang, Xuequn
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2451 - 2457
  • [6] Seed Word Selection for Weakly-Supervised Text Classification with Unsupervised Error Estimation
    Jin, Yiping
    Bhatia, Akshay
    Wanvarie, Dittaya
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 112 - 118
  • [7] Weakly-supervised character-level convolutional neural networks for text classification
    Liu, Yongsheng
    Chen, Wenyu
    Niyongabo, Rubungo Andre
    Qu, Hong
    Miao, Kebin
    Wei, Feng
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 701 - 708
  • [8] Weakly-supervised Joint Anomaly Detection and Classification
    Majhi, Snehashis
    Das, Srijan
    Bremond, Francois
    Dash, Ratnakar
    Sa, Pankaj Kumar
    [J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [9] Efficient Path Prediction for Semi-Supervised and Weakly Supervised Hierarchical Text Classification
    Xiao, Huiru
    Liu, Xin
    Song, Yangqiu
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3370 - 3376
  • [10] Explainable Disease Classification via Weakly-Supervised Segmentation
    Joshi, Aniket
    Mishra, Gaurav
    Sivaswamy, Jayanthi
    [J]. INTERPRETABLE AND ANNOTATION-EFFICIENT LEARNING FOR MEDICAL IMAGE COMPUTING, IMIMIC 2020, MIL3ID 2020, LABELS 2020, 2020, 12446 : 54 - 62