Experiments with hierarchical text classification

Cited by: 0
Authors
Granitzer, M. [1]
Auer, P. [1]
Affiliations
[1] Know Ctr, Div Knowledge Discovery, A-8010 Graz, Austria
Keywords
machine learning; supervised learning; hierarchical text classification; boosting; ranking performance
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper applies Boosting to hierarchical text classification, where the hierarchical structure is given as a directed acyclic graph, and compares the results to Support Vector Machines. Hierarchical classification is performed top-down, and in each node a flat classifier decides whether a document should be propagated further. BoosTexter, CentroidBooster and Support Vector Machines are used as flat classifiers, where CentroidBooster is an AdaBoost.MH-based alternative similar to BoosTexter. Experiments on the Reuters Corpus Volume 1 and the OHSUMED data set show that the F1-measure increases if the hierarchical structure of a data set is taken into account. Regarding time complexity, we show that, depending on the structure of a hierarchy, learning and classification time can be reduced. Besides these hard classification approaches, we also investigate the ranking performance of hierarchical classifiers. Ranking, which can be achieved by providing a meaningful score for each classification decision, is important in most practical settings. We investigate an approach that uses a sigmoid function to calculate a meaningful score, where parameter estimation is based on error bounds from computational learning theory.
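The top-down scheme in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the keyword-based stand-in classifiers, the toy hierarchy and the sigmoid parameters `a` and `b` are all assumptions made for illustration. In particular, the paper estimates its sigmoid parameters from error bounds, which the sketch does not attempt.

```python
import math
from typing import Callable, Dict, List, Set

# Hypothetical types: a document is a set of words, and each node owns a
# flat classifier returning a real-valued margin (positive means "accept").
Document = Set[str]
NodeClassifier = Callable[[Document], float]


def sigmoid_score(margin: float, a: float = 1.0, b: float = 0.0) -> float:
    """Map a raw margin to a score in (0, 1).

    a and b are placeholder values, not the bound-based estimates
    described in the abstract.
    """
    return 1.0 / (1.0 + math.exp(-(a * margin + b)))


def classify_top_down(
    doc: Document,
    roots: List[str],
    children: Dict[str, List[str]],
    classifiers: Dict[str, NodeClassifier],
    threshold: float = 0.5,
) -> Dict[str, float]:
    """Propagate doc from the roots downward through a DAG of categories.

    A node hands the document to its children only if its own score
    reaches the threshold. Returns the accepted nodes with their scores.
    """
    scores: Dict[str, float] = {}
    stack = list(roots)
    while stack:
        node = stack.pop()
        if node in scores:
            # In a DAG a node can be reached via several parents;
            # evaluate its classifier only once.
            continue
        scores[node] = sigmoid_score(classifiers[node](doc))
        if scores[node] >= threshold:
            stack.extend(children.get(node, []))
    return {n: s for n, s in scores.items() if s >= threshold}


def keyword_classifier(keyword: str) -> NodeClassifier:
    # Toy stand-in for BoosTexter, CentroidBooster or an SVM:
    # margin +1 if the keyword occurs in the document, -1 otherwise.
    return lambda doc: 1.0 if keyword in doc else -1.0


# Toy hierarchy in which "trade" has two parents, so it is a DAG, not a tree.
children = {
    "economy": ["markets", "trade"],
    "sports": ["trade"],
    "markets": [],
    "trade": [],
}
classifiers = {name: keyword_classifier(name) for name in children}

accepted = classify_top_down(
    {"economy", "trade"}, ["economy", "sports"], children, classifiers
)
# Sorting the accepted categories by score gives a simple ranking.
ranking = sorted(accepted, key=accepted.get, reverse=True)
```

Because a rejected node stops propagation, the classifiers below it are never evaluated for that document. This is one way in which the shape of the hierarchy can reduce classification time.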
Pages: 177-182
Number of pages: 6