Label informed hierarchical transformers for sequential sentence classification in scientific abstracts

被引:1
|
作者
Takola, Yaswanth Sri Sai Santosh [1 ]
Aluru, Sai Saketh [2 ]
Vallabhajosyula, Anoop [3 ]
Sanyal, Debarshi Kumar [4 ,7 ]
Das, Partha Pratim [5 ,6 ]
机构
[1] Tech Univ Munich, Sch Computat Informat & Technol, Munich, Germany
[2] Shaw India Pvt Ltd, Hyderabad, India
[3] Arizona State Univ, Sch Comp & Augmented Intelligence, Tempe, AZ USA
[4] Indian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata, India
[5] Ashoka Univ, Dept Comp Sci, Sonipat, India
[6] Indian Inst Technol Kharagpur, Dept Comp Sci & Engn, Kharagpur, India
[7] Indian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, West Bengal, India
关键词
discourse segmentation; hierarchical transformers; scholarly data; scientific abstracts; sequential sentence classification;
D O I
10.1111/exsy.13238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmenting scientific abstracts into discourse categories like background, objective, method, result, and conclusion is useful in many downstream tasks like search, recommendation and summarization. This task of classifying each sentence in the abstract into one of a given set of discourse categories is called sequential sentence classification. Existing machine learning-based approaches to this problem consider the content of only the abstract to obtain the neural representation of each sentence, which is then labelled with a discourse category. But this ignores the semantic information offered by the discourse labels themselves. In this paper, we propose LIHT, Label Informed Hierarchical Transformers - a method for sequential sentence classification that explicitly and hierarchically exploits the semantic information in the labels to learn label-aware neural sentence representations. The hierarchical model helps to capture not only the fine-grained interactions between the discourse labels and the words in the abstract at the sentence level but also the potential dependencies that may exist in the label sequence. Thus, LIHT generates label-aware contextual sentence representations that are then labelled with a conditional random field. We evaluate LIHT on three publicly available datasets, namely, PUBMED-RCT, NICTA-PIBOSO and CSAbstract. The incremental gain in F1-score in all the three cases over the respective state-of-the-art approaches is around 1%$$ 1\% $$. Though the gains are modest, LIHT establishes a new performance benchmark for this task and is a novel technique of independent interest. We also perform an ablation study to identify the contribution of each component of LIHT in the observed performance, and a case study to visualize the roles of the different components of our model.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Hierarchical Neural Networks for Sequential Sentence Classification in Medical Scientific Abstracts
    Jin, Di
    Szolovits, Peter
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3100 - 3109
  • [2] A Deep Learning Approach for Sentence Classification of Scientific Abstracts
    Goncalves, Sergio
    Cortez, Paulo
    Moro, Sergio
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 479 - 488
  • [3] HIERARCHICAL TRANSFORMERS FOR LONG DOCUMENT CLASSIFICATION
    Pappagari, Raghavendra
    Zelasko, Piotr
    Villalba, Jesus
    Carmiel, Yishay
    Dehak, Najim
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 838 - 844
  • [4] Hierarchical Rhetorical Sentence Categorization for Scientific Papers
    Rachman, G. H.
    Khodra, M. L.
    Widyantoro, D. H.
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2017, 2018, 978
  • [5] Constructing Corpus of Scientific Abstracts Annotated with Sentence Roles
    Yamamoto, Takafumi
    Tomiura, Yoichi
    [J]. PROCEEDINGS 2016 5TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS IIAI-AAI 2016, 2016, : 159 - 162
  • [6] Pretrained Language Models for Sequential Sentence Classification
    Cohan, Arman
    Beltagy, Iz
    King, Daniel
    Dalvi, Bhavana
    Weld, Daniel S.
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3693 - 3699
  • [7] Hierarchical Label Generation for Text Classification
    Kwon, Jingun
    Kamigaito, Hidetaka
    Song, Young-In
    Okumura, Manabu
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 625 - 632
  • [8] Information Entropy-Informed Sentence Representation for Question Classification
    Gao, Jin
    Li, Miao
    Chen, Lei
    Du, Jinhua
    Ma, Rongqiang
    [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 79 - 82
  • [9] Hierarchical Label Text Classification Method with Deep Label Assisted Classification Task
    Yukun, Cao
    Ziyue, Wei
    Yijia, Tang
    Chengkun, Jin
    Yunfeng, Li
    [J]. Computer Engineering and Applications, 2024, 60 (10) : 105 - 112
  • [10] General Multi-label Image Classification with Transformers
    Lanchantin, Jack
    Wang, Tianlu
    Ordonez, Vicente
    Qi, Yanjun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16473 - 16483