Transformer-based highlights extraction from scientific papers

被引:6
|
作者
La Quatra, Moreno [1 ]
Cagliero, Luca [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, Corso Duca Abruzzi 24, I-10129 Turin, Italy
关键词
Highlights extraction; Transformer model; Extractive summarization;
D O I
10.1016/j.knosys.2022.109382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Highlights are short sentences used to annotate scientific papers. They complement the abstract content by conveying the main result findings. To automate the process of paper annotation, highlights extraction aims at extracting from 3 to 5 paper sentences via supervised learning. Existing approaches rely on ad hoc linguistic features, which depend on the analyzed context, and apply recurrent neural networks, which are not effective in learning long-range text dependencies. This paper leverages the attention mechanism adopted in transformer models to improve the accuracy of sentence relevance estimation. Unlike existing approaches, it relies on the end-to-end training of a deep regression model. To attend patterns relevant to highlights content it also enriches sentence encodings with a section-level contextualization. The experimental results, achieved on three different benchmark datasets, show that the designed architecture is able to achieve significant performance improvements compared to the state-of-the-art. (c) 2022 Published by Elsevier B.V.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Comparative Analysis of Community Detection and Transformer-Based Approaches for Topic Clustering of Scientific Papers
    Bretsko, Daniel
    Belyi, Alexander
    Sobolevsky, Stanislav
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2023, PT I, 2023, 13956 : 648 - 660
  • [2] Transformer-based Extraction of Deep Image Models
    Battis, Verena
    Penner, Alexander
    [J]. 2022 IEEE 7TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2022), 2022, : 320 - 336
  • [3] Influence of Context in Transformer-Based Medication Relation Extraction
    Modersohn, Luise
    Hahn, Udo
    [J]. MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 669 - 673
  • [4] Transformer-Based Models for the Automatic Indexing of Scientific Documents in French
    Angel Gonzalez, Jose
    Buscaldi, Davide
    Sanchis, Emilio
    Hurtado, Lluis-F
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 60 - 72
  • [5] A Rule-based Framework of Metadata Extraction from Scientific Papers
    Guo, Zhixin
    Jin, Hai
    [J]. 2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 400 - 404
  • [6] Scientific Data Extraction from Oceanographic Papers
    Veyhe, Bartal Eyofnsson
    Sagi, Tomer
    Hose, Katja
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 800 - 804
  • [7] Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across Domains
    Lang, Christian
    Wachowiak, Lennart
    Heinisch, Barbara
    Gromann, Dagmar
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3607 - 3620
  • [8] Extensive evaluation of transformer-based architectures for adverse drug events extraction
    Scaboro, Simone
    Portelli, Beatrice
    Chersoni, Emmanuele
    Santus, Enrico
    Serra, Giuseppe
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 275
  • [9] A visual transformer-based smart textual extraction method for financial invoices
    Wang, Tao
    Qiu, Min
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (10) : 18630 - 18649
  • [10] Extraction of Substance Use Information From Clinical Notes:Generative Pretrained Transformer-Based Investigation
    Shah-Mohammadi, Fatemeh
    Finkelstein, Joseph
    [J]. JMIR MEDICAL INFORMATICS, 2024, 12