Discourse-Aware Unsupervised Summarization of Long Scientific Documents

被引:0
|
作者
Dong, Yue [1 ]
Mircea, Andrei [1 ]
Cheung, Jackie C. K. [1 ]
机构
[1] McGill Univ, MILA, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an unsupervised graph-based ranking model for extractive summarization of long scientific documents. Our method assumes a two-level hierarchical graph representation of the source document, and exploits asymmetrical positional cues to determine sentence importance. Results on the PubMed and arXiv datasets show that our approach(1) outperforms strong unsupervised baselines by wide margins in automatic metrics and human evaluation. In addition, it achieves performance comparable to many state-of-the-art supervised approaches which are trained on hundreds of thousands of examples. These results suggest that patterns in the discourse structure are a strong signal for determining importance in scientific articles.
引用
收藏
页码:1089 / 1102
页数:14
相关论文
共 50 条
  • [1] DBERT-ELVA: Discourse-Aware Extractive Text Summarization with Autoencoder
    Kia, Mahsa Abazari
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 317 - 324
  • [2] Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization
    Feng, Xiachong
    Feng, Xiaocheng
    Qin, Bing
    Geng, Xinwei
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3808 - 3814
  • [3] Unsupervised Learning of Discourse-Aware Text Representation for Essay Scoring
    Mim, Farjana Sultana
    Inoue, Naoya
    Reisert, Paul
    Ouchi, Hiroki
    Inui, Kentaro
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 378 - 385
  • [4] SATS: simplification aware text summarization of scientific documents
    Zaman, Farooq
    Kamiran, Faisal
    Shardlow, Matthew
    Saeed-Ul Hassan, Saeed-Ul
    Karim, Asim
    Aljohani, Naif Radi
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [5] Probabilistic Modeling of Discourse-Aware Sentence Processing
    Dubey, Amit
    Keller, Frank
    Sturt, Patrick
    [J]. TOPICS IN COGNITIVE SCIENCE, 2013, 5 (03) : 425 - 451
  • [6] Automated Essay Scoring with Discourse-Aware Neural Models
    Nadeem, Farah
    Huy Nguyen
    Liu, Yang
    Ostendorf, Mari
    [J]. INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, 2019, : 484 - 493
  • [7] Discourse-Aware Graph Networks for Textual Logical Reasoning
    Huang, Yinya
    Liu, Lemao
    Xu, Kun
    Fang, Meng
    Lin, Liang
    Liang, Xiaodan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11668 - 11688
  • [8] Psychiatric document retrieval using a discourse-aware model
    Yu, Liang-Chih
    Wu, Chung-Hsien
    Jang, Fong-Lin
    [J]. ARTIFICIAL INTELLIGENCE, 2009, 173 (7-8) : 817 - 829
  • [9] DAGN: Discourse-Aware Graph Network for Logical Reasoning
    Huang, Yinya
    Fang, Meng
    Cao, Yu
    Wang, Liwei
    Liang, Xiaodan
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5848 - 5855
  • [10] A Summarization System for Scientific Documents
    Erera, Shai
    Shmueli-Scheuer, Michal
    Feigenblat, Guy
    Nakash, Ora Peled
    Boni, Odellia
    Roitman, Haggai
    Cohen, Doron
    Weiner, Bar
    Mass, Yosi
    Rivlin, Or
    Lev, Guy
    Jerbi, Achiya
    Herzig, Jonathan
    Hou, Yufang
    Jochim, Charles
    Gleize, Martin
    Bonin, Francesca
    Ganguly, Debasis
    Konopnicki, David
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2019, : 211 - 216