Summarization of scientific documents by detecting common facts in citations

被引:24
|
作者
Chen, Jingqiang
Zhuge, Hai [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing 210003, Jiangsu, Peoples R China
关键词
Summarization; Semantic link network; Natural language processing;
D O I
10.1016/j.future.2013.07.018
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reading scientific articles is more time-consuming than reading news because readers need to search and read many citations. This paper proposes a citation guided method for summarizing multiple scientific papers. A phenomenon we can observe is that citation sentences in one paragraph or section usually talk about a common fact, which is usually represented as a set of noun phrases co-occurring in citation texts and it is usually discussed from different aspects. We design a multi-document summarization system based on common fact detection. One challenge is that citations may not use the same terms to refer to a common fact. We thus use term association discovering algorithm to expand terms based on a large set of scientific article abstracts. Then, citations can be clustered based on common facts. The common fact is used as a salient term set to get relevant sentences from the corresponding cited articles to form a summary. Experiments show that our method outperforms three baseline methods by ROUGE metric. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:246 / 252
页数:7
相关论文
共 50 条
  • [1] A Summarization System for Scientific Documents
    Erera, Shai
    Shmueli-Scheuer, Michal
    Feigenblat, Guy
    Nakash, Ora Peled
    Boni, Odellia
    Roitman, Haggai
    Cohen, Doron
    Weiner, Bar
    Mass, Yosi
    Rivlin, Or
    Lev, Guy
    Jerbi, Achiya
    Herzig, Jonathan
    Hou, Yufang
    Jochim, Charles
    Gleize, Martin
    Bonin, Francesca
    Ganguly, Debasis
    Konopnicki, David
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2019, : 211 - 216
  • [2] SciBERTSUM: Extractive Summarization for Scientific Documents
    Sefid, Athar
    Giles, C. Lee
    [J]. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 688 - 701
  • [3] TLDR: Extreme Summarization of Scientific Documents
    Cachola, Isabel
    Lo, Kyle
    Cohan, Arman
    Weld, Daniel S.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4766 - 4777
  • [4] SATS: simplification aware text summarization of scientific documents
    Zaman, Farooq
    Kamiran, Faisal
    Shardlow, Matthew
    Saeed-Ul Hassan, Saeed-Ul
    Karim, Asim
    Aljohani, Naif Radi
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [5] Contextualizing Citations for Scientific Summarization using Word Embeddings and Domain Knowledge
    Cohan, Arman
    Goharian, Nazli
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1133 - 1136
  • [6] Detecting and analyzing missing citations to published scientific entities
    Jialiang Lin
    Yao Yu
    Jiaxin Song
    Xiaodong Shi
    [J]. Scientometrics, 2022, 127 : 2395 - 2412
  • [7] Detecting and analyzing missing citations to published scientific entities
    Lin, Jialiang
    Yu, Yao
    Song, Jiaxin
    Shi, Xiaodong
    [J]. SCIENTOMETRICS, 2022, 127 (05) : 2395 - 2412
  • [8] Discourse-Aware Unsupervised Summarization of Long Scientific Documents
    Dong, Yue
    Mircea, Andrei
    Cheung, Jackie C. K.
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1089 - 1102
  • [9] Detecting data and schema changes in scientific documents
    Adam, N
    Adiwijaya, I
    Critchlow, T
    Musick, R
    [J]. IEEE ADVANCES IN DIGITAL LIBRARIES 2000, PROCEEDINGS, 2000, : 160 - 170
  • [10] Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents
    Meng, Rui
    Thaker, Khushboo
    Zhang, Lei
    Dong, Yue
    Yuan, Xingdi
    Wang, Tong
    He, Daqing
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 1080 - 1089