Figure-Associated Text Summarization and Evaluation

被引:19
|
作者
Ramesh, Balaji Polepalli [1 ]
Sethi, Ricky J. [1 ]
Yu, Hong [1 ,2 ,3 ]
机构
[1] Univ Massachusetts, Sch Med, Dept Quantitat Hlth Sci, Worcester, MA 01655 USA
[2] Univ Massachusetts, Sch Comp Sci, Amherst, MA 01003 USA
[3] VA Cent Western Massachusetts, Leeds, MA USA
来源
PLOS ONE | 2015年 / 10卷 / 02期
基金
美国国家卫生研究院;
关键词
BIOMEDICAL LITERATURE; FULL-TEXT; RETRIEVAL; DATABASE; IMAGES;
D O I
10.1371/journal.pone.0115671
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Evaluation of Query-Based Arabic Text Summarization System
    El-Haj, Mahmoud O.
    Hammo, Bassam H.
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 88 - 94
  • [22] Automatic Evaluation of Text Summarization Based on Semantic Link Network
    Cao, Mengyun
    Hai Zhuge
    2019 15TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG 2019), 2019, : 107 - 114
  • [23] A new evaluation measure using compression dissimilarity on text summarization
    Wang, Tong
    Chen, Ping
    Simovici, Dan
    APPLIED INTELLIGENCE, 2016, 45 (01) : 127 - 134
  • [24] Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation
    Xie, Yuexiang
    Sun, Fei
    Deng, Yang
    Li, Yaliang
    Ding, Bolin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 100 - 110
  • [25] A Framework for Word Embedding Based Automatic Text Summarization and Evaluation
    Hailu, Tulu Tilahun
    Yu, Junqing
    Fantaye, Tessfu Geteye
    INFORMATION, 2020, 11 (02)
  • [26] A Data Set for the Analysis of Text Quality Dimensions in Summarization Evaluation
    Mieskes, Margot
    Mencia, Eneldo Loza
    Kronsbein, Tim
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6690 - 6699
  • [27] Rethinking Efficient Multilingual Text Summarization Meta-Evaluation
    Han, Rilyn R.
    Chen, Jiawen
    Liu, Yixin
    Cohan, Arman
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 15739 - 15746
  • [28] A new evaluation measure using compression dissimilarity on text summarization
    Tong Wang
    Ping Chen
    Dan Simovici
    Applied Intelligence, 2016, 45 : 127 - 134
  • [29] A Semantic QA-Based Approach for Text Summarization Evaluation
    Chen, Ping
    Wu, Fei
    Wang, Tong
    Ding, Wei
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4800 - 4807
  • [30] Legal Case Summarization: An Application for Text Summarization
    Agrawal, Kanika
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 363 - 368