A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

被引:36
|
作者
Yoo, Illhoi [1 ]
Hu, Xiaohua [2 ]
Song, Il-Yeol [2 ]
机构
[1] Univ Missouri, Sch Med, Dept Hlth Management & Informat, Columbia, MO 65211 USA
[2] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19102 USA
关键词
MeSH; Betweenness Centrality; Document Cluster; Graph Cluster; Text Summarization;
D O I
10.1186/1471-2105-8-S9-S4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. Results: Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. Conclusion: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Graph-based extractive text summarization based on single document
    Yadav, Avaneesh Kumar
    Ranvijay, Rama Shankar
    Yadav, Rama Shankar
    Maurya, Ashish Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 18987 - 19013
  • [42] Enhancing Biomedical Scientific Reviews Summarization with Graph-based Factual Evidence Extracted from Papers
    Frisoni, Giacomo
    Italiani, Paolo
    Boschi, Francesco
    Moro, Gianluca
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 168 - 179
  • [43] A new graph-based extractive text summarization using keywords or topic modeling
    Ramesh Chandra Belwal
    Sawan Rai
    Atul Gupta
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 8975 - 8990
  • [44] MultiGBS: A multi-layer graph approach to biomedical summarization
    Davoodijam, Ensieh
    Ghadiri, Nasser
    Shahreza, Maryam Lotfi
    Rinaldi, Fabio
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 116
  • [45] A new graph-based extractive text summarization using keywords or topic modeling
    Belwal, Ramesh Chandra
    Rai, Sawan
    Gupta, Atul
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (10) : 8975 - 8990
  • [46] Deep Graph Search Based Disease Related Knowledge Summarization from Biomedical Literature
    Wu, Xiaofang
    Yang, Zhihao
    Li, Zhiheng
    Sun, Yuanyuan
    Lin, Hongfei
    Wang, Jian
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [47] An Iterative Graph-based Generic Single and Multi Document Summarization Approach using Semantic Role Labeling and Wikipedia Concepts
    Mohamed, Muhidin
    Oussalah, Mourad
    PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016), 2016, : 117 - 120
  • [48] Graph-based hierarchical video summarization using global descriptors
    Belo, Luciana
    Caetano, Carlos
    Patrocinio, Zenilton, Jr.
    Guimaraes, Silvio
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 822 - 829
  • [49] Unsupervised video summarization with adversarial graph-based attention network
    Gunuganti, Jeshmitha
    Yeh, Zhi-Ting
    Wang, Jenq-Haur
    Norouzi, Mehdi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [50] LexRank: Graph-based lexical centrality as salience in text summarization
    Erkan, G
    Radev, DR
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 : 457 - 479