A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

被引:36
|
作者
Yoo, Illhoi [1 ]
Hu, Xiaohua [2 ]
Song, Il-Yeol [2 ]
机构
[1] Univ Missouri, Sch Med, Dept Hlth Management & Informat, Columbia, MO 65211 USA
[2] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19102 USA
关键词
MeSH; Betweenness Centrality; Document Cluster; Graph Cluster; Text Summarization;
D O I
10.1186/1471-2105-8-S9-S4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. Results: Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. Conclusion: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Graph-Based Text Summarization Using Modified TextRank
    Mallick, Chirantana
    Das, Ajit Kumar
    Dutta, Madhurima
    Das, Asit Kumar
    Sarkar, Apurba
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 137 - 146
  • [32] Learning Similarity Functions in Graph-Based Document Summarization
    Ouyang, You
    Li, Wenjie
    Wei, Furu
    Lu, Qin
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 189 - 200
  • [33] Update Summarization via Graph-Based Sentence Ranking
    Li, Xuan
    Du, Liang
    Shen, Yi-Dong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (05) : 1162 - 1174
  • [34] A Semantic QA-Based Approach for Text Summarization Evaluation
    Chen, Ping
    Wu, Fei
    Wang, Tong
    Ding, Wei
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4800 - 4807
  • [35] Graph-based Multimodal Ranking Models for Multimodal Summarization
    Zhu, Junnan
    Xiang, Lu
    Zhou, Yu
    Zhang, Jiajun
    Zong, Chengqing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [36] TSGVi: a graph-based summarization system for Vietnamese documents
    Tu-Anh Nguyen-Hoang
    Khai Nguyen
    Quang-Vinh Tran
    Journal of Ambient Intelligence and Humanized Computing, 2012, 3 : 305 - 313
  • [37] EdgeSumm: Graph-based framework for automatic text summarization
    El-Kassas, Wafaa S.
    Salama, Cherif R.
    Rafea, Ahmed A.
    Mohamed, Hoda K.
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [38] Graph-based extractive text summarization based on single document
    Avaneesh Kumar Yadav
    Rama Shankar Ranvijay
    Ashish Kumar Yadav
    Multimedia Tools and Applications, 2024, 83 : 18987 - 19013
  • [39] Comparison of Graph-based and Term Weighting Method for Automatic Summarization of Online News
    Rumagit, Reinert Yosua
    Setiyawati, Nina
    Bangkalang, Dwi Hosanna
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 663 - 672
  • [40] An automatic text summarization approach using content-based and graph-based characteristics
    Sornil, Ohm
    Gree-ut, Kornnika
    2006 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 795 - +