A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

被引:36
|
作者
Yoo, Illhoi [1 ]
Hu, Xiaohua [2 ]
Song, Il-Yeol [2 ]
机构
[1] Univ Missouri, Sch Med, Dept Hlth Management & Informat, Columbia, MO 65211 USA
[2] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19102 USA
关键词
MeSH; Betweenness Centrality; Document Cluster; Graph Cluster; Text Summarization;
D O I
10.1186/1471-2105-8-S9-S4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. Results: Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. Conclusion: Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Combining Graph Connectivity and Genetic Clustering to Improve Biomedical Summarization
    Menendez, Hector D.
    Plaza, Laura
    Camacho, David
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 2740 - 2747
  • [22] A Framework for Extractive Text Summarization using Semantic Graph Based Approach
    Ullah, Shofi
    Al Islam, A. B. M. Alim
    2019 6TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS 2019), 2019, : 48 - 55
  • [23] A Graph Based Clustering Technique for Tweet Summarization
    Dutta, Soumi
    Ghatak, Sujata
    Roy, Moumita
    Ghosh, Saptarshi
    Das, Asit Kumar
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [24] Sentence Embedding Based Semantic Clustering Approach for Discussion Thread Summarization
    Khan, Atif
    Shah, Qaiser
    Uddin, M. Irfan
    Ullah, Fasee
    Alharbi, Abdullah
    Alyami, Hashem
    Gul, Muhammad Adnan
    COMPLEXITY, 2020, 2020
  • [25] Semantic Graph Reduction Approach for Abstractive Text Summarization
    Moawad, Ibrahim F.
    Aref, Mostafa
    2012 SEVENTH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES'2012), 2012, : 132 - 138
  • [26] SGATS: Semantic Graph-based Automatic Text Summarization from Hindi Text Documents
    Joshi, Manju Lata
    Joshi, Nisheeth
    Mittal, Namita
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [27] Tamil Document Summarization using semantic graph method
    Banu, M.
    Karthika, C.
    Sudarmani, P.
    Geetha, T., V
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 128 - 134
  • [28] Graph Ranked Clustering Based Biomedical Text Summarization Using Top k Similarity
    Gupta S.
    Sharaff A.
    Nagwani N.K.
    Computer Systems Science and Engineering, 2023, 45 (03): : 2333 - 2349
  • [29] TSGVi: a graph-based summarization system for Vietnamese documents
    Tu-Anh Nguyen-Hoang
    Khai Nguyen
    Quang-Vinh Tran
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2012, 3 (04) : 305 - 313
  • [30] Graph-based structural difference analysis for video summarization
    Chai, Chunlei
    Lu, Guoliang
    Wang, Ruyun
    Lyu, Chen
    Lyu, Lei
    Zhang, Peng
    Liu, Hong
    INFORMATION SCIENCES, 2021, 577 : 483 - 509