A novel centroid based sentence classification approach for extractive summarization of COVID-19 news reports

Authors
Banerjee S. [1]
Mukherjee S. [1]
Bandyopadhyay S. [1]
Affiliation
[1] Computer Science and Engineering, National Institute of Technology Silchar, Assam, Silchar
Keywords
Extractive text summarization; Query focused summarization; Sentence classification
DOI
10.1007/s41870-023-01221-x
Abstract
A COVID-19 news report covers subtopics such as infections, deaths, the economy, jobs, and more. The proposed method generates a news summary based on the subtopics that interest a reader. It extracts a centroid that captures the lexical pattern of the sentences on those subtopics from the words used frequently in them. The centroid is then used as a query in the vector space model (VSM) for sentence classification and extraction, producing a query focused summarization (QFS) of the documents. Three approaches, TF-IDF, word vector averaging, and an auto-encoder, are tested for generating the sentence embeddings used in the VSM. These embeddings are ranked by their similarity to the query embedding. A novel approach is introduced that uses a supervised technique to find the value of the similarity parameter used to classify the sentences. Finally, the performance of the method is assessed in two ways. In the first assessment, all the sentences of the dataset are considered together; in the second, each document-wise group of sentences is considered separately using fivefold cross-validation. The proposed method achieves mean F1 scores ranging from a minimum of 0.60 to a maximum of 0.63 with the three sentence encoding approaches on the test dataset. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
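The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it covers only the TF-IDF variant, and the tokenizer, the `top_k` cut-off for "frequently used words", the grid search over thresholds, and the four toy sentences are all assumptions made for illustration. The abstract does not say how the supervised technique chooses the similarity parameter; picking the threshold that maximizes F1 on labeled sentences is one plausible reading.

```python
import math
from collections import Counter

def tokenize(text):
    # Crude tokenizer (assumption): lowercase, alphabetic characters only.
    return "".join(c.lower() if c.isalpha() else " " for c in text).split()

def build_idf(sentences):
    # Smoothed inverse document frequency; each sentence counts as a document.
    n = len(sentences)
    df = Counter(w for s in sentences for w in set(tokenize(s)))
    return {w: math.log((1 + n) / (1 + d)) + 1.0 for w, d in df.items()}

def tfidf(text, idf):
    # Sparse TF-IDF vector as a dict; out-of-vocabulary words are ignored.
    tf = Counter(w for w in tokenize(text) if w in idf)
    return {w: c * idf[w] for w, c in tf.items()}

def cosine(a, b):
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def centroid_query(on_topic, idf, top_k=5):
    # Assumed centroid: the top_k most frequent words of the on-topic sentences.
    counts = Counter(w for s in on_topic for w in tokenize(s) if w in idf)
    return tfidf(" ".join(w for w, _ in counts.most_common(top_k)), idf)

def f1(pred, gold):
    tp = sum(p and g for p, g in zip(pred, gold))
    fp = sum(p and not g for p, g in zip(pred, gold))
    fn = sum(g and not p for p, g in zip(pred, gold))
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def best_threshold(scores, gold):
    # Assumed supervised step: grid search for the cut-off with the best F1.
    grid = [i / 100 for i in range(1, 100)]
    return max(grid, key=lambda t: f1([s >= t for s in scores], gold))

# Toy labeled sentences (hypothetical); True marks the reader's subtopics.
train = [
    ("New infections rose sharply as deaths climbed in the city.", True),
    ("Hospitals reported more deaths and infections this week.", True),
    ("The stock market rallied after the jobs report.", False),
    ("Unemployment claims fell as the economy reopened.", False),
]
sentences = [s for s, _ in train]
gold = [g for _, g in train]

idf = build_idf(sentences)
query = centroid_query([s for s, g in train if g], idf)
scores = [cosine(tfidf(s, idf), query) for s in sentences]
threshold = best_threshold(scores, gold)

# Sentences at or above the learned threshold form the query focused summary.
summary = [s for s, sc in zip(sentences, scores) if sc >= threshold]
print(summary)
```

In this toy run the two sentences about infections and deaths are selected, while the two economy sentences share no words with the centroid and score zero. Replacing `tfidf` with averaged word vectors or an auto-encoder output would change only the embedding step; the ranking and thresholding would stay the same.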
Pages: 1789-1801
Number of pages: 12