Extractive Myanmar News Summarization Using Centroid Based Word Embedding

被引:0
|
作者
Lwin, Soe Soe [1 ]
Nwet, Khin Thandar [2 ]
机构
[1] Univ Comp Studies Yangon, Yangon, Myanmar
[2] Univ Informat Technol, Yangon, Myanmar
关键词
Word Embedding; Text Summarization; Bags of words; Centroid;
D O I
10.1109/aitc.2019.8921386
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, many researches are going on for text summarization because there are a lot of data on the internet and it is required to process, store and manage. Text summarization is a process of distilling important information from the original text and presents that information in the form of summary. The system is proposed to summarize Myanmar news with centroid based method. Centroid based method ranks the sentences based on their similarity to the centroid. Centroid based method uses the bags of words model to represent sentences. Bags of words representation does not capture the semantic relationship between words. To overcome this problem, centroid based method is combined with word embedding representation instead of bags of words in this paper. Experiments were done on Myanmar news dataset. Centroid based on word embedding method gets better performance than centroid based on bags of words method.
引用
收藏
页码:200 / 205
页数:6
相关论文
共 50 条
  • [1] Extractive Text Summarization using Word Vector Embedding
    Jain, Aditya
    Bhatia, Divij
    Thakur, Manish K.
    [J]. 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 51 - 55
  • [2] Extractive Arabic Text Summarization Using PageRank and Word Embedding
    Alselwi, Ghadir
    Tasci, Tugrul
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (09) : 13115 - 13130
  • [3] A weighted word embedding based approach for extractive text summarization
    Rani, Ruby
    Lobiyal, Daya K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [4] Using Centroid Keywords and Word Mover's Distance for Single Document Extractive Summarization
    Seitkali, Dauken
    Mussabayev, Rustam
    [J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 149 - 152
  • [5] Extractive Summarization for Myanmar Language
    Lwin, Soe Soe
    Nwet, Khin Thandar
    [J]. 2018 INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2018), 2018, : 138 - 143
  • [6] Automatic Extractive Summarization using GAN Boosted by DistilBERT Word Embedding and Transductive Learning
    Li, Dongliang
    Li, Youyou
    Zhang, Zhigang
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 61 - 74
  • [7] Exploring Word Mover's Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization
    Liu, Shih-Hung
    Chen, Kuan-Yu
    Hsieh, Yu-Lun
    Chen, Berlin
    Wang, Hsin-Min
    Yen, Hsu -Chun
    Hsu, Wen-Lian
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 670 - 674
  • [8] A novel centroid based sentence classification approach for extractive summarization of COVID-19 news reports
    Banerjee S.
    Mukherjee S.
    Bandyopadhyay S.
    [J]. International Journal of Information Technology, 2023, 15 (4) : 1789 - 1801
  • [9] Graph Based Extractive News Articles Summarization Approach leveraging Static Word Embeddings
    Barman, Utpal
    Barman, Vishal
    Rahman, Mustafizur
    Choudhury, Nawaz Khan
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 8 - 11
  • [10] CENTRANK: A GRAPH CENTROID BASED RANKING FOR EXTRACTIVE TEXT SUMMARIZATION
    Hemamalini, S.
    Swaminathan, V.
    [J]. TWMS JOURNAL OF APPLIED AND ENGINEERING MATHEMATICS, 2023, 13 : 355 - 364