Using Centroid Keywords and Word Mover's Distance for Single Document Extractive Summarization

被引:1
|
作者
Seitkali, Dauken [1 ]
Mussabayev, Rustam [1 ]
机构
[1] Inst Informat & Computat Technol, 125 Pushkina, Alma Ata 050010, Kazakhstan
关键词
Centroid; WMD; word2vec; extractive summarization;
D O I
10.1145/3342827.3342852
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents unsupervised method of single document extractive summarization. The main idea behind the method is in selecting sentences based on Word Mover's Distance Similarity between each sentence and set of centroid keywords. This approach leverages both compositional property of word embeddings and advantages of recently discovered powerful text to text distance metric. ROUGE results on DUC 2002 data set showed that quality of produced summaries can compete with well-known state of the art systems. In this work we also discuss limitations of gold summaries in evaluating quality of summarization systems.
引用
收藏
页码:149 / 152
页数:4
相关论文
共 50 条
  • [1] Extractive Myanmar News Summarization Using Centroid Based Word Embedding
    Lwin, Soe Soe
    Nwet, Khin Thandar
    [J]. 2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 200 - 205
  • [2] Exploring Word Mover's Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization
    Liu, Shih-Hung
    Chen, Kuan-Yu
    Hsieh, Yu-Lun
    Chen, Berlin
    Wang, Hsin-Min
    Yen, Hsu -Chun
    Hsu, Wen-Lian
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 670 - 674
  • [3] An Efficient Approach for Findings Document Similarity Using Optimized Word Mover's Distance
    Dey, Atanu
    Jenamani, Mamata
    De, Arijit
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 3 - 11
  • [4] Single Document Extractive Text Summarization Using Genetic Algorithms
    Chatterjee, Niladri
    Mittal, Amol
    Goyal, Shubham
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2012, : 19 - 23
  • [5] Single document summarization using word and sentence embeddings
    Ayana
    [J]. PROCEEDINGS OF THE 2015 JOINT INTERNATIONAL MECHANICAL, ELECTRONIC AND INFORMATION TECHNOLOGY CONFERENCE (JIMET 2015), 2015, 10 : 523 - 526
  • [6] Word topical mixture models for extractive spoken document summarization
    Chen, Berlin
    Chen, Yi-Ting
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 52 - 55
  • [7] Single document extractive text summarization using cuckoo search algorithm
    Pati, Siba Prasad
    Rautray, Rasmita
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05): : 1089 - 1097
  • [8] Supervised Word Mover's Distance
    Huang, Gao
    Guo, Chuan
    Kusner, Matt J.
    Sun, Yu
    Weinberger, Kilian Q.
    Sha, Fei
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [9] Extractive Summarization of a Document Using Lexical Chains
    Mallick, Chirantana
    Dutta, Madhurima
    Das, Ajit Kumar
    Sarkar, Apurba
    Das, Asit Kumar
    [J]. SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 825 - 836
  • [10] Enhanced Genetic Algorithm for Single Document Extractive Summarization
    Bui Thi Mai Anh
    Nguyen Tra My
    Nguyen Thi Thu Trang
    [J]. SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 370 - 376