Central Embeddings for Extractive Summarization Based on Similarity

Cited by: 0
Authors
Gutierrez-Hinojosa, Sandra J. [1 ]
Calvo, Hiram [1 ]
Moreno-Armendariz, Marco A. [1 ]
Affiliation
[1] Inst Politecn Nacl, Ctr Invest Comp, Mexico City, DF, Mexico
Source
COMPUTACION Y SISTEMAS | 2019 / Vol. 23 / No. 03
Keywords
Extractive summarization; prevalent ideas extraction; concept similarity; central embeddings; DUC 2002
DOI
10.13053/CyS-23-3-3256
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
In this work we propose combining word embeddings with unsupervised methods such as clustering for the multi-document summarization task of DUC (Document Understanding Conference) 2002. We aim to find evidence that word embeddings retain semantic information and that these representations can be grouped by similarity, so that the main ideas in a set of documents can be identified. We experiment with different clustering methods to extract candidates for the multi-document summarization task. Our experiments show that our method is able to find the prevalent ideas. The ROUGE scores of our experiments are similar to the state of the art, even though not all the main ideas are found. Because our method does not require annotated resources, it provides a domain- and language-independent way to create a summary.
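The abstract does not say which embeddings or clustering methods were used, so the following is only a minimal sketch of the general idea under stated assumptions: sentences are represented by the average of their word vectors, grouped with a plain k-means variant that assigns points by cosine similarity, and the sentence closest to each cluster centre is taken as a summary candidate. The toy 2-d vectors and the names `TOY_VECTORS`, `sentence_vector`, `kmeans`, and `summarize` are hypothetical and chosen purely for illustration; they are not taken from the paper.

```python
import math

# Hypothetical toy 2-d word vectors; a real system would load pretrained embeddings.
TOY_VECTORS = {
    "storm": (1.0, 0.0), "rain": (0.9, 0.1), "flood": (0.8, 0.2),
    "vote": (0.0, 1.0), "election": (0.1, 0.9), "ballot": (0.2, 0.8),
}

def sentence_vector(sentence):
    """Average the vectors of the words found in the toy table (an assumption)."""
    vecs = [TOY_VECTORS[w] for w in sentence.lower().split() if w in TOY_VECTORS]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(component) / len(vecs) for component in zip(*vecs))

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def kmeans(points, k, iterations=10):
    """Simple k-means with cosine-based assignment, seeded with the first k points."""
    centroids = list(points[:k])
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            best = max(range(k), key=lambda i: cosine(p, centroids[i]))
            clusters[best].append(p)
        # Recompute each centre as the mean of its members; keep it if the cluster is empty.
        centroids = [
            tuple(sum(component) / len(members) for component in zip(*members))
            if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return centroids

def summarize(sentences, k):
    """Return, in document order, the sentence nearest to each cluster centre."""
    vectors = [sentence_vector(s) for s in sentences]
    chosen = []
    for centre in kmeans(vectors, k):
        idx = max(range(len(sentences)), key=lambda i: cosine(vectors[i], centre))
        if idx not in chosen:
            chosen.append(idx)
    return [sentences[i] for i in sorted(chosen)]

sentences = ["storm rain", "vote election", "rain flood storm", "election ballot vote"]
summary = summarize(sentences, k=2)
print(summary)
```

With two clusters, the sketch should return one weather sentence and one election sentence, which mirrors the abstract's goal of surfacing one representative per prevalent idea without any annotated data.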
Pages: 649 - 663
Page count: 15
Related papers
50 in total
  • [1] Sentence Pair Embeddings Based Evaluation Metric for Abstractive and Extractive Summarization
    Akula, Ramya
    Garibay, Ivan
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6009 - 6017
  • [2] STRASS: A Light and Effective Method for Extractive Summarization Based on Sentence Embeddings
    Bouscarrat, Leo
    Bonnefoy, Antoine
    Peel, Thomas
    Pereira, Cecile
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 243 - 252
  • [3] The Combination of Similarity Measures for Extractive Summarization
    Hy Nguyen
    Tung Le
    Viet-Thang Luong
    Minh-Quoc Nghiem
    Dien Dinh
    [J]. PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 66 - 72
  • [4] A Similarity-Based Abstract Argumentation Approach to Extractive Text Summarization
    Ferilli, Stefano
    Pazienza, Andrea
    Angelastro, Sergio
    Suglia, Alessandro
    [J]. AI*IA 2017 ADVANCES IN ARTIFICIAL INTELLIGENCE, 2017, 10640 : 87 - 100
  • [5] Graph Based Extractive News Articles Summarization Approach leveraging Static Word Embeddings
    Barman, Utpal
    Barman, Vishal
    Rahman, Mustafizur
    Choudhury, Nawaz Khan
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 8 - 11
  • [6] An unsupervised method for extractive multi-document summarization based on centroid approach and sentence embeddings
    Lamsiyah, Salima
    El Mahdaouy, Abdelkader
    Espinasse, Bernard
    Ouatik, Said El Alaoui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 167
  • [7] An optimized hybrid deep learning model based on word embeddings and statistical features for extractive summarization
    Wazery, Yaser M.
    Saleh, Marwa E.
    Ali, Abdelmgeid A.
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [8] A new sentence similarity measure and sentence based extractive technique for automatic text summarization
    Aliguliyev, Ramiz M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7764 - 7772
  • [9] Memory-based Extractive Summarization
    Feng, Chong
    Pan, Zhiqiang
    Zheng, Jianming
    Xu, Ying
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE), 2018, : 549 - 552
  • [10] Heuristic Initialization And Similarity Integration Based Model for Improving Extractive Multi-Document Summarization
    Kadhim, Nasreen J.
    Mohammed, Dheyaa Abdulameer
    [J]. JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (05) : 330 - 350