A Comparison of Multiple Approaches for the Extractive Summarization of Portuguese Texts

Cited by: 0
Authors
Costa, Miguel [1]
Martins, Bruno [1]
Affiliations
[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1699 Lisbon, Portugal
Source
LINGUAMATICA | 2015 / Volume 7 / Issue 01
Keywords
Automatic Summarization; Comparative Evaluation;
DOI
Not available
Chinese Library Classification number
H0 [Linguistics];
Discipline classification codes
030303 ; 0501 ; 050102 ;
Abstract
Automatic document summarization is the task of automatically generating condensed versions of source texts, presenting itself as one of the fundamental problems in the areas of Information Retrieval and Natural Language Processing. In this paper, different extractive approaches are compared in the task of summarizing individual documents corresponding to journalistic texts written in Portuguese. Through the use of the ROUGE package for measuring the quality of the produced summaries, we report on results for two different experimental domains, involving (i) the generation of headlines for news articles written in European Portuguese, and (ii) the generation of summaries for news articles written in Brazilian Portuguese. The results demonstrate that methods based on the selection of the first sentences have the best results when building extractive news headlines in terms of several ROUGE metrics. Regarding the generation of summaries with more than one sentence, the method that achieved the best results was the LSA Squared algorithm, for the various ROUGE metrics.
Pages: 23 - 40
Page count: 18
Related papers
50 in total
  • [41] Unsupervised Extractive Summarization of Emotion Triggers
    Sosea, Tiberiu
    Zhan, Hongli
    Li, Junyi Jessy
    Caragea, Cornelia
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 9550 - 9569
  • [42] Extractive Speech Summarization By Active Learning
    Zhang, Justin Jian
    Chan, Ricky Ho Yin
    Fung, Pascale
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 392 - 397
  • [43] Extractive summarization of clinical trial descriptions
    Gulden, Christian
    Kirchner, Melanie
    Schuettler, Christina
    Hinderer, Marc
    Kampf, Marvin
    Prokosch, Hans-Ulrich
    Toddenroth, Dennis
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 : 114 - 121
  • [44] BANDITSUM: Extractive Summarization as a Contextual Bandit
    Dong, Yue
    Shen, Yikang
    Crawford, Eric
    van Hoof, Herke
    Cheung, Jackie C. K.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3739 - 3748
  • [45] Deep Differential Amplifier for Extractive Summarization
    Jia, Ruipeng
    Cao, Yanan
    Fang, Fang
    Zhou, Yuchen
    Fang, Zheng
    Liu, Yanbing
    Wang, Shi
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 366 - 376
  • [46] SENTENCE MODELING FOR EXTRACTIVE SPEECH SUMMARIZATION
    Chen, Berlin
    Chang, Hao-Chin
    Chen, Kuan-Yu
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [47] Heterogeneous graphormer for extractive multimodal summarization
    Jiang, Xiankai
    Chen, Jingqiang
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, : 355 - 373
  • [48] DistilSum: Distilling the Knowledge for Extractive Summarization
    Jia, Ruipeng
    Cao, Yanan
    Shi, Haichao
    Fang, Fang
    Liu, Yanbing
    Tan, Jianlong
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2069 - 2072
  • [49] The Combination of Similarity Measures for Extractive Summarization
    Hy Nguyen
    Tung Le
    Viet-Thang Luong
    Minh-Quoc Nghiem
    Dien Dinh
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 66 - 72
  • [50] On-Device Extractive Text Summarization
    Dhaliwal, Mehak Preet
    Kumar, Rishabh
    Rungta, Mukund
    Tiwari, Hemant
    Vala, Vanraj
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 347 - 354