GreekT5: Sequence-to-Sequence Models for Greek News Summarization

被引:0
|
作者
Giarelis, Nikolaos [1 ]
Mastrokostas, Charalampos [1 ]
Karacapilidis, Nikos [1 ]
机构
[1] Univ Patras, Ind Management & Informat Syst Lab, MEAD, Rion, Greece
关键词
Deep Learning; Natural Language Processing; Greek NLP; Text Summarization; Pretrained Language Models; Greek Language;
D O I
10.1007/978-3-031-63215-0_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text summarization is a natural language processing subtask pertaining to the automatic formulation of a concise and coherent summary that covers the major concepts and topics from one or multiple documents. Recent advancements in deep learning have led to the development of abstractive summarization Transformer-based models, which outperform classical approaches. In any case, research in this field focuses on high resource languages such as English, while the corresponding work for low resource languages is limited. Dealing with modern Greek, this paper proposes a series of new abstractive models for news article summarization. The proposed models were thoroughly evaluated on the same dataset against GreekBART, the only existing model for Greek abstractive news summarization. Our evaluation results reveal that most of the proposed models perform better than GreekBART on various evaluation metrics. Our experiments indicate that multilingual Seq2Seq models, fine-tuned for a specific language and task, can achieve similar or even better performance compared to monolingual models pre-trained and fine-tuned for the same language and task, while requiring significantly less computational resources. We make our evaluation code public, aiming to increase the reproducibility of this work and facilitate future research in the field.
引用
收藏
页码:60 / 73
页数:14
相关论文
共 50 条
  • [21] Automated Integration of Genomic Metadata with Sequence-to-Sequence Models
    Cannizzaro, Giuseppe
    Leone, Michele
    Bernasconi, Anna
    Canakoglu, Arif
    Carman, Mark J.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 187 - 203
  • [22] Neural AMR: Sequence-to-Sequence Models for Parsing and Generation
    Konstas, Ioannis
    Iyer, Srinivasan
    Yatskar, Mark
    Choi, Yejin
    Zettlemoyer, Luke
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 146 - 157
  • [23] Sequence-to-Sequence Models for Trajectory Deformation of Dynamic Manipulation
    Kutsuzawa, Kyo
    Sakaino, Sho
    Tsuji, Toshiaki
    IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 5227 - 5232
  • [24] Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
    Dutil, Francis
    Gulcehre, Caglar
    Trischler, Adam
    Bengio, Yoshua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [25] Exploring Sequence-to-Sequence Models for SPARQL Pattern Composition
    Panchbhai, Anand
    Soru, Tommaso
    Marx, Edgard
    KNOWLEDGE GRAPHS AND SEMANTIC WEB, KGSWC 2020, 2020, 1232 : 158 - 165
  • [26] Predicting the Mumble of Wireless Channel with Sequence-to-Sequence Models
    Huangfu, Yourui
    Wang, Jian
    Li, Rong
    Xu, Chen
    Wang, Xianbin
    Zhang, Huazi
    Wang, Jun
    2019 IEEE 30TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2019, : 1043 - 1049
  • [27] Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models
    Parry, Andrew
    Froebe, Maik
    MacAvaney, Sean
    Potthast, Martin
    Hagen, Matthias
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT II, 2024, 14609 : 286 - 302
  • [28] SUPERVISED ATTENTION IN SEQUENCE-TO-SEQUENCE MODELS FOR SPEECH RECOGNITION
    Yang, Gene-Ping
    Tang, Hao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7222 - 7226
  • [29] ACOUSTIC-TO-WORD RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODELS
    Palaskar, Shruti
    Metze, Florian
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 397 - 404
  • [30] Persian Keyphrase Generation Using Sequence-to-sequence Models
    Doostmohammadi, Ehsan
    Bokaei, Mohammad Hadi
    Sameti, Hossein
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 2010 - 2015