GreekT5: Sequence-to-Sequence Models for Greek News Summarization

Cited by: 0
Authors
Giarelis, Nikolaos [1 ]
Mastrokostas, Charalampos [1 ]
Karacapilidis, Nikos [1 ]
Affiliations
[1] University of Patras, Industrial Management & Information Systems Lab, MEAD, Rion, Greece
Keywords
Deep Learning; Natural Language Processing; Greek NLP; Text Summarization; Pretrained Language Models; Greek Language
DOI
10.1007/978-3-031-63215-0_5
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Text summarization is a natural language processing subtask concerned with the automatic generation of a concise and coherent summary that covers the major concepts and topics of one or more documents. Recent advances in deep learning have led to abstractive, Transformer-based summarization models that outperform classical approaches. However, research in this field focuses on high-resource languages such as English, while the corresponding work for low-resource languages remains limited. Addressing Modern Greek, this paper proposes a series of new abstractive models for news article summarization. The proposed models were thoroughly evaluated on the same dataset against GreekBART, the only existing model for Greek abstractive news summarization. Our evaluation results reveal that most of the proposed models outperform GreekBART on various evaluation metrics. Our experiments indicate that multilingual Seq2Seq models, fine-tuned for a specific language and task, can achieve similar or even better performance than monolingual models pre-trained and fine-tuned for the same language and task, while requiring significantly fewer computational resources. We make our evaluation code public, aiming to increase the reproducibility of this work and facilitate future research in the field.
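As a minimal, hedged illustration (not code from the paper), the Python sketch below shows how a fine-tuned Seq2Seq summarization checkpoint of this kind could be loaded and applied with the Hugging Face Transformers library. The model identifier is a placeholder assumption; substitute the checkpoint actually released by the authors.

# pip install transformers torch sentencepiece
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder model ID (an assumption, not necessarily the authors' release).
MODEL_ID = "IMISLab/GreekT5-mt5-small-greeksum"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

article = "..."  # a Modern Greek news article goes here

# Truncate long articles so they fit the encoder's context window.
inputs = tokenizer(article, max_length=1024, truncation=True, return_tensors="pt")

# Beam search is a common decoding choice for abstractive summarization.
summary_ids = model.generate(**inputs, max_new_tokens=128, num_beams=4, early_stopping=True)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))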
Pages: 60-73
Page count: 14
Related Papers
(showing 10 of 50)
  • [1] Neural Abstractive Text Summarization with Sequence-to-Sequence Models
    Shi, Tian
    Keneshloo, Yaser
    Ramakrishnan, Naren
    Reddy, Chandan K.
    ACM/IMS Transactions on Data Science, 2021, 2(01)
  • [2] Turkish abstractive text summarization using pretrained sequence-to-sequence models
    Baykara, Batuhan
    Gungor, Tunga
    Natural Language Engineering, 2023, 29(05): 1275-1304
  • [3] Sparse Sequence-to-Sequence Models
    Peters, Ben
    Niculae, Vlad
    Martins, Andre F. T.
    57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), 2019: 1504-1519
  • [4] Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization
    Chen, Jiaao
    Yang, Diyi
    Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020: 4106-4118
  • [5] Towards Sequence-to-Sequence Neural Model for Croatian Abstractive Summarization
    Davidovic, Vlatka
    Martincic-Ipsic, Sanda
    Central European Conference on Information and Intelligent Systems (CECIIS), 2023: 309-315
  • [6] Assessing incrementality in sequence-to-sequence models
    Ulmer, Dennis
    Hupkes, Dieuwke
    Bruni, Elia
    4th Workshop on Representation Learning for NLP (RepL4NLP-2019), 2019: 209-217
  • [7] An Analysis of "Attention" in Sequence-to-Sequence Models
    Prabhavalkar, Rohit
    Sainath, Tara N.
    Li, Bo
    Rao, Kanishka
    Jaitly, Navdeep
    18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), Vols 1-6: Situated Interaction, 2017: 3702-3706
  • [8] Myanmar News Headline Generation with Sequence-to-Sequence model
    Thu, Yamin
    Pa, Win Pa
    Proceedings of the 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (Oriental-COCOSDA 2020), 2020: 117-122
  • [9] Hierarchical Speaker-Aware Sequence-to-Sequence Model for Dialogue Summarization
    Lei, Yuejie
    Yan, Yuanmeng
    Zeng, Zhiyuan
    He, Keqing
    Zhang, Ximing
    Xu, Weiran
    2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 7823-7827
  • [10] BART-IT: An Efficient Sequence-to-Sequence Model for Italian Text Summarization
    La Quatra, Moreno
    Cagliero, Luca
    Future Internet, 2023, 15(01)