Solving Hungarian natural language processing tasks with multilingual generative models

被引:0
|
作者
Yang, Zijian Gyozo [1 ]
Laki, Laszlo Janos [2 ]
机构
[1] Hungarian Res Ctr Linguist, Budapest, Hungary
[2] Pazmany Peter Catholic Univ, Fac Informat Technol & Bion, MTA PPKE Hungarian Language Technol Res Grp, Budapest, Hungary
来源
关键词
natural language processing; multilingual model; sentiment analy-sis; abstractive summarization; machine translation; Marian NMT;
D O I
暂无
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Generative ability is a crucial need for artificial intelligence appli-cations, such as chatbots, virtual assistants, machine translation systems etc. In recent years, the transformer-based neural architectures gave a huge boost to generate human-like English texts. In our research we did experiments to create pre-trained generative transformer models for Hungarian language and fine-tune them for multiple types of natural language processing tasks.In our focus, multilingual models were trained. We have pre-trained a multilingual BART, then fine-tuned it to various NLP tasks, such as text classification, abstractive summarization. In our experiments, we focused on transfer learning techniques to increase the performance. Furthermore, a M2M100 multilingual model was fine-tuned for a 12-lingual Hungarian -Centric machine translation. Last but not least, a Marian NMT based machine translation system was also built from scratch for the 12-lingual Hungarian-Centric machine translation task.In our results, using the cross-lingual transfer method we could achieve higher performance in all of our tasks. In our machine translation experi-ment, using our fine-tuned M2M100 model we could outperform the Google Translate, Microsoft Translator and eTranslation.
引用
收藏
页码:92 / 106
页数:15
相关论文
共 50 条
  • [1] Solving Hungarian natural language processing tasks with multilingual generative models
    Yang, Zijian Gyozo
    Laki, Laszlo Janos
    [J]. ANNALES MATHEMATICAE ET INFORMATICAE, 2023, 57 : 92 - 106
  • [2] Stellenwert von Natural Language Processing und chatbasierten Generative Language ModelsSignificance of natural language processing and chat-based generative language models
    Markus Haar
    Michael Sonntagbauer
    Stefan Kluge
    [J]. Medizinische Klinik - Intensivmedizin und Notfallmedizin, 2024, 119 : 181 - 188
  • [3] Robustness of GPT Large Language Models on Natural Language Processing Tasks
    Xuanting, Chen
    Junjie, Ye
    Can, Zu
    Nuo, Xu
    Tao, Gui
    Qi, Zhang
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): : 1128 - 1142
  • [4] Stellenwert von Natural Language Processing und chatbasierten Generative Language Models
    Haar, Markus
    Sonntagbauer, Michael
    Kluge, Stefan
    [J]. MEDIZINISCHE KLINIK-INTENSIVMEDIZIN UND NOTFALLMEDIZIN, 2024, 119 (03) : 181 - 188
  • [5] Performance Prediction via Bayesian Matrix Factorisation for Multilingual Natural Language Processing Tasks
    Schram, Viktoria
    Beck, Daniel
    Cohn, Trevor
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1790 - 1801
  • [6] Robust-to-Noise Models in Natural Language Processing Tasks
    Malykh, Valentin
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 10 - 16
  • [7] Unsupervised multi-sense language models for natural language processing tasks
    Roh, Jihyeon
    Park, Sungjin
    Kim, Bo-Kyeong
    Oh, Sang-Hoon
    Lee, Soo-Young
    [J]. NEURAL NETWORKS, 2021, 142 : 397 - 409
  • [8] Semantic Representations for Multilingual Natural Language Processing
    Kozerenko, Elena B.
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 433 - 438
  • [9] Implementation of Sentence Parser for Hungarian Language in Natural Language Processing
    Kovacs, L.
    Barabas, P.
    [J]. 2010 IEEE 8TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, 2010, : 59 - 63
  • [10] Analysis of sentence embedding models using prediction tasks in natural language processing
    Adi, Y.
    Kermany, E.
    Belinkov, Y.
    Lavi, O.
    Goldberg, Y.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)