Solving Hungarian natural language processing tasks with multilingual generative models

被引:0
|
作者
Yang, Zijian Gyozo [1 ]
Laki, Laszlo Janos [2 ]
机构
[1] Hungarian Res Ctr Linguist, Budapest, Hungary
[2] Pazmany Peter Catholic Univ, Fac Informat Technol & Bion, MTA PPKE Hungarian Language Technol Res Grp, Budapest, Hungary
来源
关键词
natural language processing; multilingual model; sentiment analy-sis; abstractive summarization; machine translation; Marian NMT;
D O I
暂无
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Generative ability is a crucial need for artificial intelligence appli-cations, such as chatbots, virtual assistants, machine translation systems etc. In recent years, the transformer-based neural architectures gave a huge boost to generate human-like English texts. In our research we did experiments to create pre-trained generative transformer models for Hungarian language and fine-tune them for multiple types of natural language processing tasks.In our focus, multilingual models were trained. We have pre-trained a multilingual BART, then fine-tuned it to various NLP tasks, such as text classification, abstractive summarization. In our experiments, we focused on transfer learning techniques to increase the performance. Furthermore, a M2M100 multilingual model was fine-tuned for a 12-lingual Hungarian -Centric machine translation. Last but not least, a Marian NMT based machine translation system was also built from scratch for the 12-lingual Hungarian-Centric machine translation task.In our results, using the cross-lingual transfer method we could achieve higher performance in all of our tasks. In our machine translation experi-ment, using our fine-tuned M2M100 model we could outperform the Google Translate, Microsoft Translator and eTranslation.
引用
收藏
页码:92 / 106
页数:15
相关论文
共 50 条
  • [31] Knowledge-based Data Processing for Multilingual Natural Language Analysis
    Jain, Deepak Kumar
    Eyre, Yamila Garcia-Martinez
    Kumar, Akshi
    Gupta, Brij B.
    Kotecha, Ketan
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
  • [32] Bilingual and multilingual language processing
    Halsband, Ulrike
    [J]. JOURNAL OF PHYSIOLOGY-PARIS, 2006, 99 (4-6) : 355 - 369
  • [33] A scoping review of publicly available language tasks in clinical natural language processing
    Gao, Yanjun
    Dligach, Dmitriy
    Christensen, Leslie
    Tesch, Samuel
    Laffin, Ryan
    Xu, Dongfang
    Miller, Timothy
    Uzuner, Ozlem
    Churpek, Matthew M.
    Afshar, Majid
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (10) : 1797 - 1806
  • [34] Learning Representations forWeakly Supervised Natural Language Processing Tasks
    Huang, Fei
    Ahuja, Arun
    Downey, Doug
    Yang, Yi
    Guo, Yuhong
    Yates, Alexander
    [J]. COMPUTATIONAL LINGUISTICS, 2014, 40 (01) : 85 - 120
  • [35] A Toolkit for Text Extraction and Analysis for Natural Language Processing Tasks
    Sefara, Tshephisho Joseph
    Mbooi, Mahlatse
    Mashile, Katlego
    Rambuda, Thompho
    Rangata, Mapitsi
    [J]. 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, BIG DATA, COMPUTING AND DATA COMMUNICATION SYSTEMS (ICABCD2022), 2022,
  • [36] ITALIAN-LEGAL-BERT models for improving natural language processing tasks in the Italian legal domain
    Licari, Daniele
    Comande, Giovanni
    [J]. COMPUTER LAW & SECURITY REVIEW, 2024, 52
  • [37] Multilingual spoken language processing - Challenges for multilingual systems
    Fung, Pascale
    Schultz, Tanja
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03) : 89 - 97
  • [38] SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
    Chang, Kai-Wei
    Wu, Haibin
    Wang, Yu-Kai
    Wu, Yuan-Kuei
    Shen, Hua
    Tseng, Wei-Cheng
    Kang, Iu-Thing
    Li, Shang-Wen
    Lee, Hung-Yi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3730 - 3744
  • [39] On the Explainability of Natural Language Processing Deep Models
    El Zini, Julia
    Awad, Mariette
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (05)
  • [40] Dissecting word embeddings and language models in natural language processing
    Verma, Vivek Kumar
    Pandey, Mrigank
    Jain, Tarun
    Tiwari, Pradeep Kumar
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2021, 24 (05): : 1509 - 1515