Robust Multi-task Learning-based Korean POS Tagging to OvercomeWord Spacing Errors

被引:0
|
作者
Park, Cheoneum [1 ,2 ]
Kim, Juae [3 ,4 ]
机构
[1] SK Telecom, 65 Eulji Ro, Seoul 04539, South Korea
[2] Hyundai Motor Co, 65 Eulji Ro, Seoul 04539, South Korea
[3] Hankuk Univ Foreign Studies, 107 Imun Ro, Seoul 02451, South Korea
[4] Hyundai Motor Co, 107 Imun Ro, Seoul 02451, South Korea
关键词
Morphological analysis; part-of-speech tagging; word spacing; multi-task learning;
D O I
10.1145/3591206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end neural network-based approaches have recently demonstrated significant improvements in natural language processing (NLP). However, in the NLP application such as assistant systems, NLP components are still processed to extract results using a pipeline paradigm. The pipeline-based concept has issues with error propagation. In Korean, morphological analysis and part-of-speech (POS) tagging step, incorrectly analyzing POS tags for a sentence containing spacing errors negatively affects other modules behind the POS module. Hence, we present a multi-task learning-based POS tagging neural model for Korean with word spacing challenges. When we apply this model to the Korean morphological analysis and POS tagging, we get findings that are robust to word spacing errors. We adopt syllable-level input and output formats, as well as a simple structure for ELECTRA and RNN-CRF models for multi-task learning, and we achieve a good performance 98.30 of F1, better than previous studies on the Sejong corpus test set.
引用
下载
收藏
页数:13
相关论文
共 50 条
  • [21] Multi-task Learning for Chinese Word Usage Errors Detection
    Zhang, Jinbin
    Wang, Heng
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2018, : 93 - 96
  • [22] Robust license plate signatures matching based on multi-task learning approach
    Hasnat, Abul
    Nakib, Amir
    NEUROCOMPUTING, 2021, 440 : 58 - 71
  • [23] A Deep Learning-Based Approach for Part of Speech (PoS) Tagging in the Pashto Language
    Ullah, Shaheen
    Ahmad, Riaz
    Namoun, Abdallah
    Muhammad, Siraj
    Ullah, Khalil
    Hussain, Ibrar
    Ibrahim, Isa Ali
    IEEE ACCESS, 2024, 12 : 86355 - 86364
  • [24] Evaluation of multi-task learning in deep learning-based positioning classification of mandibular third molars
    Sukegawa, Shintaro
    Matsuyama, Tamamo
    Tanaka, Futa
    Hara, Takeshi
    Yoshii, Kazumasa
    Yamashita, Katsusuke
    Nakano, Keisuke
    Takabatake, Kiyofumi
    Kawai, Hotaka
    Nagatsuka, Hitoshi
    Furuki, Yoshihiko
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [25] Evaluation of multi-task learning in deep learning-based positioning classification of mandibular third molars
    Shintaro Sukegawa
    Tamamo Matsuyama
    Futa Tanaka
    Takeshi Hara
    Kazumasa Yoshii
    Katsusuke Yamashita
    Keisuke Nakano
    Kiyofumi Takabatake
    Hotaka Kawai
    Hitoshi Nagatsuka
    Yoshihiko Furuki
    Scientific Reports, 12
  • [26] Robust Lifelong Multi-task Multi-view Representation Learning
    Sun, Gan
    Cong, Yang
    Li, Jun
    Fu, Yun
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 91 - 98
  • [27] Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework
    Parikh, Rahil
    Sundar, Harshavardhan
    Sun, Ming
    Wang, Chao
    Matsoukas, Spyros
    INTERSPEECH 2022, 2022, : 4192 - 4196
  • [28] An End-to-End Scalable Iterative Sequence Tagging with Multi-Task Learning
    Gui, Lin
    Du, Jiachen
    Zhao, Zhishan
    He, Yulan
    Xu, Ruifeng
    Fan, Chuang
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 288 - 298
  • [29] Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks
    Schroeder, Fynn
    Biemann, Chris
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2971 - 2985
  • [30] MTTLADE: A multi-task transfer learning-based method for adverse drug events extraction
    El-allaly, Ed-drissiya
    Sarrouti, Mourad
    En-Nahnahi, Noureddine
    El Alaoui, Said Ouatik
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)