Robust Multi-task Learning-based Korean POS Tagging to OvercomeWord Spacing Errors

被引:0
|
作者
Park, Cheoneum [1 ,2 ]
Kim, Juae [3 ,4 ]
机构
[1] SK Telecom, 65 Eulji Ro, Seoul 04539, South Korea
[2] Hyundai Motor Co, 65 Eulji Ro, Seoul 04539, South Korea
[3] Hankuk Univ Foreign Studies, 107 Imun Ro, Seoul 02451, South Korea
[4] Hyundai Motor Co, 107 Imun Ro, Seoul 02451, South Korea
关键词
Morphological analysis; part-of-speech tagging; word spacing; multi-task learning;
D O I
10.1145/3591206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end neural network-based approaches have recently demonstrated significant improvements in natural language processing (NLP). However, in the NLP application such as assistant systems, NLP components are still processed to extract results using a pipeline paradigm. The pipeline-based concept has issues with error propagation. In Korean, morphological analysis and part-of-speech (POS) tagging step, incorrectly analyzing POS tags for a sentence containing spacing errors negatively affects other modules behind the POS module. Hence, we present a multi-task learning-based POS tagging neural model for Korean with word spacing challenges. When we apply this model to the Korean morphological analysis and POS tagging, we get findings that are robust to word spacing errors. We adopt syllable-level input and output formats, as well as a simple structure for ELECTRA and RNN-CRF models for multi-task learning, and we achieve a good performance 98.30 of F1, better than previous studies on the Sejong corpus test set.
引用
下载
收藏
页数:13
相关论文
共 50 条
  • [1] Local Learning-based Multi-task Clustering
    Zhong, Guo
    Pun, Chi-Man
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [2] Multi-task Learning-Based Spoofing-Robust Automatic Speaker Verification System
    Yuanjun Zhao
    Roberto Togneri
    Victor Sreeram
    Circuits, Systems, and Signal Processing, 2022, 41 : 4068 - 4089
  • [3] Multi-task Learning-Based Spoofing-Robust Automatic Speaker Verification System
    Zhao, Yuanjun
    Togneri, Roberto
    Sreeram, Victor
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (07) : 4068 - 4089
  • [4] Transfer Learning-Based Evolutionary Multi-task Optimization
    Li, Shuai
    Zhu, Xiaobing
    Li, Xi
    BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023, 2024, 2061 : 14 - 28
  • [5] Robust Estimator based Adaptive Multi-Task Learning
    Zhu, Peiyuan
    Chen, Cailian
    He, Jianping
    Zhu, Shanying
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 740 - 747
  • [6] ADAPTIVE AND ROBUST MULTI-TASK LEARNING
    Duan, Yaqi
    Wang, Kaizheng
    ANNALS OF STATISTICS, 2023, 51 (05): : 2015 - 2039
  • [7] Multi-Task Learning-Based Immunofluorescence Classification of Kidney Disease
    Pan, Sai
    Fu, Yibing
    Chen, Pu
    Liu, Jiaona
    Liu, Weicen
    Wang, Xiaofei
    Cai, Guangyan
    Yin, Zhong
    Wu, Jie
    Tang, Li
    Wang, Yong
    Duan, Shuwei
    Dai, Ning
    Jiang, Lai
    Xu, Mai
    Chen, Xiangmei
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (20)
  • [8] Unsupervised domain adaptation: A multi-task learning-based method
    Zhang, Jing
    Li, Wanqing
    Ogunbona, Philip
    KNOWLEDGE-BASED SYSTEMS, 2019, 186
  • [9] Deep Learning-Based Positioning With Multi-Task Learning and Uncertainty-Based Fusion
    Foliadis, Anastasios
    Garcia, Mario H. Castaneda
    Stirling-Gallacher, Richard A.
    Thoma, Reiner S.
    IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 1127 - 1141
  • [10] Robust Temporal Smoothness in Multi-Task Learning
    Zhou, Menghui
    Zhang, Yu
    Yang, Yun
    Liu, Tong
    Yang, Po
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11426 - 11434