Robust Multi-task Learning-based Korean POS Tagging to OvercomeWord Spacing Errors

被引:0
|
作者
Park, Cheoneum [1 ,2 ]
Kim, Juae [3 ,4 ]
机构
[1] SK Telecom, 65 Eulji Ro, Seoul 04539, South Korea
[2] Hyundai Motor Co, 65 Eulji Ro, Seoul 04539, South Korea
[3] Hankuk Univ Foreign Studies, 107 Imun Ro, Seoul 02451, South Korea
[4] Hyundai Motor Co, 107 Imun Ro, Seoul 02451, South Korea
关键词
Morphological analysis; part-of-speech tagging; word spacing; multi-task learning;
D O I
10.1145/3591206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end neural network-based approaches have recently demonstrated significant improvements in natural language processing (NLP). However, in the NLP application such as assistant systems, NLP components are still processed to extract results using a pipeline paradigm. The pipeline-based concept has issues with error propagation. In Korean, morphological analysis and part-of-speech (POS) tagging step, incorrectly analyzing POS tags for a sentence containing spacing errors negatively affects other modules behind the POS module. Hence, we present a multi-task learning-based POS tagging neural model for Korean with word spacing challenges. When we apply this model to the Korean morphological analysis and POS tagging, we get findings that are robust to word spacing errors. We adopt syllable-level input and output formats, as well as a simple structure for ELECTRA and RNN-CRF models for multi-task learning, and we achieve a good performance 98.30 of F1, better than previous studies on the Sejong corpus test set.
引用
下载
收藏
页数:13
相关论文
共 50 条
  • [31] Robust Stuttering Detection via Multi-task and Adversarial Learning
    Sheikh, Shakeel A.
    Sahidullah, Md
    Hirsch, Fabrice
    Ouni, Slim
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 190 - 194
  • [32] A multi-task learning-based automatic blind identification procedure for operational modal analysis
    Shu, Jiangpeng
    Zhang, Congguang
    Gao, Yifan
    Niu, Yanbo
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 187
  • [33] Deep learning-based multi-task prediction system for plant disease and species detection
    Keceli, Ali Seydi
    Kaya, Aydin
    Catal, Cagatay
    Tekinerdogan, Bedir
    ECOLOGICAL INFORMATICS, 2022, 69
  • [34] Robust Visual Tracking via Multi-Task Sparse Learning
    Zhang, Tianzhu
    Ghanem, Bernard
    Liu, Si
    Ahuja, Narendra
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2042 - 2049
  • [35] Stratified Multi-Task Learning for Robust Spotting of Scene Texts
    Dasgupta, Kinjal
    Das, Sudip
    Bhattacharya, Ujjwal
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3130 - 3137
  • [36] Robust Online Multi-Task Learning with Correlative and Personalized Structures
    Yang, Peng
    Zhao, Peilin
    Gao, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (11) : 2510 - 2521
  • [37] Wind power forecasting: A hybrid forecasting model and multi-task learning-based framework
    Tang, Yugui
    Yang, Kuo
    Zhang, Shujing
    Zhang, Zhen
    ENERGY, 2023, 278
  • [38] MTL-FFDET: A Multi-Task Learning-Based Model for Forest Fire Detection
    Lu, Kangjie
    Huang, Jingwen
    Li, Junhui
    Zhou, Jiashun
    Chen, Xianliang
    Liu, Yunfei
    FORESTS, 2022, 13 (09):
  • [39] Part-of-Speech (POS) Tagging Using Deep Learning-Based Approaches on the Designed Khasi POS Corpus
    Warjri, Sunita
    Pakray, Partha
    Lyngdoh, Saralin A.
    Maji, Arnab Kumar
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (03)
  • [40] Fabric Retrieval Based on Multi-Task Learning
    Xiang, Jun
    Zhang, Ning
    Pan, Ruru
    Gao, Weidong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1570 - 1582