Robust Multi-task Learning-based Korean POS Tagging to OvercomeWord Spacing Errors

被引：0

作者：

Park, Cheoneum ^{[1
,2
]}

Kim, Juae ^{[3
,4
]}

机构：

[1] SK Telecom, 65 Eulji Ro, Seoul 04539, South Korea

[2] Hyundai Motor Co, 65 Eulji Ro, Seoul 04539, South Korea

[3] Hankuk Univ Foreign Studies, 107 Imun Ro, Seoul 02451, South Korea

[4] Hyundai Motor Co, 107 Imun Ro, Seoul 02451, South Korea

来源：

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2023年 / 22卷 / 06期

关键词：

Morphological analysis; part-of-speech tagging; word spacing; multi-task learning;

D O I：

10.1145/3591206

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

End-to-end neural network-based approaches have recently demonstrated significant improvements in natural language processing (NLP). However, in the NLP application such as assistant systems, NLP components are still processed to extract results using a pipeline paradigm. The pipeline-based concept has issues with error propagation. In Korean, morphological analysis and part-of-speech (POS) tagging step, incorrectly analyzing POS tags for a sentence containing spacing errors negatively affects other modules behind the POS module. Hence, we present a multi-task learning-based POS tagging neural model for Korean with word spacing challenges. When we apply this model to the Korean morphological analysis and POS tagging, we get findings that are robust to word spacing errors. We adopt syllable-level input and output formats, as well as a simple structure for ELECTRA and RNN-CRF models for multi-task learning, and we achieve a good performance 98.30 of F1, better than previous studies on the Sejong corpus test set.

引用

下载

页数：13

共 50 条

[1] Local Learning-based Multi-task Clustering
Zhong, Guo
Pun, Chi-Man
KNOWLEDGE-BASED SYSTEMS, 2022, 255
[2] Multi-task Learning-Based Spoofing-Robust Automatic Speaker Verification System
Yuanjun Zhao
Roberto Togneri
Victor Sreeram
Circuits, Systems, and Signal Processing, 2022, 41 : 4068 - 4089
[3] Multi-task Learning-Based Spoofing-Robust Automatic Speaker Verification System
Zhao, Yuanjun
Togneri, Roberto
Sreeram, Victor
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (07) : 4068 - 4089
[4] Transfer Learning-Based Evolutionary Multi-task Optimization
Li, Shuai
Zhu, Xiaobing
Li, Xi
BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023, 2024, 2061 : 14 - 28
[5] Robust Estimator based Adaptive Multi-Task Learning
Zhu, Peiyuan
Chen, Cailian
He, Jianping
Zhu, Shanying
2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 740 - 747
[6] ADAPTIVE AND ROBUST MULTI-TASK LEARNING
Duan, Yaqi
Wang, Kaizheng
ANNALS OF STATISTICS, 2023, 51 (05): : 2015 - 2039
[7] Multi-Task Learning-Based Immunofluorescence Classification of Kidney Disease
Pan, Sai
Fu, Yibing
Chen, Pu
Liu, Jiaona
Liu, Weicen
Wang, Xiaofei
Cai, Guangyan
Yin, Zhong
Wu, Jie
Tang, Li
Wang, Yong
Duan, Shuwei
Dai, Ning
Jiang, Lai
Xu, Mai
Chen, Xiangmei
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (20)
[8] Unsupervised domain adaptation: A multi-task learning-based method
Zhang, Jing
Li, Wanqing
Ogunbona, Philip
KNOWLEDGE-BASED SYSTEMS, 2019, 186
[9] Deep Learning-Based Positioning With Multi-Task Learning and Uncertainty-Based Fusion
Foliadis, Anastasios
Garcia, Mario H. Castaneda
Stirling-Gallacher, Richard A.
Thoma, Reiner S.
IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 1127 - 1141
[10] Robust Temporal Smoothness in Multi-Task Learning
Zhou, Menghui
Zhang, Yu
Yang, Yun
Liu, Tong
Yang, Po
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11426 - 11434

← 1 2 3 4 5 →