Improving Named Entity Recognition in Vietnamese Texts by a Character-Level Deep Lifelong Learning Model

被引:3
|
作者
Ngoc-Vu Nguyen [1 ]
Thi-Lan Nguyen [2 ]
Cam-Van Nguyen Thi [2 ]
Mai-Vu Tran [2 ]
Tri-Thanh Nguyen [1 ,2 ]
Quang-Thuy Ha [2 ]
机构
[1] Minist Nat Resources & Environm MONRE, Dept Informat Technol, 10 Ton That Thuyet, Hanoi, Vietnam
[2] Vietnam Natl Univ Hanoi VNU, Univ Engn & Technol UET, 144 Xuan Thuy, Hanoi, Vietnam
关键词
Named entity recognition; deep lifelong learning; DeepLML for NER in Vietnamese texts;
D O I
10.1142/S219688881950026X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is a fundamental task which affects the performance of its dependent task, e.g. machine translation. Lifelong machine learning (LML) is a continuous learning process, in which the knowledge base accumulated from previous tasks will be used to improve future learning tasks having few samples. Since there are a few studies on LML based on deep neural networks for NER, especially in Vietnamese, we propose a lifelong learning model based on deep learning with a CRFs layer, named DeepLML-NER, for NER in Vietnamese texts. DeepLML-NER includes an algorithm to extract the knowledge of "prefix-features" of named entities in previous domains. Then the model uses the knowledge in the knowledge base to solve the current NER task. Preprocessing and model parameter tuning are also investigated to improve the performance. The effect of the model was demonstrated by in-domain and cross-domain experiments, achieving promising results.
引用
收藏
页码:471 / 487
页数:17
相关论文
共 50 条
  • [1] A Character-Level Deep Lifelong Learning Model for Named Entity Recognition in Vietnamese Text
    Ngoc-Vu Nguyen
    Thi-Lan Nguyen
    Cam-Van Nguyen Thi
    Mai-Vu Tran
    Quang-Thuy Ha
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 90 - 102
  • [2] Chinese Named Entity Recognition with Character-Level BLSTM and Soft Attention Model
    Jize Yin
    Senlin Luo
    Zhouting Wu
    Limin Pan
    [J]. Journal of Beijing Institute of Technology, 2020, 29 (01) : 60 - 71
  • [3] Chinese Named Entity Recognition with Character-Level BLSTM and Soft Attention Model
    Yin J.
    Luo S.
    Wu Z.
    Pan L.
    [J]. Journal of Beijing Institute of Technology (English Edition), 2020, 29 (01): : 60 - 71
  • [4] Character-level neural network for biomedical named entity recognition
    Gridach, Mourad
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 : 85 - 91
  • [5] Evaluating corpora for named entity recognition using character-level features
    Whitelaw, C
    Patrick, J
    [J]. AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 910 - 921
  • [6] End-to-End Recurrent Neural Network Models for Vietnamese Named Entity Recognition: Word-Level Vs. Character-Level
    Thai-Hoang Pham
    Phuong Le-Hong
    [J]. COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 219 - 232
  • [7] Named-Entity Recognition in Sports Field Based on a Character-Level Graph Convolutional Network
    Seti, Xieraili
    Wumaier, Aishan
    Yibulayin, Turgen
    Paerhati, Diliyaer
    Wang, Lulu
    Saimaiti, Alimu
    [J]. INFORMATION, 2020, 11 (01)
  • [8] Deep Learning Speech Synthesis Model for Word/Character-Level Recognition in the Tamil Language
    Rajendran, Sukumar
    Raja, Kiruba Thangam
    Nagarajan, G.
    Dass, A. Stephen
    Kumar, M. Sandeep
    Jayagopal, Prabhu
    [J]. INTERNATIONAL JOURNAL OF E-COLLABORATION, 2023, 19 (04) : 20 - 20
  • [9] Character Feature Learning for Named Entity Recognition
    Zeng, Ping
    Tan, Qingping
    Zhang, Haoyu
    Meng, Xiankai
    Zhang, Zhuo
    Xu, Jianjun
    Lei, Yan
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (07) : 1811 - 1815
  • [10] Improving dictionary-based named entity recognition with deep learning
    Nastou, Katerina
    Koutrouli, Mikaela
    Pyysalo, Sampo
    Jensen, Lars Juhl
    [J]. BIOINFORMATICS, 2024, 40 : ii45 - ii52