Tibetan word segmentation method based on CNN-BiLSTM-CRF model

Cited by: 0
Authors
Wang, Lili [2]
Yang, Hongwu [1, 2, 3]
Xing, Xiaotian [2]
Yan, Yajing [2]
Affiliations
[1] Northwest Normal Univ, Coll Educ Technol, Lanzhou 730070, Peoples R China
[2] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730070, Peoples R China
[3] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou 730070, Peoples R China
Funding
National Science Foundation (United States)
Keywords
Convolutional neural network; Recurrent neural network; Conditional random field; Tibetan word segmentation
DOI
10.1109/ialp48816.2019.9037661
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a Tibetan word segmentation method based on a CNN-BiLSTM-CRF model that uses only the characters of a sentence as input, so the method needs neither large-scale corpus resources nor manually designed features for training. First, we use a convolutional neural network to train character vectors. The character vectors are retrieved from a character lookup table and stacked to form a matrix C. The matrix C is then convolved with multiple filter matrices, and max pooling over the results yields the character-level features of each Tibetan word. We feed the character vectors through a highway network into the BiLSTM-CRF model, which is suitable for Tibetan word segmentation, to obtain a segmentation model optimized with the character vectors and the CRF model. For Tibetan, a language with rich morphology, the model has fewer parameters and trains faster, which makes it better than the BiLSTM-CRF model at the character level. The experimental results show that character input is sufficient for language modeling. The model improves the robustness of Tibetan word segmentation and achieves an F value of 95.17%.
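The feature-extraction steps named in the abstract (character lookup, stacking into the matrix C, convolution with several filter matrices, max pooling, and a highway layer) can be sketched roughly as follows. This is a minimal illustration in pure Python, not the authors' implementation: the function names `char_cnn_features` and `highway`, the tanh and ReLU nonlinearities, the zero padding, the random weights, and the use of Unicode code points as "characters" are all assumptions borrowed from common character-CNN practice. The BiLSTM and CRF stages are omitted.

```python
import math
import random


def char_cnn_features(char_ids, lookup, filters):
    """Return one max-pooled convolution feature per filter.

    lookup:  list of d-dimensional character vectors (the lookup table).
    filters: list of filter matrices, each `width` rows of d weights.
    """
    # Stack the looked-up vectors into the matrix C, one row per character.
    C = [lookup[i] for i in char_ids]
    dim = len(lookup[0])
    features = []
    for H in filters:
        width = len(H)
        # Assumption: zero-pad short inputs so at least one window exists.
        rows = C + [[0.0] * dim] * max(0, width - len(C))
        responses = []
        # Slide the filter over every window of `width` consecutive rows.
        for start in range(len(rows) - width + 1):
            total = sum(
                rows[start + r][k] * H[r][k]
                for r in range(width)
                for k in range(dim)
            )
            responses.append(math.tanh(total))
        # Max pooling keeps the strongest response of this filter.
        features.append(max(responses))
    return features


def highway(y, W_h, b_h, W_t, b_t):
    """One highway layer: z = t * relu(W_h y + b_h) + (1 - t) * y."""
    n = len(y)
    z = []
    for j in range(n):
        h = max(0.0, sum(W_h[j][k] * y[k] for k in range(n)) + b_h[j])
        gate_in = sum(W_t[j][k] * y[k] for k in range(n)) + b_t[j]
        t = 1.0 / (1.0 + math.exp(-gate_in))  # transform gate in (0, 1)
        z.append(t * h + (1.0 - t) * y[j])    # carry the rest of y through
    return z


# Toy usage with random, untrained weights (hypothetical values).
rng = random.Random(0)
word = "བོད"  # treated as three Unicode code points
vocab = {ch: i for i, ch in enumerate(sorted(set(word)))}
dim = 4
lookup = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in vocab]
filters = [
    [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(width)]
    for width in (1, 2, 3)
]
y = char_cnn_features([vocab[ch] for ch in word], lookup, filters)
n = len(y)
W_h = [[rng.uniform(-1, 1) for _ in range(n)] for _ in range(n)]
W_t = [[rng.uniform(-1, 1) for _ in range(n)] for _ in range(n)]
z = highway(y, W_h, [0.0] * n, W_t, [-2.0] * n)
print(y)
print(z)
```

In this sketch each filter contributes exactly one feature, so the length of the feature vector equals the number of filters regardless of word length. The negative gate bias in the usage example is a conventional choice that initially favors carrying the input through unchanged; the source does not state how the highway network was initialized.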
Pages: 319-324
Number of pages: 6