Tibetan word segmentation method based on CNN-BiLSTM-CRF model

被引:0
|
作者
Wang, Lili [2 ]
Yang, Hongwu [1 ,2 ,3 ]
Xing, Xiaotian [2 ]
Yan, Yajing [2 ]
机构
[1] Northwest Normal Univ, Coll Educ Technol, Lanzhou 730070, Peoples R China
[2] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730070, Peoples R China
[3] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou 730070, Peoples R China
基金
美国国家科学基金会;
关键词
Convolutional Neural Network; recurrent neural network; Conditional random field; Tibetan word segmentation;
D O I
10.1109/ialp48816.2019.9037661
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a Tibetan word segmentation method based on CNN-BiLSTM-CRF model that merely uses the characters of sentence as the input so that the method does not need large-scale corpus resources and manual features for training. Firstly, we use convolution neural network to train character vectors. Then the character vectors are searched through the character lookup table to form a matrix C by stacking searched results. Then the convolution operation between the matrix C and multiple filter matrices is carried out to obtain the character-level features of each Tibetan word by maximizing the pooling. We input the character vector into the BiLSTM-CRF model, which is suitable for Tibetan word segmentation through the highway network, for getting a Tibetan word segmentation model that is optimized by using the character vector and CRF model. For Tibetan language with rich morphology, fewer parameters and faster training time make this model better than BiLSTM-CRF model in the performance of character level. The experimental results show that character input is sufficient for language modeling. The robustness of Tibetan word segmentation is improved by the model that can achieves 95.17% of the F value.
引用
收藏
页码:319 / 324
页数:6
相关论文
共 50 条
  • [21] Study on MRI Medical Image Segmentation Technology Based on CNN-CRF Model
    Feng, Naiqin
    Geng, Xiuqin
    Qin, Lijuan
    IEEE ACCESS, 2020, 8 : 60505 - 60514
  • [22] A Semisupervised CRF Model for CNN-Based Semantic Segmentation With Sparse Ground Truth
    Maggiolo, Luca
    Marcos, Diego
    Moser, Gabriele
    Serpico, Sebastiano B.
    Tuia, Devis
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [23] Tibetan Word Segmentation Based on Word-position Tagging
    Kang, Caijun
    Jiang, Di
    Long, Congjun
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 239 - 242
  • [24] Network Intrusion Detection Method Based on CNN-BiLSTM-Attention Model
    Dai, Wei
    Li, Xinhui
    Ji, Wenxin
    He, Sicheng
    IEEE ACCESS, 2024, 12 : 53099 - 53111
  • [25] Landslide Displacement Prediction Based on CEEMDAN Method and CNN-BiLSTM Model
    Lin, Zian
    Ji, Yuanfa
    Sun, Xiyan
    SUSTAINABILITY, 2023, 15 (13)
  • [26] A DEEP NEURAL CNN MODEL WITH CRF FOR BREAST MASS SEGMENTATION IN MAMMOGRAMS
    Arora, Ridhi
    Raman, Balasubramanian
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1311 - 1315
  • [27] An Intrusion Detection Method Based on Attention Mechanism to Improve CNN-BiLSTM Model
    Shou, Dingyu
    Li, Chao
    Wang, Zhen
    Cheng, Song
    Hu, Xiaobo
    Zhang, Kai
    Wen, Mi
    Wang, Yong
    COMPUTER JOURNAL, 2023, 67 (05): : 1851 - 1865
  • [28] A file archival integrity check method based on the BiLSTM + CNN model and deep learning
    Li, Jinxun
    Wang, Tingjun
    Ma, Chao
    Lin, Yunxuan
    Yan, Qing
    EGYPTIAN INFORMATICS JOURNAL, 2025, 29
  • [29] Ancient Tibetan Word Segmentation based on Deep Learning
    An, Bo
    Long, Congjun
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 292 - 297
  • [30] Chinese word segmentation model based on BI_GRU_AT_HN_CRF_6i
    Ge Yunsheng
    Jie, Kong
    ICIIP'18: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING, 2018, : 88 - 94