A Vietnamese Language Model Based on Recurrent Neural Network

Cited by: 0
Authors
Viet-Trung Tran [1]
Kiem-Hieu Nguyen [1]
Duc-Hanh Bui [1]
Affiliations
[1] Hanoi Univ Sci & Technol, Hanoi, Vietnam
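Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract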
Language modeling plays a critical role in many natural language processing (NLP) tasks such as text prediction, machine translation, and speech recognition. Traditional statistical language models (e.g. n-gram models) can only offer words that have been seen before and cannot capture long word context. Neural language models provide a promising way to overcome this shortcoming of statistical language models. This paper investigates Recurrent Neural Network (RNN) language models for Vietnamese at the character and syllable levels. Experiments were conducted on a large dataset of 24M syllables, constructed from 1,500 movie subtitles. The experimental results show that our RNN-based language models yield reasonable performance on the movie subtitle dataset. Concretely, our models outperform n-gram language models in terms of perplexity score.
Pages: 274-278
Page count: 5
Related papers
50 in total
  • [21] Rapid bayesian learning for recurrent neural network language model
    Chien, Jen-Tzung
    Ku, Yuan-Chu
    Huang, Mou-Yue
    Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, 2014, : 34 - 38
  • [22] Recurrent neural network language model adaptation with curriculum learning
    Shi, Yangyang
    Larson, Martha
    Jonker, Catholijn M.
    COMPUTER SPEECH AND LANGUAGE, 2015, 33 (01): : 136 - 154
  • [23] IMPROVED NEURAL LANGUAGE MODEL FUSION FOR STREAMING RECURRENT NEURAL NETWORK TRANSDUCER
    Kim, Suyoun
    Yuan Shangguan
    Mahadeokar, Jay
    Bruguier, Antoine
    Fuegen, Christian
    Seltzer, Michael L.
    Le, Duc
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7333 - 7337
  • [24] Recurrent Neural Network based Language Modeling in Meeting Recognition
    Kombrink, Stefan
    Mikolov, Tomas
    Karafiat, Martin
    Burget, Lukas
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2888 - 2891
  • [25] PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE MODELS
    Liu, X.
    Chen, X.
    Gales, M. J. F.
    Woodland, P. C.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5406 - 5410
  • [26] A Neural Network based Vietnamese Chatbot
    Trang Nguyen
    Shcherbakov, Maxim
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART), 2018, : 147 - 149
  • [27] CACHE BASED RECURRENT NEURAL NETWORK LANGUAGE MODEL INFERENCE FOR FIRST PASS SPEECH RECOGNITION
    Huang, Zhiheng
    Zweig, Geoffrey
    Dumoulin, Benoit
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [28] RECURRENT NEURAL NETWORK LANGUAGE MODEL TRAINING USING NATURAL GRADIENT
    Yu, Jianwei
    Lam, Max. W. Y.
    Chen, Xie
    Hu, Shoukang
    Liu, Songxiang
    Wu, Xixin
    Liu, Xunying
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7260 - 7264
  • [29] An Improved Recurrent Neural Network Language Model with Context Vector Features
    Zhang, Jian
    Qu, Dan
    Li, Zhen
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 828 - 831
  • [30] Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition
    Li, Ke
    Xu, Hainan
    Wang, Yiming
    Povey, Daniel
    Khudanpur, Sanjeev
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3373 - 3377