A Vietnamese Language Model Based on Recurrent Neural Network

Cited by: 0
Authors
Viet-Trung Tran [1]
Kiem-Hieu Nguyen [1]
Duc-Hanh Bui [1]
Affiliation
[1] Hanoi University of Science and Technology, Hanoi, Vietnam
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Language modeling plays a critical role in many natural language processing (NLP) tasks such as text prediction, machine translation, and speech recognition. Traditional statistical language models (e.g., n-gram models) can only offer words that have been seen before and cannot capture long word contexts. Neural language models provide a promising way to overcome this shortcoming of statistical language models. This paper investigates recurrent neural network (RNN) language models for Vietnamese at the character and syllable levels. Experiments were conducted on a large dataset of 24M syllables, constructed from 1,500 movie subtitles. The experimental results show that our RNN-based language models yield reasonable performance on the movie subtitle dataset. Concretely, our models outperform n-gram language models in terms of perplexity score.
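The abstract compares models by perplexity. As a rough illustration only, the sketch below computes perplexity in its standard form, the exponential of the average negative log-probability per token. The function name `perplexity` and the probability values are hypothetical and are not taken from the paper, which may compute or normalize its scores differently.

```python
import math


def perplexity(token_probs):
    """Perplexity of a sequence, given the model's probability for each token.

    Illustrative helper (not from the paper): exp of the mean negative
    log-probability, using natural logarithms.
    """
    if not token_probs:
        raise ValueError("need at least one token probability")
    # Average negative log-likelihood per token.
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_nll)


# Made-up per-token probabilities for two hypothetical models on the same text.
weaker_model = [0.25, 0.25, 0.25, 0.25]
stronger_model = [0.5, 0.5, 0.5, 0.5]

print(perplexity(weaker_model))
print(perplexity(stronger_model))
```

A lower value means the model assigns higher probability to the observed text, which is the sense in which one language model outperforms another on this metric.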
Pages: 274-278
Number of pages: 5