Architectures of neural networks applied for LVCSR language modeling

被引:7
|
作者
Gajecki, Leszek [1 ]
机构
[1] Univ Informat Technol & Management, PL-35205 Rzeszow, Poland
关键词
Language modeling; Speech recognition; Neural network architectures; Self-organized maps;
D O I
10.1016/j.neucom.2013.11.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The n-gram model and its derivatives are both widely applied solutions for Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However, Slavonic languages require a language model that considers word order less strictly than English, i.e. the language that is the subject of most linguistic research. Such a language model is a necessary module in LVCSR systems, because it increases the probability of finding the right word sequences. The aim of the presented work is to create a language module for the Polish language with the application of neural networks. Here, the capabilities of Kohonen's Self-Organized Maps will be explored to find the associations between words in spoken utterances. To fulfill such a task, the application of neural networks to evaluate sequences of words will be presented. Then, the next step of language model development, the network architectures, will be discussed. The network proposed for the construction of the considered model is inspired by the Cocke-Young-Kasami parsing algorithm. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:46 / 53
页数:8
相关论文
共 50 条
  • [1] Improving Russian LVCSR Using Deep Neural Networks for Acoustic and Language Modeling
    Kipyatkova, Irina
    [J]. SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 291 - 300
  • [2] Phone duration modeling for LVCSR using neural networks
    Hadian, Hossein
    Povey, Daniel
    Sameti, Hossein
    Khudanpur, Sanjeev
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 518 - 522
  • [3] Convolutional Neural Networks for Acoustic Modeling of Raw Time Signal in LVCSR
    Golik, Pavel
    Tueske, Zoltan
    Schlueter, Ralf
    Ney, Hermann
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 26 - 30
  • [4] A hybrid input-type recurrent neural network for LVCSR language modeling
    Chunwijitra, Vataya
    Chotimongkol, Ananlada
    Wutiwiwatchai, Chai
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016,
  • [5] Factored Language Modeling for Russian LVCSR
    Vazhenina, Daria
    Markov, Konstantin
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY & UBI-MEDIA COMPUTING (ICAST-UMEDIA), 2013, : 205 - 210
  • [6] A hybrid input-type recurrent neural network for LVCSR language modeling
    Vataya Chunwijitra
    Ananlada Chotimongkol
    Chai Wutiwiwatchai
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [7] DEEP CONVOLUTIONAL NEURAL NETWORKS FOR LVCSR
    Sainath, Tara N.
    Mohamed, Abdel-rahman
    Kingsbury, Brian
    Ramabhadran, Bhuvana
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8614 - 8618
  • [8] Acoustic Modeling with Deep Neural Networks Using Raw Time Signal for LVCSR
    Tueske, Zoltan
    Golik, Pavel
    Schluter, Ralf
    Ney, Hermann
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 890 - 894
  • [9] Towards Better Language Modeling for Thai LVCSR
    Jongtaveesataporn, Markpong
    Thienlikit, Issara
    Wutiwiwatchai, Chai
    Furui, Sadaoki
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 317 - +
  • [10] Turkish LVCSR: Database preparation and language modeling for an aglutinative language
    Mengusoglu, E
    Deroo, O
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4018 - 4018