Improved Lattice Rescoring by Using Speech Attributes in Large Vocabulary Continuous Speech Recognition Systems

被引:0
|
作者
Gao, Xinglong [1 ]
Zhang, Qingqing [2 ]
Pan, Jielin [2 ]
机构
[1] Univ Chinese Acad Sci, Informat & Signal Proc, Beijing, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Beijing 100864, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acoustic modeling of Large Vocabulary Continuous Speech Recognition (LVCSR) system which is normally based on context-dependent phone is heavily limited by representative capability between transcriptions and corresponding variation of raw speech utterance. To describe this relationship more accurate, this paper presents an alternative strategy by which speech attributes are used to capture acoustic characteristics to improve performances of LVCSR. Validations on a series of relevant experiments, and it is proven that the speech attributes can be used as complementary knowledge resources that can bring more abundant information than basic phone based system. Hence, speech attribute information is used to be integrated into phone based LVCSR system during lattice rescoring. For both reading and Conversional Telephone Speech (CTS) style LVCSR tasks, experimental results showed that the combined system reduced Word Error Rate (WER) by about 3-5% relatively.
引用
收藏
页码:143 / 147
页数:5
相关论文
共 50 条
  • [1] Language modelling approaches for Turkish large vocabulary continuous speech recognition based on lattice rescoring
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2006 IEEE 14th Signal Processing and Communications Applications, Vols 1 and 2, 2006, : 579 - 582
  • [2] Boosting systems for large vocabulary continuous speech recognition
    Saon, George
    Soltau, Hagen
    [J]. SPEECH COMMUNICATION, 2012, 54 (02) : 212 - 218
  • [3] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [4] ON LATTICE GENERATION FOR LARGE VOCABULARY SPEECH RECOGNITION
    Rybach, David
    Riley, Michael
    Schalkwyk, Johan
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 228 - 235
  • [5] Improving the Accuracy of Large Vocabulary Continuous Speech Recognizer using Dependency Parse Tree and Chomsky Hierarchy in Lattice Rescoring
    Hong, Kai Sze
    Tan, Tien-Ping
    Tang, Enya Kong
    [J]. 2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 167 - 170
  • [6] Improved discriminative training techniques for large vocabulary continuous speech recognition
    Povey, D
    Woodland, PC
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 45 - 48
  • [7] Improved search strategy for large vocabulary continuous Mandarin speech recognition
    Ho, TH
    Yang, KC
    Huang, KH
    Lee, LS
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 825 - 828
  • [8] Using a transcription graph for large vocabulary continuous speech recognition
    Li, Z
    OShaughnessy, D
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 121 - 124
  • [9] Vietnamese Large Vocabulary Continuous Speech Recognition
    Ngoc Thang Vu
    Schultz, Tanja
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
  • [10] Advances in large vocabulary continuous speech recognition
    Zweig, G
    Picheny, M
    [J]. ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291