Improved Lattice Rescoring by Using Speech Attributes in Large Vocabulary Continuous Speech Recognition Systems

被引:0
|
作者
Gao, Xinglong [1 ]
Zhang, Qingqing [2 ]
Pan, Jielin [2 ]
机构
[1] Univ Chinese Acad Sci, Informat & Signal Proc, Beijing, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Beijing 100864, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acoustic modeling of Large Vocabulary Continuous Speech Recognition (LVCSR) system which is normally based on context-dependent phone is heavily limited by representative capability between transcriptions and corresponding variation of raw speech utterance. To describe this relationship more accurate, this paper presents an alternative strategy by which speech attributes are used to capture acoustic characteristics to improve performances of LVCSR. Validations on a series of relevant experiments, and it is proven that the speech attributes can be used as complementary knowledge resources that can bring more abundant information than basic phone based system. Hence, speech attribute information is used to be integrated into phone based LVCSR system during lattice rescoring. For both reading and Conversional Telephone Speech (CTS) style LVCSR tasks, experimental results showed that the combined system reduced Word Error Rate (WER) by about 3-5% relatively.
引用
收藏
页码:143 / 147
页数:5
相关论文
共 50 条
  • [21] The RWTH large vocabulary continuous speech recognition system
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, F
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
  • [22] Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition
    Palecek, Karel
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 767 - 776
  • [23] Accent Issues in Large Vocabulary Continuous Speech Recognition
    Chao Huang
    Tao Chen
    Eric Chang
    [J]. International Journal of Speech Technology, 2004, 7 (2-3) : 141 - 153
  • [24] Experimenting with lipreading for large vocabulary continuous speech recognition
    Palecek, Karel
    [J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2018, 12 (04) : 309 - 318
  • [25] CONNECTIONIST APPROACHES TO LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    SAWAI, H
    MINAMI, Y
    MIYATAKE, M
    WAIBEL, A
    SHIKANO, K
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1834 - 1844
  • [26] Confidence measures for large vocabulary continuous speech recognition
    Wessel, F
    Schlüter, R
    Macherey, K
    Ney, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 288 - 298
  • [27] Experimenting with lipreading for large vocabulary continuous speech recognition
    Karel Paleček
    [J]. Journal on Multimodal User Interfaces, 2018, 12 : 309 - 318
  • [28] Development of Large Vocabulary Continuous Speech Recognition for Polish
    Demenko, G.
    Szymanski, M.
    Cecko, R.
    Kusmierek, E.
    Lange, M.
    Wegner, K.
    Klessa, K.
    Owsianny, M.
    [J]. ACTA PHYSICA POLONICA A, 2012, 121 (1A) : A86 - A91
  • [29] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    [J]. INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
  • [30] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,