Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units

被引:7
|
作者
Holter, T [1 ]
Svendsen, T [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Telecommun, N-7034 Trondheim, Norway
关键词
D O I
10.1109/ASRU.1997.659006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major challenge in speech recognition is creating a lexicon which is robust to inter-and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g. acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the base forms describing the vocabulary words in terms of the recognition units need to be generated from training data. In this paper we propose an algorithm for ASWU-based speech recognition which performs a combined optimisation of the baseforms and the subword models. The resulting system has been tested on the DARPA Resource Management task, and is shown to perform comparable to a baseline phoneme based system.
引用
收藏
页码:199 / 206
页数:8
相关论文
共 50 条
  • [1] Combined optimisation of baseforms and subword models for an HMM based speech recogniser
    Holter, T
    Svendsen, T
    [J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 321 - 324
  • [2] Speech recognition using automatically derived acoustic baseforms
    Rose, RC
    Lleida, E
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1271 - 1274
  • [3] ACOUSTIC MODELING OF SUBWORD UNITS FOR LARGE VOCABULARY SPEAKER INDEPENDENT SPEECH RECOGNITION
    LEE, CH
    RABINER, LR
    PIERACCINI, R
    WILPON, JG
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 280 - 291
  • [4] Improving the Usage of Subword-Based Units for Turkish Speech Recognition
    Cetinkaya, Gozde
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [5] SUBWORD UNITS FOR AUTOMATIC SPEECH RECOGNITION OF ANY VOCABULARY
    HOLMES, WJ
    PEARCE, DJB
    [J]. GEC JOURNAL OF RESEARCH, 1993, 11 (01): : 49 - 59
  • [6] LARGE VOCABULARY SPEECH RECOGNITION USING SUBWORD UNITS
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    [J]. SPEECH COMMUNICATION, 1993, 13 (3-4) : 263 - 279
  • [7] Automatic generation of subword units for speech recognition systems
    Singh, R
    Raj, B
    Stern, RM
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 89 - 99
  • [8] Reduced sets of subword units for continuous speech recognition of Portuguese
    dos Santos, SCB
    Alcaim, A
    [J]. ELECTRONICS LETTERS, 2000, 36 (06) : 586 - 588
  • [9] An investigation of phone-based subword units for end-to-end speech recognition
    Wang, Weiran
    Wang, Guangsen
    Bhatnagar, Aadyot
    Zhou, Yingbo
    Xiong, Caiming
    Socher, Richard
    [J]. INTERSPEECH 2020, 2020, : 1778 - 1782
  • [10] Acoustic Data-Driven Subword Units Obtained through Segment Embedding and Clustering for Spontaneous Speech Recognition
    Bang, Jeong-Uk
    Kim, Sang-Hun
    Kwon, Oh-Wook
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (06):