Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units

Cited by: 7

Authors:

Holter, T ^{[1]}

Svendsen, T ^{[1]}

Affiliations:

[1] Norwegian Univ Sci & Technol, Dept Telecommun, N-7034 Trondheim, Norway

Source:

1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS | 1997

Keywords:

DOI:

10.1109/ASRU.1997.659006

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A major challenge in speech recognition is creating a lexicon which is robust to inter- and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g. acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated from training data. In this paper we propose an algorithm for ASWU-based speech recognition which performs a combined optimisation of the baseforms and the subword models. The resulting system has been tested on the DARPA Resource Management task, and is shown to perform comparably to a baseline phoneme-based system.

Pages: 199-206

Number of pages: 8

50 records in total

[1] Combined optimisation of baseforms and subword models for an HMM based speech recogniser
Holter, T
Svendsen, T
[J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 321 - 324
[2] Speech recognition using automatically derived acoustic baseforms
Rose, RC
Lleida, E
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1271 - 1274
[3] ACOUSTIC MODELING OF SUBWORD UNITS FOR LARGE VOCABULARY SPEAKER INDEPENDENT SPEECH RECOGNITION
LEE, CH
RABINER, LR
PIERACCINI, R
WILPON, JG
[J]. SPEECH AND NATURAL LANGUAGE, 1989, : 280 - 291
[4] Improving the Usage of Subword-Based Units for Turkish Speech Recognition
Cetinkaya, Gozde
Arisoy, Ebru
Saraclar, Murat
[J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[5] SUBWORD UNITS FOR AUTOMATIC SPEECH RECOGNITION OF ANY VOCABULARY
HOLMES, WJ
PEARCE, DJB
[J]. GEC JOURNAL OF RESEARCH, 1993, 11 (01): : 49 - 59
[6] LARGE VOCABULARY SPEECH RECOGNITION USING SUBWORD UNITS
LEE, CH
GAUVAIN, JL
PIERACCINI, R
RABINER, LR
[J]. SPEECH COMMUNICATION, 1993, 13 (3-4) : 263 - 279
[7] Automatic generation of subword units for speech recognition systems
Singh, R
Raj, B
Stern, RM
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 89 - 99
[8] Reduced sets of subword units for continuous speech recognition of Portuguese
dos Santos, SCB
Alcaim, A
[J]. ELECTRONICS LETTERS, 2000, 36 (06) : 586 - 588
[9] An investigation of phone-based subword units for end-to-end speech recognition
Wang, Weiran
Wang, Guangsen
Bhatnagar, Aadyot
Zhou, Yingbo
Xiong, Caiming
Socher, Richard
[J]. INTERSPEECH 2020, 2020, : 1778 - 1782
[10] Acoustic Data-Driven Subword Units Obtained through Segment Embedding and Clustering for Spontaneous Speech Recognition
Bang, Jeong-Uk
Kim, Sang-Hun
Kwon, Oh-Wook
[J]. APPLIED SCIENCES-BASEL, 2020, 10 (06):