A comparative study of model-based adaptation techniques for a compact speech recognizer

Cited by: 0

Authors:

Thiele, F [1]

Bippus, R [1]

Affiliation:

[1] Philips Res Labs, D-52066 Aachen, Germany

Source:

ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, Eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP and MLLR adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.


Pages: 29-32

Page count: 4

