Cross-task portability of a broadcast news speech recognition system

Cited by: 2

Authors:

Bertoldi, N [1]

Brugnara, F [1]

Cettolo, M [1]

Federico, M [1]

Giuliani, D [1]

Affiliation:

[1] IRST, ITC, Ctr Ric Sci & Tecnol, I-38050 Povo, Italy

Source:

SPEECH COMMUNICATION | 2002 / Vol. 38 / Issue 3-4

Keywords:

acoustic model adaptation; language model adaptation; spontaneous speech phenomena

DOI:

10.1016/S0167-6393(01)00074-7

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

This paper reports on experiments in porting the ITC-irst Italian broadcast news recognition system to two spontaneous dialogue domains. Porting was investigated by applying state-of-the-art adaptation methods to the acoustic and language models, and by evaluating the trade-off between performance and the amount of task-specific annotated data required. The use of different levels of supervision for acoustic model adaptation was also studied. By employing 2 h of manually annotated speech, the adapted systems achieved word error rates of 26.0% and 28.4%. These results are to be compared with the performance of two domain-specific baseline systems, 22.6% and 21.2%, respectively, which were developed on much more training data. Finally, a robust method is presented that allows the speech decoder's insertion of spontaneous speech phenomena to be tuned. (C) 2001 Elsevier Science B.V. All rights reserved.

Pages: 335-347

Number of pages: 13
