Cross-task portability of a broadcast news speech recognition system

Cited by: 2
Authors
Bertoldi, N [1]
Brugnara, F [1]
Cettolo, M [1]
Federico, M [1]
Giuliani, D [1]
Affiliations
[1] IRST, ITC, Ctr Ric Sci & Tecnol, I-38050 Povo, Italy
Keywords
acoustic model adaptation; language model adaptation; spontaneous speech phenomena
DOI
10.1016/S0167-6393(01)00074-7
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper reports on experiments in porting the ITC-irst Italian broadcast news recognition system to two spontaneous dialogue domains. Porting was investigated by applying state-of-the-art adaptation methods to the acoustic and language models, and by evaluating the trade-off between performance and the required amount of task-specific annotated data. The use of different levels of supervision for acoustic model adaptation was also studied. By employing 2 h of manually annotated speech, word error rates of 26.0% and 28.4% were achieved by the adapted systems. These results are to be compared with the performance of two domain-specific baseline systems, 22.6% and 21.2%, respectively, which were developed on much more training data. Finally, a robust method is presented for tuning the insertion of spontaneous speech phenomena by the speech decoder. (C) 2001 Elsevier Science B.V. All rights reserved.
Pages: 335-347
Page count: 13
Related papers
50 in total
  • [1] Air traffic control speech recognition system cross-task & speaker adaptation
    de Cordoba, R.
    Ferreiros, J.
    San-Segundo, R.
    Macias-Guarasa, J.
    Montero, J. M.
    Fernandez, F.
    D'Haro, L. F.
    Pardo, J. M.
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2006, 21 (09) : 12 - 17
  • [2] Slovak Broadcast News Speech Recognition and Transcription System
    Lojka, Martin
    Viszlay, Peter
    Stas, Jan
    Hladek, Daniel
    Juhar, Jozef
    [J]. ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2018, 2019, 22 : 385 - 394
  • [3] Connectionist speech recognition of Broadcast News
    Robinson, AJ
    Cook, GD
    Ellis, DPW
    Fosler-Lussier, E
    Renals, SJ
    Williams, DAG
    [J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 27 - 45
  • [4] Speech recognition for Turkish broadcast news
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1054 - 1057
  • [5] Genericity and portability for task-independent speech recognition
    Lefevre, F
    Gauvain, JL
    Lamel, L
    [J]. COMPUTER SPEECH AND LANGUAGE, 2005, 19 (03) : 345 - 363
  • [6] Investigation on Mandarin Broadcast News Speech Recognition
    Hwang, Mei-Yuh
    Lei, Xin
    Wang, Wen
    Shinozaki, Takahiro
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1233 - +
  • [7] A study on Mandarin broadcast news speech recognition
    Chen, CL
    Wang, YR
    Chen, SH
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 257 - 260
  • [8] Improved cross-task recognition using MMIE training
    Córdoba, R
    Woodland, PC
    Gales, MJF
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 85 - 88
  • [9] Cross-Task Crowdsourcing
    Mo, Kaixiang
    Zhong, Erheng
    Yang, Qiang
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 677 - 685
  • [10] ENGLISH BROADCAST NEWS SPEECH RECOGNITION BY HUMANS AND MACHINES
    Thomas, Samuel
    Suzuki, Masayuki
    Huang, Yinghui
    Kurata, Gakuto
    Tuske, Zoltan
    Saon, George
    Kingsbury, Brian
    Picheny, Michael
    Dibert, Tom
    Kaiser-Schatzlein, Alice
    Samko, Bern
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6455 - 6459