Cross-task portability of a broadcast news speech recognition system

被引:2
|
作者
Bertoldi, N [1 ]
Brugnara, F [1 ]
Cettolo, M [1 ]
Federico, M [1 ]
Giuliani, D [1 ]
机构
[1] IRST, ITC, Ctr Ric Sci & Tecnol, I-38050 Povo, Italy
关键词
acoustic model adaptation; language model adaptation; spontaneous speech phenomena;
D O I
10.1016/S0167-6393(01)00074-7
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper reports on experiments of porting the ITC-irst Italian broadcast news recognition system to two spontaneous dialogue domains. Porting was investigated by applying state-of-the-art adaptation methods on acoustic and language models, and by evaluating the trade-off between performance and required amount of task specific annotated data. The use of different levels of supervision for acoustic model adaptation was also studied. By employing 2 It of manually annotated speech, word error rates of 26.0% and 28.4% were achieved by the adapted systems. These results are to be compared with the performance of two domain specific baseline systems, 22.6% and 21.2%, respectively, which were developed on much more training data. Finally, a robust method is presented that allows to tune the insertion of spontaneous speech phenomena by the speech decoder. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:335 / 347
页数:13
相关论文
共 50 条
  • [31] An Enroll-to-Verify Approach for Cross-Task Unseen Emotion Class Recognition
    Li, Jeng-Lin
    Lee, Chi-Chun
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3066 - 3077
  • [32] CROSS-TASK VALIDATION OF FUNCTIONAL MEASUREMENT
    ANDERSON, NH
    [J]. PERCEPTION & PSYCHOPHYSICS, 1972, 12 (05): : 389 - &
  • [33] The costs and benefits of cross-task priming
    Florian Waszak
    Bernhard Hommel
    [J]. Memory & Cognition, 2007, 35 : 1175 - 1186
  • [34] Cross-task perceptual learning of object recognition in simulated retinal implant perception
    Wang, Lihui
    Sharifian, Fariba
    Napp, Jonathan
    Nath, Carola
    Pollmann, Stefan
    [J]. JOURNAL OF VISION, 2018, 18 (13): : 1 - 14
  • [35] Age Differences in Cross-Task Bleeding
    Nicosia, Jessica
    Balota, David
    [J]. PSYCHOLOGY AND AGING, 2020, 35 (06) : 881 - 893
  • [36] Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription
    Chaloupka, Josef
    Nouza, Jan
    Malek, Jiri
    Silovsky, Jan
    [J]. 2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015,
  • [37] The costs and benefits of cross-task priming
    Waszak, Florian
    Hommel, Bernhard
    [J]. MEMORY & COGNITION, 2007, 35 (05) : 1175 - 1186
  • [38] CROSS-TASK FACILITATION IN SEMANTIC MEMORY
    MACLEOD, CM
    VOUMVAKIS, S
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1980, 16 (03) : 153 - 153
  • [39] From broadcast news to spontaneous dialogue transcription: Portability issues
    Bertoldi, N
    Brugnara, F
    Cettolo, M
    Federico, M
    Giuliani, D
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 37 - 40
  • [40] Simultaneous subtitling system for broadcast news programs with a speech recognizer
    Ando, A
    Imai, T
    Kobayashi, A
    Homma, S
    Goto, J
    Seiyama, N
    Mishima, T
    Kobayakawa, T
    Sato, S
    Onoe, K
    Segi, H
    Imai, A
    Matsui, A
    Nakamura, A
    Tanaka, H
    Takagi, T
    Miyasaka, E
    Isono, H
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (01) : 15 - 25