Long term on-line speaker adaptation for large vocabulary dictation

Author
Thelen, E
Institution
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
On-line speaker adaptation is desirable for speech recognition dictation applications because it makes it possible to improve the system with speaker-specific data obtained from the user. Since the user will work with such a device over a long period, the long-term adaptation performance of a dictation system is more important than the adaptation speed. In contrast to speaker-dependent re-training, on-line speaker adaptation does not require the speaker-specific speech data to be stored, and each adaptation step does not require a large computational effort. In this paper we describe our way of performing on-line Bayesian speaker adaptation using partial traceback. We compare supervised with unsupervised adaptation, and speaker adaptation with speaker-dependent training using the adaptation material. Compared to the speaker-independent startup models, the error rate was halved after five hours of supervised adaptation in our experiments. In the long-term experiments, supervised on-line adaptation performed similarly to speaker-dependent training using the adaptation material.
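The abstract does not give the update equations, so the following is only a minimal sketch of why Bayesian on-line adaptation need not store the speech data. It assumes a textbook MAP update of a single Gaussian mean with a conjugate prior; the class name `OnlineMapMean`, the prior weight `tau`, and the frame weight `gamma` are illustrative assumptions, not the paper's method, and partial traceback is not modelled here.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class OnlineMapMean:
    """Hypothetical incremental MAP estimate of one Gaussian mean.

    Only two running statistics are kept (occupancy and weighted sum),
    so the raw adaptation frames can be discarded after each update.
    """
    prior_mean: List[float]          # speaker-independent startup mean
    tau: float = 10.0                # assumed prior weight (pseudo-count)
    occupancy: float = 0.0           # sum of frame weights seen so far
    weighted_sum: List[float] = field(default_factory=list)

    def __post_init__(self) -> None:
        self.weighted_sum = [0.0] * len(self.prior_mean)

    def update(self, frame: List[float], gamma: float = 1.0) -> None:
        """Fold one feature frame into the statistics; cost is O(dimension)."""
        self.occupancy += gamma
        for i, x in enumerate(frame):
            self.weighted_sum[i] += gamma * x

    def mean(self) -> List[float]:
        """MAP mean: (tau * prior + weighted sum) / (tau + occupancy)."""
        denom = self.tau + self.occupancy
        return [(self.tau * m + s) / denom
                for m, s in zip(self.prior_mean, self.weighted_sum)]


if __name__ == "__main__":
    est = OnlineMapMean(prior_mean=[0.0, 0.0], tau=2.0)
    est.update([3.0, 6.0])
    est.update([3.0, 6.0])
    print(est.mean())
```

With little data the estimate stays close to the speaker-independent prior; as the occupancy grows, it moves toward the speaker's own sample mean, which is in line with the abstract's observation that long-term adaptation approaches speaker-dependent training.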
Pages: 2139-2142
Page count: 4