Speaker adaptation techniques for speech recognition using probabilistic models

被引:3
|
作者
Shinoda, K [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
关键词
speech recognition; speaker adaptation; hidden Markov model;
D O I
10.1002/ecjc.20207
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speech recognition, speaker adaptation refers to the range of techniques whereby a speech recognition system is adapted to the acoustic features of a specific user using a small sample of utterances from that user. In recent years the practical development of speaker-independent speech recognition systems using continuous density hidden Markov models has seen significant progress; however, the recognition performance of these systems has not yet reached that of speaker-dependent speech recognition systems in which a user's speech is registered beforehand. Much hope has therefore been placed on the establishment of speaker adaptation techniques that can bring performance of a speaker-independent system Lip to that of a speaker-dependent one using the smallest amounts of data. In this paper we present a survey of previous research into speaker adaptation techniques focusing particularly on three important approaches in this area: maximum a posteriori (MAP) parameter estimation, maximum likelihood linear regression (MLLR), and eigenvoices. We also discuss approaches that combine these techniques in a lateral fashion. (C) 2005 Wiley Periodicals, Inc.
引用
收藏
页码:25 / 42
页数:18
相关论文
共 50 条
  • [1] SPEAKER ADAPTATION IN SPEECH RECOGNITION USING LINEAR-REGRESSION TECHNIQUES
    COX, S
    [J]. ELECTRONICS LETTERS, 1992, 28 (22) : 2093 - 2094
  • [3] Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
    Kosaka, Tetsuo
    Takeda, Yuui
    Ito, Takashi
    Kato, Masaharu
    Kohda, Masaki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2363 - 2369
  • [4] Speaker adaptation techniques for speech recognition with a speaker-independent phonetic recognizer
    Kim, WG
    Jang, M
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 95 - 100
  • [5] INVESTIGATIONS ON SPEAKER ADAPTATION OF LSTM RNN MODELS FOR SPEECH RECOGNITION
    Liu, Chaojun
    Wang, Yongqiang
    Kumar, Kshitiz
    Gong, Yifan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5020 - 5024
  • [6] SPEAKER ADAPTATION USING SPECTRAL INTERPOLATION FOR SPEECH RECOGNITION
    SHINODA, K
    ISO, KI
    WATANABE, T
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1994, 77 (10): : 1 - 11
  • [7] Analysis on MAP and MLLR Based Speaker Adaptation Techniques in Speech Recognition
    Ramya, T.
    Christina, Lilly S.
    Vijayalakshmi, P.
    Nagarajan, T.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2014), 2014, : 1753 - 1758
  • [8] AUTOMATIC SPEAKER AUTHENTICATION USING SPEECH RECOGNITION TECHNIQUES
    MEEKER, WF
    MARTIN, TB
    HERSCHER, MB
    PHYFE, D
    WEINSTOCK, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 42 (05): : 1182 - &
  • [9] PREDICTIVE SPEAKER ADAPTATION IN SPEECH RECOGNITION
    COX, S
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (01): : 1 - 17
  • [10] Speech Recognition Using Speaker Adaptation by System Parameter Transformation
    Hao, Ying
    Fang, Ditang
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 63 - 68