Analysis on MAP and MLLR Based Speaker Adaptation Techniques in Speech Recognition

Cited by: 0
Authors
Ramya, T. [1]
Christina, Lilly S. [1]
Vijayalakshmi, P. [1]
Nagarajan, T. [1]
Affiliations
[1] SSN Coll Engn, Speech Lab, Madras, Tamil Nadu, India
Keywords
Speaker adaptation; MAP; MLLR;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
A speech recognition system produces text output corresponding to a given speech input. A speaker-dependent (SD) recognition system achieves higher recognition performance than a speaker-independent (SI) system. Speaker adaptation techniques such as maximum a posteriori (MAP) adaptation and maximum likelihood linear regression (MLLR) are applied to an SI system to obtain recognition performance close to that of an SD system with a minimal amount of data. The main focus of this paper is to analyse the performance of these adaptation techniques when applied to the recognition system with different amounts of adaptation data. In this work, a speech recognition system is developed using a Tamil speech corpus. Cross-gender speaker adaptation is performed while varying the amount of adaptation data. It is observed that when the adaptation data is very limited, around 30 s, the MLLR-adapted system achieves a recognition performance of 45.76%, whereas the MAP-adapted system achieves 42.44%. When the adaptation data is increased to 5 min, the overall recognition performance of the MAP-adapted system is 6% higher than that of the MLLR-adapted system.
Pages: 1753-1758
Number of pages: 6