Speaker identification using multilingual phone strings

被引:0
|
作者
Jin, Q [1 ]
Schultz, T [1 ]
Waibel, A [1 ]
机构
[1] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Far-field speaker identification is very challenging since varying recording conditions often result in un-matching training and testing situations. Although the widely used Gaussian Mixture Models (GMM) approach achieves reasonable good results when training and testing conditions match, its performance degrades dramatically under un-matching conditions. In this paper we propose a new approach for far-field speaker identification: the usage of multilingual phone strings derived from phone recognizers in eight different languages. The experiments are carried out on a database of 30 speakers recorded with eight different microphone distances. The results show that the multi-lingual phone string approach is robust against un-matching conditions and significantly outperforms the GMMs. On 10-second test chunks, the average closed-set identification performance achieves 96.7% on variable distance data.
引用
收藏
页码:145 / 148
页数:4
相关论文
共 50 条
  • [1] Text independent speaker identification in multilingual environments
    Luengo, Iker
    Navas, Eva
    Sainz, Inaki
    Saratxaga, Ibon
    Sanchez, Jon
    Odriozola, Igor
    Hernaez, Inma
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1814 - 1817
  • [2] Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms
    Stolcke, Andreas
    Akbacak, Murat
    Ferrer, Luciana
    Kajarekar, Sachin
    Richey, Colleen
    Scheffer, Nicolas
    Shriberg, Elizabeth
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 256 - 262
  • [3] Multilingual speaker recognition using ANFIS
    Department of Information Technology, ABV-Indian Institute of Information Technology and Management, Gwalior, India
    [J]. ICSPS - Proc. Int. Conf. Signal Process. Syst., 1600, (V3714-V3718):
  • [4] A Fuzzy-GMM Classifier For Multilingual Speaker Identification
    Devika, A. K.
    Sumithra, M. G.
    Deepika, A. K.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [5] Multilingual Speaker Identification by Combining Evidence from LPR and Multitaper MFCC
    Nagaraja, B.
    Jayanna, H.
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (03) : 241 - 251
  • [6] GENERATING MULTILINGUAL VOICES USING SPEAKER SPACE TRANSLATION BASED ON BILINGUAL SPEAKER DATA
    Maiti, Soumi
    Marchi, Erik
    Conkie, Alistair
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7624 - 7628
  • [7] Hierarchical speaker identification using speaker clustering
    Sun, B
    Liu, WJ
    Zhong, QH
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 299 - 304
  • [8] An Experimental Comparison of Modeling Techniques and Combination of Speaker - Specific Information from Different Languages for Multilingual Speaker Identification
    Jayanna, H. S.
    Nagaraja, B. G.
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2016, 25 (04) : 529 - 538
  • [9] The native speaker: Multilingual perspectives
    Backus, A
    [J]. LINGUA, 1999, 108 (04) : 269 - 275
  • [10] The native speaker: Multilingual perspectives
    Madhavan, P
    [J]. CONTRIBUTIONS TO INDIAN SOCIOLOGY, 1999, 33 (03): : 603 - 604