Speaker identification using multilingual phone strings

被引：0

作者：

Jin, Q ^{[1
]}

Schultz, T ^{[1
]}

Waibel, A ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA

来源：

2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS | 2002年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Far-field speaker identification is very challenging since varying recording conditions often result in un-matching training and testing situations. Although the widely used Gaussian Mixture Models (GMM) approach achieves reasonable good results when training and testing conditions match, its performance degrades dramatically under un-matching conditions. In this paper we propose a new approach for far-field speaker identification: the usage of multilingual phone strings derived from phone recognizers in eight different languages. The experiments are carried out on a database of 30 speakers recorded with eight different microphone distances. The results show that the multi-lingual phone string approach is robust against un-matching conditions and significantly outperforms the GMMs. On 10-second test chunks, the average closed-set identification performance achieves 96.7% on variable distance data.

引用

页码：145 / 148

页数：4

共 50 条

[1] Text independent speaker identification in multilingual environments
Luengo, Iker
Navas, Eva
Sainz, Inaki
Saratxaga, Ibon
Sanchez, Jon
Odriozola, Igor
Hernaez, Inma
[J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1814 - 1817
[2] Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms
Stolcke, Andreas
Akbacak, Murat
Ferrer, Luciana
Kajarekar, Sachin
Richey, Colleen
Scheffer, Nicolas
Shriberg, Elizabeth
[J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 256 - 262
[3] Multilingual speaker recognition using ANFIS
Department of Information Technology, ABV-Indian Institute of Information Technology and Management, Gwalior, India
[J]. ICSPS - Proc. Int. Conf. Signal Process. Syst., 1600, (V3714-V3718):
[4] A Fuzzy-GMM Classifier For Multilingual Speaker Identification
Devika, A. K.
Sumithra, M. G.
Deepika, A. K.
[J]. 2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
[5] Multilingual Speaker Identification by Combining Evidence from LPR and Multitaper MFCC
Nagaraja, B.
Jayanna, H.
[J]. JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (03) : 241 - 251
[6] GENERATING MULTILINGUAL VOICES USING SPEAKER SPACE TRANSLATION BASED ON BILINGUAL SPEAKER DATA
Maiti, Soumi
Marchi, Erik
Conkie, Alistair
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7624 - 7628
[7] Hierarchical speaker identification using speaker clustering
Sun, B
Liu, WJ
Zhong, QH
[J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 299 - 304
[8] An Experimental Comparison of Modeling Techniques and Combination of Speaker - Specific Information from Different Languages for Multilingual Speaker Identification
Jayanna, H. S.
Nagaraja, B. G.
[J]. JOURNAL OF INTELLIGENT SYSTEMS, 2016, 25 (04) : 529 - 538
[9] The native speaker: Multilingual perspectives
Backus, A
[J]. LINGUA, 1999, 108 (04) : 269 - 275
[10] The native speaker: Multilingual perspectives
Madhavan, P
[J]. CONTRIBUTIONS TO INDIAN SOCIOLOGY, 1999, 33 (03): : 603 - 604

← 1 2 3 4 5 →