I-vector-based Speaker Identification with Extremely Short Utterances for Both Training and Testing

被引：0

作者：

Tsujikawa, Misaki ^{[1
]}

Nishikawa, Tsuyoki ^{[1
]}

Matsui, Tomoko ^{[2
]}

机构：

[1] Panasonic Corp, Core Technol Elemental Dev Ctr, Kadoma, Osaka, Japan

[2] Inst Stat Math, Tachikawa, Tokyo, Japan

来源：

2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE) | 2017年

关键词：

Speaker identification; i-vector; Extremely short utterances;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Voice applications often require the ability to make user-friendly responses by judging the user or user-type from an extremely short utterance, such as a single word. However, it is assumed that performance becomes degraded as the utterance length decreases. In this paper, we examine the performance of speaker identification for extremely short utterances of less than two seconds and then study the relationship between the accuracy and utterance length. Moreover, we show that the identification accuracy can be improved by selecting similar speakers to the target user from a large speech corpus.

引用

页数：4

共 50 条

[21] FRAME-LEVEL PHONEME-INVARIANT SPEAKER EMBEDDING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION ON EXTREMELY SHORT UTTERANCES
Tawara, Naohiro
Ogawa, Atsunori
Iwata, Tomoharu
Delcroix, Marc
Ogawa, Tetsuji
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6799 - 6803
[22] Experiments in SVM-based Speaker Verification Using Short Utterances
McLaren, Mitchell
Vogt, Robbie
Baker, Brendan
Sridharan, Sridha
ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 83 - 90
[23] An Effective Speaker Clustering Method using UBM and Ultra-Short Training Utterances
Hossa, Robert
Makowski, Ryszard
ARCHIVES OF ACOUSTICS, 2016, 41 (01) : 107 - 118
[24] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
Chakroun, Rania
Frikha, Mondher
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (29-30) : 21279 - 21298
[25] Speaker identification based on vector quantization
Radová, V
Svenda, Z
TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
[26] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
Rania Chakroun
Mondher Frikha
Multimedia Tools and Applications, 2020, 79 : 21279 - 21298
[27] Full multicondition training for robust i-vector based speaker recognition
Ribas, Dayana
Vincent, Emmanuel
Ramon Calvo, Jose
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1057 - 1061
[28] An End-to-End Text-Independent Speaker Identification System on Short Utterances
Ji, Ruifang
Cai, Xinyuan
Xu, Bo
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3628 - 3632
[29] CNN-based joint mapping of short and long utterance i-vectors for speaker verification using short utterances
Guo, Jinxi
Nookala, Usha Amrutha
Alwan, Abeer
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3712 - 3716
[30] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
Larcher, Anthony
Bousquet, Pierre-Michel
Lee, Kong Aik
Matrouf, Driss
Li, Haizhou
Bonastre, Jean-Francois
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776

← 1 2 3 4 5 →