I-vector-based Speaker Identification with Extremely Short Utterances for Both Training and Testing

被引：0

作者：

Tsujikawa, Misaki ^{[1
]}

Nishikawa, Tsuyoki ^{[1
]}

Matsui, Tomoko ^{[2
]}

机构：

[1] Panasonic Corp, Core Technol Elemental Dev Ctr, Kadoma, Osaka, Japan

[2] Inst Stat Math, Tachikawa, Tokyo, Japan

来源：

2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE) | 2017年

关键词：

Speaker identification; i-vector; Extremely short utterances;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Voice applications often require the ability to make user-friendly responses by judging the user or user-type from an extremely short utterance, such as a single word. However, it is assumed that performance becomes degraded as the utterance length decreases. In this paper, we examine the performance of speaker identification for extremely short utterances of less than two seconds and then study the relationship between the accuracy and utterance length. Moreover, we show that the identification accuracy can be improved by selecting similar speakers to the target user from a large speech corpus.

引用

页数：4

共 50 条

[31] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
Larcher, Anthony
Bousquet, Pierre-Michel
Lee, Kong Aik
Matrouf, Driss
Li, Haizhou
Bonastre, Jean-Francois
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
[32] Simplification of I-Vector Extraction for Speaker Identification
XU Longting
YANG Zhen
SUN Linhui
Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
[33] Simplification of I-Vector Extraction for Speaker Identification
Xu Longting
Yang Zhen
Sun Linhui
CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (06) : 1121 - 1126
[34] Denoising autoencoder-based speaker feature restoration for utterances of short duration
Yamamoto, Hitoshi
Koshinaka, Takafumi
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1052 - 1056
[35] Direct Optimization of the Detection Cost for I-Vector-Based Spoken Language Recognition
Sizov, Aleksandr
Lee, Kong Aik
Kinnunen, Tomi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 588 - 597
[36] Speaker Identification based on Discriminative Vector Quantization
Zhou, GY
Mikhael, WB
Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620
[37] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo
IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
[38] Speaker identification using fuzzy i-vector tree
Galka, Jakub
Jaciow, Pawel
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 4937 - 4949
[39] Combining Amplitude and Phase-based Features for Speaker Verification with Short Duration Utterances
Alam, Md Jahangir
Kenny, Patrick
Stafylakis, Themos
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 249 - 253
[40] I-Vector-Based Patient Adaptation of Deep Neural Networks for Automatic Heartbeat Classification
Xu, Sean Shensheng
Mak, Man-Wai
Cheung, Chi-Chung
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (03) : 717 - 727

← 1 2 3 4 5 →