I-vector-based Speaker Identification with Extremely Short Utterances for Both Training and Testing

被引:0
|
作者
Tsujikawa, Misaki [1 ]
Nishikawa, Tsuyoki [1 ]
Matsui, Tomoko [2 ]
机构
[1] Panasonic Corp, Core Technol Elemental Dev Ctr, Kadoma, Osaka, Japan
[2] Inst Stat Math, Tachikawa, Tokyo, Japan
关键词
Speaker identification; i-vector; Extremely short utterances;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Voice applications often require the ability to make user-friendly responses by judging the user or user-type from an extremely short utterance, such as a single word. However, it is assumed that performance becomes degraded as the utterance length decreases. In this paper, we examine the performance of speaker identification for extremely short utterances of less than two seconds and then study the relationship between the accuracy and utterance length. Moreover, we show that the identification accuracy can be improved by selecting similar speakers to the target user from a large speech corpus.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
    Larcher, Anthony
    Bousquet, Pierre-Michel
    Lee, Kong Aik
    Matrouf, Driss
    Li, Haizhou
    Bonastre, Jean-Francois
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
  • [32] Simplification of I-Vector Extraction for Speaker Identification
    XU Longting
    YANG Zhen
    SUN Linhui
    Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
  • [33] Simplification of I-Vector Extraction for Speaker Identification
    Xu Longting
    Yang Zhen
    Sun Linhui
    CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (06) : 1121 - 1126
  • [34] Denoising autoencoder-based speaker feature restoration for utterances of short duration
    Yamamoto, Hitoshi
    Koshinaka, Takafumi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1052 - 1056
  • [35] Direct Optimization of the Detection Cost for I-Vector-Based Spoken Language Recognition
    Sizov, Aleksandr
    Lee, Kong Aik
    Kinnunen, Tomi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 588 - 597
  • [36] Speaker Identification based on Discriminative Vector Quantization
    Zhou, GY
    Mikhael, WB
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620
  • [37] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
    Kang, Woo Hyun
    Cho, Won Ik
    Jang, Se Young
    Lee, Hyeon Seung
    Kim, Nam Soo
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
  • [38] Speaker identification using fuzzy i-vector tree
    Galka, Jakub
    Jaciow, Pawel
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 4937 - 4949
  • [39] Combining Amplitude and Phase-based Features for Speaker Verification with Short Duration Utterances
    Alam, Md Jahangir
    Kenny, Patrick
    Stafylakis, Themos
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 249 - 253
  • [40] I-Vector-Based Patient Adaptation of Deep Neural Networks for Automatic Heartbeat Classification
    Xu, Sean Shensheng
    Mak, Man-Wai
    Cheung, Chi-Chung
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (03) : 717 - 727