I-vector-based Speaker Identification with Extremely Short Utterances for Both Training and Testing

被引:0
|
作者
Tsujikawa, Misaki [1 ]
Nishikawa, Tsuyoki [1 ]
Matsui, Tomoko [2 ]
机构
[1] Panasonic Corp, Core Technol Elemental Dev Ctr, Kadoma, Osaka, Japan
[2] Inst Stat Math, Tachikawa, Tokyo, Japan
关键词
Speaker identification; i-vector; Extremely short utterances;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Voice applications often require the ability to make user-friendly responses by judging the user or user-type from an extremely short utterance, such as a single word. However, it is assumed that performance becomes degraded as the utterance length decreases. In this paper, we examine the performance of speaker identification for extremely short utterances of less than two seconds and then study the relationship between the accuracy and utterance length. Moreover, we show that the identification accuracy can be improved by selecting similar speakers to the target user from a large speech corpus.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] FRAME-LEVEL PHONEME-INVARIANT SPEAKER EMBEDDING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION ON EXTREMELY SHORT UTTERANCES
    Tawara, Naohiro
    Ogawa, Atsunori
    Iwata, Tomoharu
    Delcroix, Marc
    Ogawa, Tetsuji
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6799 - 6803
  • [22] Experiments in SVM-based Speaker Verification Using Short Utterances
    McLaren, Mitchell
    Vogt, Robbie
    Baker, Brendan
    Sridharan, Sridha
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 83 - 90
  • [23] An Effective Speaker Clustering Method using UBM and Ultra-Short Training Utterances
    Hossa, Robert
    Makowski, Ryszard
    ARCHIVES OF ACOUSTICS, 2016, 41 (01) : 107 - 118
  • [24] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
    Chakroun, Rania
    Frikha, Mondher
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (29-30) : 21279 - 21298
  • [25] Speaker identification based on vector quantization
    Radová, V
    Svenda, Z
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
  • [26] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
    Rania Chakroun
    Mondher Frikha
    Multimedia Tools and Applications, 2020, 79 : 21279 - 21298
  • [27] Full multicondition training for robust i-vector based speaker recognition
    Ribas, Dayana
    Vincent, Emmanuel
    Ramon Calvo, Jose
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1057 - 1061
  • [28] An End-to-End Text-Independent Speaker Identification System on Short Utterances
    Ji, Ruifang
    Cai, Xinyuan
    Xu, Bo
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3628 - 3632
  • [29] CNN-based joint mapping of short and long utterance i-vectors for speaker verification using short utterances
    Guo, Jinxi
    Nookala, Usha Amrutha
    Alwan, Abeer
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3712 - 3716
  • [30] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
    Larcher, Anthony
    Bousquet, Pierre-Michel
    Lee, Kong Aik
    Matrouf, Driss
    Li, Haizhou
    Bonastre, Jean-Francois
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776