Keyword spotting for dialectal speech and Introduction of wav2vec2.0

被引:0
|
作者
Ariga, Tomohiro [1 ]
Minakawa, Reo [2 ]
Kojima, Kazunori [1 ]
Lee, Shi-Wook [3 ]
Itoh, Yoshiaki [1 ]
机构
[1] Iwate Prefectural University, Japan
[2] Graduated from Iwate Prefectural University, Japan
[3] National Institute of Advanced Industrial Science and Technology, Japan
关键词
Blank-comp - Detection accuracy - Dialectal speech - Keyword spotting - Monophones - Posterior probability - Probability vector - Query-by example - Spoken term detection - Sub words;
D O I
暂无
中图分类号
学科分类号
摘要
12
引用
收藏
相关论文
共 50 条
  • [41] BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0
    Kim, Miseul
    Piao, Zhenyu
    Lee, Jihyun
    Kang, Hong-Goo
    2023 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, BHI, 2023,
  • [42] CCC-WAV2VEC 2.0: CLUSTERING AIDED CROSS CONTRASTIVE SELF-SUPERVISED LEARNING OF SPEECH REPRESENTATIONS
    Lodagala, Vasista Sai
    Ghosh, Sreyan
    Umesh, S.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1 - 8
  • [43] Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
    Becerra, Helard
    Ragano, Alessandro
    Hines, Andrew
    INTERSPEECH 2022, 2022, : 4088 - 4092
  • [44] wav2vec2-based Speech Rating System for Children with Speech Sound Disorder
    Getman, Yaroslav
    Al-Ghezil, Ragheb
    Vbskoboinik, Ekaterina
    Grosz, Tamas
    Kurimo, Mikko
    Salvi, Giampiero
    Svendsen, Torbjorn
    Strombergsson, Sofia
    INTERSPEECH 2022, 2022, : 3618 - 3622
  • [45] Aggregation Strategies of Wav2vec 2.0 Embeddings for Computational Paralinguistic Tasks
    Vetrab, Mercedes
    Gosztolya, Gabor
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 79 - 93
  • [46] Siamese Network with Wav2vec Feature for Spoofing Speech Detection
    Xie, Yang
    Zhang, Zhenchuan
    Yang, Yingchun
    INTERSPEECH 2021, 2021, : 4269 - 4273
  • [47] Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders
    Svec, Jan
    Polak, Filip
    Bartos, Ales
    Zapletalova, Michaela
    Vita, Martin
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 501 - 512
  • [48] wav2vec: Unsupervised Pre-training for Speech Recognition
    Schneider, Steffen
    Baevski, Alexei
    Collobert, Ronan
    Auli, Michael
    INTERSPEECH 2019, 2019, : 3465 - 3469
  • [49] Using Speaker-Specific Emotion Representations in Wav2vec 2.0-Based Modules for Speech Emotion Recognition
    Park, Somin
    Mark, Mpabulungi
    Park, Bogyung
    Hong, Hyunki
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 1009 - 1030
  • [50] An improved wav2vec 2.0 pre-training approach using enhanced local dependency modeling for speech recognition
    Zhu, Qiu-shi
    Zhang, Jie
    Wu, Ming-hui
    Fang, Xin
    Dai, Li-Rong
    INTERSPEECH 2021, 2021, : 4334 - 4338