Keyword spotting for dialectal speech and Introduction of wav2vec2.0

被引：0

作者：

Ariga, Tomohiro ^{[1
]}

Minakawa, Reo ^{[2
]}

Kojima, Kazunori ^{[1
]}

Lee, Shi-Wook ^{[3
]}

Itoh, Yoshiaki ^{[1
]}

机构：

[1] Iwate Prefectural University, Japan

[2] Graduated from Iwate Prefectural University, Japan

[3] National Institute of Advanced Industrial Science and Technology, Japan

来源：

APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024 | 2024年

关键词：

Blank-comp - Detection accuracy - Dialectal speech - Keyword spotting - Monophones - Posterior probability - Probability vector - Query-by example - Spoken term detection - Sub words;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

共 50 条

[41] BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0
Kim, Miseul
Piao, Zhenyu
Lee, Jihyun
Kang, Hong-Goo
2023 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, BHI, 2023,
[42] CCC-WAV2VEC 2.0: CLUSTERING AIDED CROSS CONTRASTIVE SELF-SUPERVISED LEARNING OF SPEECH REPRESENTATIONS
Lodagala, Vasista Sai
Ghosh, Sreyan
Umesh, S.
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1 - 8
[43] Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
Becerra, Helard
Ragano, Alessandro
Hines, Andrew
INTERSPEECH 2022, 2022, : 4088 - 4092
[44] wav2vec2-based Speech Rating System for Children with Speech Sound Disorder
Getman, Yaroslav
Al-Ghezil, Ragheb
Vbskoboinik, Ekaterina
Grosz, Tamas
Kurimo, Mikko
Salvi, Giampiero
Svendsen, Torbjorn
Strombergsson, Sofia
INTERSPEECH 2022, 2022, : 3618 - 3622
[45] Aggregation Strategies of Wav2vec 2.0 Embeddings for Computational Paralinguistic Tasks
Vetrab, Mercedes
Gosztolya, Gabor
SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 79 - 93
[46] Siamese Network with Wav2vec Feature for Spoofing Speech Detection
Xie, Yang
Zhang, Zhenchuan
Yang, Yingchun
INTERSPEECH 2021, 2021, : 4269 - 4273
[47] Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders
Svec, Jan
Polak, Filip
Bartos, Ales
Zapletalova, Michaela
Vita, Martin
TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 501 - 512
[48] wav2vec: Unsupervised Pre-training for Speech Recognition
Schneider, Steffen
Baevski, Alexei
Collobert, Ronan
Auli, Michael
INTERSPEECH 2019, 2019, : 3465 - 3469
[49] Using Speaker-Specific Emotion Representations in Wav2vec 2.0-Based Modules for Speech Emotion Recognition
Park, Somin
Mark, Mpabulungi
Park, Bogyung
Hong, Hyunki
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 1009 - 1030
[50] An improved wav2vec 2.0 pre-training approach using enhanced local dependency modeling for speech recognition
Zhu, Qiu-shi
Zhang, Jie
Wu, Ming-hui
Fang, Xin
Dai, Li-Rong
INTERSPEECH 2021, 2021, : 4334 - 4338

← 1 2 3 4 5 →