Keyword spotting for dialectal speech and Introduction of wav2vec2.0

被引：0

作者：

Ariga, Tomohiro ^{[1
]}

Minakawa, Reo ^{[2
]}

Kojima, Kazunori ^{[1
]}

Lee, Shi-Wook ^{[3
]}

Itoh, Yoshiaki ^{[1
]}

机构：

[1] Iwate Prefectural University, Japan

[2] Graduated from Iwate Prefectural University, Japan

[3] National Institute of Advanced Industrial Science and Technology, Japan

来源：

APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024 | 2024年

关键词：

Blank-comp - Detection accuracy - Dialectal speech - Keyword spotting - Monophones - Posterior probability - Probability vector - Query-by example - Spoken term detection - Sub words;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

共 50 条

[1] Kazakh Speech Recognition: Wav2vec2.0 vs. Whisper
Kozhirbayev, Zhanibek
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (06) : 1382 - 1389
[2] Transfer Ability of Monolingual Wav2vec2.0 for Low-resource Speech Recognition
Yi, Cheng
Wang, Jianzong
Cheng, Ning
Zhou, Shiyu
Xu, Bo
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[3] Enhancing Stuttering Detection and Classification using Wav2Vec2.0
Sen, Madhurima
Das, Pradip K.
2024 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP, 2024,
[4] Automatic Speech Disfluency Detection Using wav2vec2.0 for Different Languages with Variable Lengths
Liu, Jiajun
Wumaier, Aishan
Wei, Dongping
Guo, Shen
APPLIED SCIENCES-BASEL, 2023, 13 (13):
[5] Speech emotion recognition using fine-tuned Wav2vec2.0 and neural controlleddifferential equations classifier
Wang, Ni
Yang, Danyu
PLOS ONE, 2025, 20 (02):
[6] The Graph feature fusion technique for speaker recognition based on wav2vec2.0 framework
Ge, Zirui
Guo, Haiyan
Wang, Tingting
Yang, Zhen
arXiv, 2023,
[7] Novel Speech Recognition Systems Applied to Forensics within Child Exploitation: Wav2vec2.0 vs. Whisper
Vasquez-Correa, Juan Camilo
alvarez Muniain, Aitor
SENSORS, 2023, 23 (04)
[8] Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters
Leem, Seong-Gyun
Fulford, Daniel
Onnela, Jukka-Pekka
Gard, David
Busso, Carlos
INTERSPEECH 2023, 2023, : 1888 - 1892
[9] Damage localization method using ultrasonic lamb waves and Wav2Vec2.0 neural network
Qian, Lubin
Liu, Sihao
Fan, Guopeng
Liu, Xinlong
Zhang, Hui
Mei, Yaohua
Xing, Yuhui
Wang, Zhiqiang
FRONTIERS IN MATERIALS, 2023, 10
[10] Enhancing Language Identification in Indian Context Through Exploiting Learned Features with Wav2Vec2.0
Gupta, Shivang
Motepalli, Kowshik Siva Sai
Kumar, Ravi
Narasinga, Vamsi
Mirishkar, Sai Ganesh
Vuppala, Anil Kumar
SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 503 - 512

← 1 2 3 4 5 →