Keyword spotting for dialectal speech and Introduction of wav2vec2.0

被引:0
|
作者
Ariga, Tomohiro [1 ]
Minakawa, Reo [2 ]
Kojima, Kazunori [1 ]
Lee, Shi-Wook [3 ]
Itoh, Yoshiaki [1 ]
机构
[1] Iwate Prefectural University, Japan
[2] Graduated from Iwate Prefectural University, Japan
[3] National Institute of Advanced Industrial Science and Technology, Japan
关键词
Blank-comp - Detection accuracy - Dialectal speech - Keyword spotting - Monophones - Posterior probability - Probability vector - Query-by example - Spoken term detection - Sub words;
D O I
暂无
中图分类号
学科分类号
摘要
12
引用
收藏
相关论文
共 50 条
  • [1] Kazakh Speech Recognition: Wav2vec2.0 vs. Whisper
    Kozhirbayev, Zhanibek
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (06) : 1382 - 1389
  • [2] Transfer Ability of Monolingual Wav2vec2.0 for Low-resource Speech Recognition
    Yi, Cheng
    Wang, Jianzong
    Cheng, Ning
    Zhou, Shiyu
    Xu, Bo
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Enhancing Stuttering Detection and Classification using Wav2Vec2.0
    Sen, Madhurima
    Das, Pradip K.
    2024 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP, 2024,
  • [4] Automatic Speech Disfluency Detection Using wav2vec2.0 for Different Languages with Variable Lengths
    Liu, Jiajun
    Wumaier, Aishan
    Wei, Dongping
    Guo, Shen
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [5] Speech emotion recognition using fine-tuned Wav2vec2.0 and neural controlleddifferential equations classifier
    Wang, Ni
    Yang, Danyu
    PLOS ONE, 2025, 20 (02):
  • [6] The Graph feature fusion technique for speaker recognition based on wav2vec2.0 framework
    Ge, Zirui
    Guo, Haiyan
    Wang, Tingting
    Yang, Zhen
    arXiv, 2023,
  • [7] Novel Speech Recognition Systems Applied to Forensics within Child Exploitation: Wav2vec2.0 vs. Whisper
    Vasquez-Correa, Juan Camilo
    alvarez Muniain, Aitor
    SENSORS, 2023, 23 (04)
  • [8] Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters
    Leem, Seong-Gyun
    Fulford, Daniel
    Onnela, Jukka-Pekka
    Gard, David
    Busso, Carlos
    INTERSPEECH 2023, 2023, : 1888 - 1892
  • [9] Damage localization method using ultrasonic lamb waves and Wav2Vec2.0 neural network
    Qian, Lubin
    Liu, Sihao
    Fan, Guopeng
    Liu, Xinlong
    Zhang, Hui
    Mei, Yaohua
    Xing, Yuhui
    Wang, Zhiqiang
    FRONTIERS IN MATERIALS, 2023, 10
  • [10] Enhancing Language Identification in Indian Context Through Exploiting Learned Features with Wav2Vec2.0
    Gupta, Shivang
    Motepalli, Kowshik Siva Sai
    Kumar, Ravi
    Narasinga, Vamsi
    Mirishkar, Sai Ganesh
    Vuppala, Anil Kumar
    SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 503 - 512