The Graph feature fusion technique for speaker recognition based on wav2vec2.0 framework

被引:0
|
作者
Ge, Zirui [1 ]
Guo, Haiyan [1 ]
Wang, Tingting [1 ]
Yang, Zhen [1 ]
机构
[1] School of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing,2100023, China
来源
arXiv | 2023年
关键词
Compendex;
D O I
暂无
中图分类号
学科分类号
摘要
Graph neural networks
引用
收藏
相关论文
共 50 条
  • [1] On the robustness of wav2vec 2.0 based speaker recognition systems
    Novoselov, Sergey
    Lavrentyeva, Galina
    Avdeeva, Anastasia
    Volokhov, Vladimir
    Khmelev, Nikita
    Akulov, Artem
    Leonteva, Polina
    INTERSPEECH 2023, 2023, : 3177 - 3181
  • [2] Kazakh Speech Recognition: Wav2vec2.0 vs. Whisper
    Kozhirbayev, Zhanibek
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (06) : 1382 - 1389
  • [3] Transfer Ability of Monolingual Wav2vec2.0 for Low-resource Speech Recognition
    Yi, Cheng
    Wang, Jianzong
    Cheng, Ning
    Zhou, Shiyu
    Xu, Bo
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Enhancing Stuttering Detection and Classification using Wav2Vec2.0
    Sen, Madhurima
    Das, Pradip K.
    2024 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP, 2024,
  • [5] Keyword spotting for dialectal speech and Introduction of wav2vec2.0
    Ariga, Tomohiro
    Minakawa, Reo
    Kojima, Kazunori
    Lee, Shi-Wook
    Itoh, Yoshiaki
    APSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024, 2024,
  • [6] Speech emotion recognition using fine-tuned Wav2vec2.0 and neural controlleddifferential equations classifier
    Wang, Ni
    Yang, Danyu
    PLOS ONE, 2025, 20 (02):
  • [7] Exploring the potential of Wav2vec 2.0 for speech emotion recognition using classifier combination and attention-based feature fusion
    Nasersharif, Babak
    Namvarpour, Mohammad
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (16): : 23667 - 23688
  • [8] Exploring wav2vec 2.0 on speaker verification and language identification
    Fan, Zhiyun
    Li, Meng
    Zhou, Shiyu
    Xu, Bo
    INTERSPEECH 2021, 2021, : 1509 - 1513
  • [9] Novel Speech Recognition Systems Applied to Forensics within Child Exploitation: Wav2vec2.0 vs. Whisper
    Vasquez-Correa, Juan Camilo
    alvarez Muniain, Aitor
    SENSORS, 2023, 23 (04)
  • [10] Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters
    Leem, Seong-Gyun
    Fulford, Daniel
    Onnela, Jukka-Pekka
    Gard, David
    Busso, Carlos
    INTERSPEECH 2023, 2023, : 1888 - 1892