Applying Generative Adversarial Networks and Vision Transformers in Speech Emotion Recognition

被引:0
|
作者
Heracleous, Panikos [1 ]
Fukayama, Satoru [1 ]
Ogata, Jun [1 ]
Mohammad, Yasser [1 ]
机构
[1] National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Tokyo, Koto-ku,135-0064, Japan
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Character recognition - Generative adversarial networks - Human computer interaction - Speech recognition
引用
下载
收藏
页码:67 / 75
相关论文
共 50 条
  • [1] Augmenting Generative Adversarial Networks for Speech Emotion Recognition
    Latif, Siddique
    Asim, Muhammad
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Schuller, Bjoern W.
    INTERSPEECH 2020, 2020, : 521 - 525
  • [2] On Enhancing Speech Emotion Recognition using Generative Adversarial Networks
    Sahu, Saurabh
    Gupta, Rahul
    Espy-Wilson, Carol
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3693 - 3697
  • [3] Robust Semisupervised Generative Adversarial Networks for Speech Emotion Recognition via Distribution Smoothness
    Zhao, Huan
    Xiao, Yufeng
    Zhang, Zixing
    IEEE ACCESS, 2020, 8 (08): : 106889 - 106900
  • [4] ROBUST SPEECH RECOGNITION USING GENERATIVE ADVERSARIAL NETWORKS
    Sriram, Anuroop
    Jun, Heewoo
    Gaur, Yashesh
    Satheesh, Sanjeev
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5639 - 5643
  • [5] Speech emotion recognition using data augmentation method by cycle-generative adversarial networks
    Shilandari, Arash
    Marvi, Hossein
    Khosravi, Hossein
    Wang, Wenwu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1955 - 1962
  • [6] Speech emotion recognition using data augmentation method by cycle-generative adversarial networks
    Arash Shilandari
    Hossein Marvi
    Hossein Khosravi
    Wenwu Wang
    Signal, Image and Video Processing, 2022, 16 : 1955 - 1962
  • [7] EXPLORING SPEECH ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Donahue, Chris
    Li, Bo
    Prabhavalkar, Rohit
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5024 - 5028
  • [8] Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition
    Wang, Ke
    Zhang, Junbo
    Sun, Sining
    Wang, Yujun
    Xiang, Fei
    Xie, Lei
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1581 - 1585
  • [9] ENSEMBLE OF DOMAIN ADVERSARIAL NEURAL NETWORKS FOR SPEECH EMOTION RECOGNITION
    Lee, Shi-wook
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 374 - 379
  • [10] Unsupervised Domain Adaptation with Generative Adversarial Networks for Facial Emotion Recognition
    Fan, Yingruo
    Lam, Jacqueline C. K.
    Li, Victor O. K.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4460 - 4464