LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION

Cited by: 15
Authors
Fan, Weiquan [1]
Xu, Xiangmin [1]
Xing, Xiaofen [1]
Chen, Weidong [1]
Huang, Dongyan [2]
Affiliations
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
[2] UBTECH Robot Corp, Shenzhen, Guangdong, Peoples R China
Keywords
speech emotion recognition; dataset; pretrained model; deep learning;
DOI
10.1109/ICASSP39728.2021.9414542
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI). However, existing small-scale databases have limited the development of related research. In this paper, we present LSSED, a challenging large-scale English speech emotion dataset, with data collected from 820 subjects to simulate a real-world distribution. In addition, we release some pre-trained models based on LSSED, which can not only promote the development of speech emotion recognition but can also be transferred to related downstream tasks, such as mental health analysis, where data is extremely difficult to collect. Finally, our experiments show the necessity of large-scale datasets and the effectiveness of pre-trained models. The dataset will be released at https://github.com/tobefans/LSSED.
Pages: 641-645
Number of pages: 5