LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION

被引:15
|
作者
Fan, Weiquan [1 ]
Xu, Xiangmin [1 ]
Xing, Xiaofen [1 ]
Chen, Weidong [1 ]
Huang, Dongyan [2 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
[2] UBTECH Robot Corp, Shenzhen, Guangdong, Peoples R China
关键词
speech emotion recognition; dataset; pretrained model; deep learning;
D O I
10.1109/ICASSP39728.2021.9414542
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI). However, current existing small-scale databases have limited the development of related research. In this paper, we present LSSED, a challenging large-scale english speech emotion dataset, which has data collected from 820 subjects to simulate realworld distribution. In addition, we release some pre-trained models based on LSSED, which can not only promote the development of speech emotion recognition, but can also be transferred to related downstream tasks such as mental health analysis where data is extremely difficult to collect. Finally, our experiments show the necessity of large-scale datasets and the effectiveness of pre-trained models. The dateset will be released on https://github.com/tobefans/LSSED.
引用
收藏
页码:641 / 645
页数:5
相关论文
共 50 条
  • [1] A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video
    Oh, Sangmin
    Hoogs, Anthony
    Perera, Amitha
    Cuntoor, Naresh
    Chen, Chia-Chih
    Lee, Jong Taek
    Mukherjee, Saurajit
    Aggarwal, J. K.
    Lee, Hyungtae
    Davis, Larry
    Swears, Eran
    Wang, Xioyang
    Ji, Qiang
    Reddy, Kishore
    Shah, Mubarak
    Vondrick, Carl
    Pirsiavash, Hamed
    Ramanan, Deva
    Yuen, Jenny
    Torralba, Antonio
    Song, Bi
    Fong, Anesco
    Roy-Chowdhury, Amit
    Desai, Mita
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [2] Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and the Benchmark
    You, Quanzeng
    Luo, Jiebo
    Jin, Hailin
    Yang, Jianchao
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 308 - 314
  • [3] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition
    Wu, Xiaoping
    Zhan, Chi
    Lai, Yu-Kun
    Cheng, Ming-Ming
    Yang, Jufeng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8779 - 8788
  • [5] SER30K: A Large-Scale Dataset for Sticker Emotion Recognition
    Liu, Shengzhe
    Zhang, Xin
    Yang, Jufeng
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [6] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
    Guo, Yandong
    Zhang, Lei
    Hu, Yuxiao
    He, Xiaodong
    Gao, Jianfeng
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 87 - 102
  • [7] FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction
    Khan, Faizan Farooq
    Li, Xiang
    Temple, Andrew J.
    Elhoseiny, Mohamed
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20439 - 20449
  • [8] EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition
    Song, Yingjie
    Liu, Zhi
    Li, Gongyang
    Xie, Jiawei
    Wu, Qiang
    Zeng, Dan
    Xu, Lihua
    Zhang, Tianhong
    Wang, Jijun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [9] SDFC dataset: a large-scale benchmark dataset for hyperspectral image classification
    Sun, Liwei
    Zhang, Junjie
    Li, Jia
    Wang, Yueming
    Zeng, Dan
    [J]. OPTICAL AND QUANTUM ELECTRONICS, 2023, 55 (02)
  • [10] ClearPose: Large-scale Transparent Object Dataset and Benchmark
    Chen, Xiaotong
    Zhang, Huijie
    Yu, Zeren
    Opipari, Anthony
    Jenkins, Odest Chadwicke
    [J]. COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 381 - 396