LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION

被引：15

作者：

Fan, Weiquan ^{[1
]}

Xu, Xiangmin ^{[1
]}

Xing, Xiaofen ^{[1
]}

Chen, Weidong ^{[1
]}

Huang, Dongyan ^{[2
]}

机构：

[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China

[2] UBTECH Robot Corp, Shenzhen, Guangdong, Peoples R China

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年

关键词：

speech emotion recognition; dataset; pretrained model; deep learning;

D O I：

10.1109/ICASSP39728.2021.9414542

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI). However, current existing small-scale databases have limited the development of related research. In this paper, we present LSSED, a challenging large-scale english speech emotion dataset, which has data collected from 820 subjects to simulate realworld distribution. In addition, we release some pre-trained models based on LSSED, which can not only promote the development of speech emotion recognition, but can also be transferred to related downstream tasks such as mental health analysis where data is extremely difficult to collect. Finally, our experiments show the necessity of large-scale datasets and the effectiveness of pre-trained models. The dateset will be released on https://github.com/tobefans/LSSED.

引用

页码：641 / 645

页数：5

共 50 条

[1] A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video
Oh, Sangmin
Hoogs, Anthony
Perera, Amitha
Cuntoor, Naresh
Chen, Chia-Chih
Lee, Jong Taek
Mukherjee, Saurajit
Aggarwal, J. K.
Lee, Hyungtae
Davis, Larry
Swears, Eran
Wang, Xioyang
Ji, Qiang
Reddy, Kishore
Shah, Mubarak
Vondrick, Carl
Pirsiavash, Hamed
Ramanan, Deva
Yuen, Jenny
Torralba, Antonio
Song, Bi
Fong, Anesco
Roy-Chowdhury, Amit
Desai, Mita
[J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
[2] Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and the Benchmark
You, Quanzeng
Luo, Jiebo
Jin, Hailin
Yang, Jianchao
[J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 308 - 314
[3] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
Hua, Yuansheng
Mou, Lichao
Jin, Pu
Zhu, Xiao Xiang
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4] IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition
Wu, Xiaoping
Zhan, Chi
Lai, Yu-Kun
Cheng, Ming-Ming
Yang, Jufeng
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8779 - 8788
[5] SER30K: A Large-Scale Dataset for Sticker Emotion Recognition
Liu, Shengzhe
Zhang, Xin
Yang, Jufeng
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
[6] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
Guo, Yandong
Zhang, Lei
Hu, Yuxiao
He, Xiaodong
Gao, Jianfeng
[J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 87 - 102
[7] FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction
Khan, Faizan Farooq
Li, Xiang
Temple, Andrew J.
Elhoseiny, Mohamed
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20439 - 20449
[8] EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition
Song, Yingjie
Liu, Zhi
Li, Gongyang
Xie, Jiawei
Wu, Qiang
Zeng, Dan
Xu, Lihua
Zhang, Tianhong
Wang, Jijun
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[9] SDFC dataset: a large-scale benchmark dataset for hyperspectral image classification
Sun, Liwei
Zhang, Junjie
Li, Jia
Wang, Yueming
Zeng, Dan
[J]. OPTICAL AND QUANTUM ELECTRONICS, 2023, 55 (02)
[10] ClearPose: Large-scale Transparent Object Dataset and Benchmark
Chen, Xiaotong
Zhang, Huijie
Yu, Zeren
Opipari, Anthony
Jenkins, Odest Chadwicke
[J]. COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 381 - 396

← 1 2 3 4 5 →