SER30K: A Large-Scale Dataset for Sticker Emotion Recognition

被引:2
|
作者
Liu, Shengzhe [1 ]
Zhang, Xin [1 ]
Yang, Jufeng [1 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
关键词
dataset; sticker emotion analysis; multimodal learning;
D O I
10.1145/3503161.3548407
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the popularity of instant messaging applications, online chatting plays an essential role in our daily life. The prevailing use of stickers to express emotions in online chatting leads to the necessity of multimodal sticker emotion recognition. Considering the lack of sticker emotion data, we collect a large-scale sticker emotion recognition dataset named SER30K. It consists of a total of 1,887 sticker themes with total 30,739 sticker images. Some commonly used images, such as realistic images and facial expression images, have been well studied in the field of emotion analysis. However, it is still challenging to understand the emotion of sticker images. Since the characteristics in stickers from the same theme are similar, we can only accurately predict the emotion by capturing the local information (e.g., expressions, poses) and understanding the global information (e.g., relations among objects). To tackle this challenge, we propose a LOcal Re-Attention multimodal network (LORA) to learn sticker emotions in an end-to-end manner. Different from previous approaches using convolutional neural networks, LORA employs the vision transformer to extract visual features, leading to better capture the global relations. In addition, we design a local re-attention module to focus on important region information. Then a simple but efficient modal fusion module combines visual and language features. Extensive experiments are performed on the SER30K and other emotion recognition datasets, demonstrating the effectiveness of our proposed method. Our code, model and dataset are released on https://github.com/nku-shengzheliu/SER30K.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION
    Fan, Weiquan
    Xu, Xiangmin
    Xing, Xiaofen
    Chen, Weidong
    Huang, Dongyan
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 641 - 645
  • [2] Products-6K: A Large-Scale Groceries Product Recognition Dataset
    Georgiadis, Kostas
    Kordopatis-Zilos, Giorgos
    Kalaganis, Fotis P.
    Migkotzidis, Panagiotis
    Chatzilari, Elisavet
    Panakidou, Valasia
    Pantouvakis, Kyriakos
    Tortopidis, Savvas
    Papadopoulos, Symeon
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    [J]. THE 14TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2021, 2021, : 1 - 7
  • [3] EmoSet: A Large-scale Visual Emotion Dataset with Rich Attributes
    Yang, Jingyuan
    Huang, Qirui
    Ding, Tingting
    Lischinski, Dani
    Cohen-Or, Daniel
    Huang, Hui
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20326 - 20337
  • [4] A large-scale fMRI dataset for human action recognition
    Zhou, Ming
    Gong, Zhengxin
    Dai, Yuxuan
    Wen, Yushan
    Liu, Youyi
    Zhen, Zonglei
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [5] A large-scale fMRI dataset for human action recognition
    Ming Zhou
    Zhengxin Gong
    Yuxuan Dai
    Yushan Wen
    Youyi Liu
    Zonglei Zhen
    [J]. Scientific Data, 10
  • [6] HEU Emotion: a large-scale database for multimodal emotion recognition in the wild
    Jing Chen
    Chenhui Wang
    Kejun Wang
    Chaoqun Yin
    Cong Zhao
    Tao Xu
    Xinyi Zhang
    Ziqiang Huang
    Meichen Liu
    Tao Yang
    [J]. Neural Computing and Applications, 2021, 33 : 8669 - 8685
  • [7] HEU Emotion: a large-scale database for multimodal emotion recognition in the wild
    Chen, Jing
    Wang, Chenhui
    Wang, Kejun
    Yin, Chaoqun
    Zhao, Cong
    Xu, Tao
    Zhang, Xinyi
    Huang, Ziqiang
    Liu, Meichen
    Yang, Tao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14): : 8669 - 8685
  • [8] A Large-Scale 3D Object Recognition dataset
    Solund, Thomas
    Buch, Anders Glent
    Kruger, Norbert
    Aanaes, Henrik
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 73 - 82
  • [9] A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video
    Oh, Sangmin
    Hoogs, Anthony
    Perera, Amitha
    Cuntoor, Naresh
    Chen, Chia-Chih
    Lee, Jong Taek
    Mukherjee, Saurajit
    Aggarwal, J. K.
    Lee, Hyungtae
    Davis, Larry
    Swears, Eran
    Wang, Xioyang
    Ji, Qiang
    Reddy, Kishore
    Shah, Mubarak
    Vondrick, Carl
    Pirsiavash, Hamed
    Ramanan, Deva
    Yuen, Jenny
    Torralba, Antonio
    Song, Bi
    Fong, Anesco
    Roy-Chowdhury, Amit
    Desai, Mita
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [10] Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and the Benchmark
    You, Quanzeng
    Luo, Jiebo
    Jin, Hailin
    Yang, Jianchao
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 308 - 314