SER30K: A Large-Scale Dataset for Sticker Emotion Recognition

被引:2
|
作者
Liu, Shengzhe [1 ]
Zhang, Xin [1 ]
Yang, Jufeng [1 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
关键词
dataset; sticker emotion analysis; multimodal learning;
D O I
10.1145/3503161.3548407
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the popularity of instant messaging applications, online chatting plays an essential role in our daily life. The prevailing use of stickers to express emotions in online chatting leads to the necessity of multimodal sticker emotion recognition. Considering the lack of sticker emotion data, we collect a large-scale sticker emotion recognition dataset named SER30K. It consists of a total of 1,887 sticker themes with total 30,739 sticker images. Some commonly used images, such as realistic images and facial expression images, have been well studied in the field of emotion analysis. However, it is still challenging to understand the emotion of sticker images. Since the characteristics in stickers from the same theme are similar, we can only accurately predict the emotion by capturing the local information (e.g., expressions, poses) and understanding the global information (e.g., relations among objects). To tackle this challenge, we propose a LOcal Re-Attention multimodal network (LORA) to learn sticker emotions in an end-to-end manner. Different from previous approaches using convolutional neural networks, LORA employs the vision transformer to extract visual features, leading to better capture the global relations. In addition, we design a local re-attention module to focus on important region information. Then a simple but efficient modal fusion module combines visual and language features. Extensive experiments are performed on the SER30K and other emotion recognition datasets, demonstrating the effectiveness of our proposed method. Our code, model and dataset are released on https://github.com/nku-shengzheliu/SER30K.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition
    Wu, Xiaoping
    Zhan, Chi
    Lai, Yu-Kun
    Cheng, Ming-Ming
    Yang, Jufeng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8779 - 8788
  • [22] Large-scale RDF Dataset Slicing
    Marx, Edgard
    Shekarpour, Saeedeh
    Auer, Soeren
    Ngomo, Axel-Cyrille Ngonga
    [J]. 2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, : 228 - 235
  • [23] Euler Clustering on Large-scale Dataset
    Wu, Jian-Sheng
    Zheng, Wei-Shi
    Lai, Jian-Huang
    Suen, Ching Y.
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (04) : 502 - 515
  • [24] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [25] ArgAnalysis35K: A large-scale dataset for Argument Quality Analysis
    Joshi, Omkar Jayant
    Pitre, Priya Nitin
    Haribhakta, Yashodhara
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 13916 - 13931
  • [26] VStego800K: Large-Scale Steganalysis Dataset for Streaming Voice
    Xu, Xuan
    Guo, Shengnan
    Fang, Zhengyang
    Zhou, Pengcheng
    Yang, Zhongliang
    Zhou, Linna
    [J]. DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 292 - 303
  • [27] LogoDet-3K. A Large-scale Image Dataset for Logo Detection
    Wang, Jing
    Min, Weiqing
    Hou, Sujuan
    Ma, Shengnan
    Zheng, Yuanjie
    Jiang, Shuqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [28] The Jester Dataset: A Large-Scale Video Dataset of Human Gestures
    Materzynska, Joanna
    Berger, Guillaume
    Bax, Ingo
    Memisevic, Roland
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2874 - 2882
  • [29] POLIMI-ITW-S: A large-scale dataset for human activity recognition in the wild
    Quan, Hao
    Hu, Yu
    Bonarini, Andrea
    [J]. DATA IN BRIEF, 2022, 43
  • [30] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
    Guo, Yandong
    Zhang, Lei
    Hu, Yuxiao
    He, Xiaodong
    Gao, Jianfeng
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 87 - 102