Affective image recognition with multi-attribute knowledge in deep neural networks

被引:1
|
作者
Zhang, Hao [1 ]
Luo, Gaifang [2 ]
Yue, Yingying [3 ]
He, Kangjian [1 ]
Xu, Dan [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Peoples R China
[2] Shanxi Agr Univ, Sch Software, Jinzhong, Peoples R China
[3] Yuxi Normal Univ, Sch Math & Informat Technol, Yuxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Affective image recognition; Multi-attribute; Visual details; Semantics; Deep metric learning;
D O I
10.1007/s11042-023-16081-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incorporating visual attributes such as objects and scene features into deep models has been proved valuable for affective image recognition. In general, the existing works achieve it by either fine-tuning popular CNNs for emotion recognition, or connecting external attributes through additional well-designed modules. However, they do not realize the diversity of emotional representations for different styles of affective images, or utilize the inter-hierarchical correlations in deep models. In this paper, we propose a multi-attribute model which incorporates different visual concepts to solve this problem. The model consists of 2 branch modules from local to global view: one trains a gram encoder to capture local visual details, and the other trains a semantic tokenizer to extract global semantics simultaneously. Through a fusion layer, we represent image sentiments with aggregated attributes. Different from the existing methods, our model is composed of stacked CNNs without additional backbones, and it shows the great ability to learn hierarchical attributes from internal intermediate features. Furthermore, inspired by deep metric learning, we design an emotional contrast loss to consider dynamic polarity embedded in affective images, and optimize the model within cross-entropy loss as well. A comprehensive evaluation on 5 datasets supports that our model outperforms the others.
引用
收藏
页码:18353 / 18379
页数:27
相关论文
共 50 条
  • [1] Affective image recognition with multi-attribute knowledge in deep neural networks
    Hao Zhang
    Gaifang Luo
    Yingying Yue
    Kangjian He
    Dan Xu
    [J]. Multimedia Tools and Applications, 2024, 83 : 18353 - 18379
  • [2] Multi-attribute Open Set Recognition
    Saranrittichai, Piyapat
    Mummadi, Chaithanya Kumar
    Blaiotta, Claudia
    Munoz, Mauricio
    Fischer, Volker
    [J]. PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 101 - 115
  • [3] Attribute Knowledge Integration for Speech Recognition Based on Multi-task Learning Neural Networks
    Zheng, Hao
    Yang, Zhanlei
    Qiao, Liwei
    Li, Jianping
    Liu, Wenju
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 543 - 547
  • [4] Multi-attribute Learning for Pedestrian Attribute Recognition in Surveillance Scenarios
    Li, Dangwei
    Chen, Xiaotang
    Huang, Kaiqi
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 111 - 115
  • [5] FILLNG THE GAPS: REDUCING THE COMPLEXITY OF NETWORKS FOR MULTI-ATTRIBUTE IMAGE AESTHETIC PREDICTION
    Kairanbay, Magzhan
    See, John
    Wong, Lai-Kuan
    Hii, Yong-Lian
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3051 - 3055
  • [6] Efficient multi-attribute image classification through context-driven networks
    Banger, Sean
    Ceresani, Ryan
    Twedt, Jason
    [J]. AUTOMATIC TARGET RECOGNITION XXXII, 2022, 12096
  • [7] A Case Study on Attribute Recognition of Heated Metal Mark Image Using Deep Convolutional Neural Networks
    Mao, Keming
    Lu, Duo
    E, Dazhi
    Tan, Zhenhua
    [J]. SENSORS, 2018, 18 (06)
  • [8] Simultaneous Multi-Attribute Image-to-Image Translation Using Parallel Latent Transform Networks
    Xu, Sen-Zhe
    Lai, Yu-Kun
    [J]. COMPUTER GRAPHICS FORUM, 2020, 39 (07) : 531 - 542
  • [9] Robustness of Deep Convolutional Neural Networks for Image Recognition
    Ulicny, Matej
    Lundstrom, Jens
    Byttner, Stefan
    [J]. INTELLIGENT COMPUTING SYSTEMS, 2016, 597 : 16 - 30
  • [10] Indexing and Finding Deep Neural Networks for Image Recognition
    Nailussa'ada
    Bintang, Fazlur Rahman
    Harsono, Tri
    Barakbah, Ali Ridho
    Takano, Kosuke
    [J]. INFORMATION MODELLING AND KNOWLEDGE BASES XXXI, 2020, 321 : 458 - 469