Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression

被引:157
|
作者
Zhao, Sicheng [1 ]
Yao, Hongxun [2 ]
Gao, Yue [1 ]
Ji, Rongrong [3 ]
Ding, Guiguang [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[3] Xiamen Univ, Dept Cognit Sci, Sch Informat Sci & Engn, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Gaussian mixture model; image emotion; multitask learning; probability distribution; valence-arousal; shared sparse regression (SSR); RECOGNITION; CLASSIFICATION; VALENCE; SYSTEM;
D O I
10.1109/TMM.2016.2617741
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous works on image emotion analysis mainly focused on predicting the dominant emotion category or the average dimension values of an image for affective image classification and regression. However, this is often insufficient in various real-world applications, as the emotions that are evoked in viewers by an image are highly subjective and different. In this paper, we propose to predict the continuous probability distribution of image emotions which are represented in dimensional valence-arousal space. We carried out large-scale statistical analysis on the constructed Image-Emotion-Social-Net dataset, on which we observed that the emotion distribution can be well-modeled by a Gaussian mixture model. This model is estimated by an expectation-maximization algorithm with specified initializations. Then, we extract commonly used emotion features at different levels for each image. Finally, we formalize the emotion distribution prediction task as a shared sparse regression (SSR) problem and extend it to multitask settings, named multitask shared sparse regression (MTSSR),to explore the latent information between different prediction tasks. SSR and MTSSR are optimized by iteratively reweighted least squares. Experiments are conducted on the Image-Emotion-Social-Net dataset with comparisons to three alternative baselines. The quantitative results demonstrate the superiority of the proposed method.
引用
收藏
页码:632 / 645
页数:14
相关论文
共 46 条
  • [21] Oil Spill SAR Image Segmentation via Probability Distribution Modeling
    Chen, Fang
    Zhang, Aihua
    Balzter, Heiko
    Ren, Peng
    Zhou, Huiyu
    IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15 : 533 - 554
  • [22] Subspace Learning via Local Probability Distribution for Hyperspectral Image Classification
    Luo, Huiwu
    Tang, Yuan Yan
    Yang, Lina
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [23] End-to-End Saliency Mapping via Probability Distribution Prediction
    Jetley, Saumya
    Murray, Naila
    Vig, Eleonora
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5753 - 5761
  • [24] Image completion using prediction concept via support vector regression
    Cho-Wei Shih
    Tsung-Hsuan Lai
    Hui-Chuan Chu
    Yuh-Min Chen
    Machine Vision and Applications, 2013, 24 : 753 - 768
  • [25] Hyperspectral Image Target Detection via Weighted Joint K-Nearest Neighbor and Multitask Learning Sparse Representation
    Ou, Xianfeng
    Zhang, Yiming
    Wang, Hanpu
    Tu, Bing
    Guo, Longyuan
    Zhang, Guoyun
    Xu, Zhi
    IEEE ACCESS, 2020, 8 : 11503 - 11511
  • [26] Image completion using prediction concept via support vector regression
    Shih, Cho-Wei
    Lai, Tsung-Hsuan
    Chu, Hui-Chuan
    Chen, Yuh-Min
    MACHINE VISION AND APPLICATIONS, 2013, 24 (04) : 753 - 768
  • [27] Hyperspectral Image Target Detection via Weighted Joint K-Nearest Neighbor and Multitask Learning Sparse Representation
    Ou, Xianfeng
    Zhang, Yiming
    Wang, Hanpu
    Tu, Bing
    Guo, Longyuan
    Zhang, Guoyun
    Xu, Zhi
    IEEE Access, 2020, 8 : 11503 - 11511
  • [28] Identifying quantitative trait loci via group-sparse multitask regression and feature selection: an imaging genetics study of the ADNI cohort
    Wang, Hua
    Nie, Feiping
    Huang, Heng
    Kim, Sungeun
    Nho, Kwangsik
    Risacher, Shannon L.
    Saykin, Andrew J.
    Shen, Li
    BIOINFORMATICS, 2012, 28 (02) : 229 - 237
  • [29] Learning the degradation distribution for medical image superresolution via sparse swin transformer
    Han, Xianjun
    Xie, Zhaoyang
    Chen, Qianqian
    Li, Xuejun
    Yang, Hongyu
    COMPUTERS & GRAPHICS-UK, 2023, 114 : 168 - 178
  • [30] NONLINEAR PREDICTION OF MULTIDIMENSIONAL SIGNALS VIA DEEP REGRESSION WITH APPLICATIONS TO IMAGE CODING
    Zhang, Xi
    Wu, Xiaolin
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1602 - 1606