Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression

Cited by: 157

Authors:

Zhao, Sicheng [1]

Yao, Hongxun [2]

Gao, Yue [1]

Ji, Rongrong [3]

Ding, Guiguang [1]

Affiliations:

[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China

[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China

[3] Xiamen Univ, Dept Cognit Sci, Sch Informat Sci & Engn, Xiamen 361005, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2017 / Vol. 19 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

Gaussian mixture model; image emotion; multitask learning; probability distribution; valence-arousal; shared sparse regression (SSR); RECOGNITION; CLASSIFICATION; VALENCE; SYSTEM;

DOI:

10.1109/TMM.2016.2617741

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Previous works on image emotion analysis mainly focused on predicting the dominant emotion category or the average dimension values of an image for affective image classification and regression. However, this is often insufficient in various real-world applications, as the emotions that are evoked in viewers by an image are highly subjective and different. In this paper, we propose to predict the continuous probability distribution of image emotions which are represented in dimensional valence-arousal space. We carried out large-scale statistical analysis on the constructed Image-Emotion-Social-Net dataset, on which we observed that the emotion distribution can be well modeled by a Gaussian mixture model. This model is estimated by an expectation-maximization algorithm with specified initializations. Then, we extract commonly used emotion features at different levels for each image. Finally, we formalize the emotion distribution prediction task as a shared sparse regression (SSR) problem and extend it to multitask settings, named multitask shared sparse regression (MTSSR), to explore the latent information between different prediction tasks. SSR and MTSSR are optimized by iteratively reweighted least squares. Experiments are conducted on the Image-Emotion-Social-Net dataset with comparisons to three alternative baselines. The quantitative results demonstrate the superiority of the proposed method.


Pages: 632-645

Page count: 14

