UNDERSTANDING THE AESTHETIC STYLES OF SOCIAL IMAGES

被引:0
|
作者
Ma, Yihui [1 ,2 ]
Jia, Jia [1 ,2 ]
Hou, Yufan [1 ,2 ]
Bu, Yaohua [1 ,2 ,3 ]
Han, Wentao [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Minist Educ, Key Lab Pervas Comp, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing, Peoples R China
[3] Tsinghua Univ, Acad Arts & Design, Beijing, Peoples R China
关键词
Aesthetic style; social image; autoencoder; dimensional space;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Aesthetic perception is nearly the most direct impact people could receive from images. Recent research on image understanding is mainly focused on image analysis, recognition and classification, regardless of the aesthetic meanings embedded in images. In this paper, we systematically study the problem of understanding the aesthetic styles of social images. First, we build a two-dimensional Image Aesthetic Space (IAS) to describe image aesthetic styles quantitatively and universally. Then, we propose a Bimodal Deep Autoencoder with Cross Edges (BDA-CE) to deeply fuse the social image related features (i.e. images' visual features, tags' textual features). Connecting BDA-CE with a regression model, we are able to map the features to the IAS. The experimental results on the benchmark dataset we build with 120 thousand Flickr images show that our model outperforms (+ 5.5% in terms of MSE) alternative baselines. Furthermore, we conduct an interesting case study to demonstrate the advantages of our methods.
引用
收藏
页码:3056 / 3060
页数:5
相关论文
共 50 条