Learning Visual Balance from Large-scale Datasets of Aesthetically Highly Rated Images

被引:12
|
作者
Jahanian, Ali [1 ]
Vishwanathan, S. V. N. [2 ,3 ]
Allebach, Jan P. [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
来源
关键词
Visual balance; Arnheim's theory of visual rightness; layout; aesthetics; automatic visual design; the Rule of Thirds; symmetry; design mining; SPATIAL COMPOSITION; GOLDEN SECTION; PHOTOGRAPHS; PERCEPTION; ART;
D O I
10.1117/12.2084548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of visual balance is innate for humans, and influences how we perceive visual aesthetics and cognize harmony. Although visual balance is a vital principle of design and taught in schools of designs, it is barely quantified. On the other hand, with emergence of automantic/semi-automatic visual designs for self-publishing, learning visual balance and computationally modeling it, may escalate aesthetics of such designs. In this paper, we present how questing for understanding visual balance inspired us to revisit one of the well-known theories in visual arts, the so called theory of "visual rightness", elucidated by Arnheim. We define Arnheim's hypothesis as a design mining problem with the goal of learning visual balance from work of professionals. We collected a dataset of 120K images that are aesthetically highly rated, from a professional photography website. We then computed factors that contribute to visual balance based on the notion of visual saliency. We fitted a mixture of Gaussians to the saliency maps of the images, and obtained the hotspots of the images. Our inferred Gaussians align with Arnheim's hotspots, and confirm his theory. Moreover, the results support the viability of the center of mass, symmetry, as well as the Rule of Thirds in our dataset.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Learning to Index in Large-Scale Datasets
    Prayoonwong, Amorntip
    Wang, Cheng-Hsien
    Chiu, Chih-Yi
    [J]. MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 305 - 316
  • [2] On-the-fly learning for visual search of large-scale image and video datasets
    Chatfield, Ken
    Arandjelovic, Relja
    Parkhi, Omkar
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2015, 4 (02) : 75 - 93
  • [3] DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps
    Bertucci D.
    Hamid M.M.
    Anand Y.
    Ruangrotsakun A.
    Tabatabai D.
    Perez M.
    Kahng M.
    [J]. IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 320 - 330
  • [4] Learning Bayesian Network Structure from Large-scale Datasets
    Hong, Yu
    Xia, Xiaoling
    Le, Jiajin
    Zhou, Xiangdong
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 258 - 264
  • [5] Learning From Noisy Large-Scale Datasets With Minimal Supervision
    Veit, Andreas
    Alldrin, Neil
    Chechik, Gal
    Krasin, Ivan
    Gupta, Abhinav
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6575 - 6583
  • [6] GUILD - A Generator for Usable Images in Large-Scale Datasets
    Roch, Peter
    Nejad, Bijan Shahbaz
    Handte, Marcus
    Marron, Pedro Jose
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 245 - 258
  • [7] Supervised machine learning for diagnostic classification from large-scale neuroimaging datasets
    Lanka, Pradyumna
    Rangaprakash, D.
    Dretsch, Michael N.
    Katz, Jeffrey S.
    Denney, Thomas S., Jr.
    Deshpande, Gopikrishna
    [J]. BRAIN IMAGING AND BEHAVIOR, 2020, 14 (06) : 2378 - 2416
  • [8] Supervised machine learning for diagnostic classification from large-scale neuroimaging datasets
    Pradyumna Lanka
    D Rangaprakash
    Michael N. Dretsch
    Jeffrey S. Katz
    Thomas S. Denney
    Gopikrishna Deshpande
    [J]. Brain Imaging and Behavior, 2020, 14 : 2378 - 2416
  • [9] Learning to Match Images in Large-Scale Collections
    Cao, Song
    Snavely, Noah
    [J]. COMPUTER VISION - ECCV 2012: WORKSHOPS AND DEMONSTRATIONS, PT I, 2012, 7583 : 259 - 270
  • [10] Datasets, tasks, and training methods for large-scale hypergraph learning
    Kim, Sunwoo
    Lee, Dongjin
    Kim, Yul
    Park, Jungho
    Hwang, Taeho
    Shin, Kijung
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (06) : 2216 - 2254