Learning Visual Balance from Large-scale Datasets of Aesthetically Highly Rated Images

Cited by: 12
Authors
Jahanian, Ali [1]
Vishwanathan, S. V. N. [2, 3]
Allebach, Jan P. [1]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
Keywords
Visual balance; Arnheim's theory of visual rightness; layout; aesthetics; automatic visual design; the Rule of Thirds; symmetry; design mining; SPATIAL COMPOSITION; GOLDEN SECTION; PHOTOGRAPHS; PERCEPTION; ART;
DOI
10.1117/12.2084548
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The concept of visual balance is innate for humans, and influences how we perceive visual aesthetics and cognize harmony. Although visual balance is a vital principle of design and is taught in schools of design, it has rarely been quantified. At the same time, with the emergence of automatic and semi-automatic visual design tools for self-publishing, learning visual balance and modeling it computationally may improve the aesthetics of such designs. In this paper, we present how the quest to understand visual balance inspired us to revisit one of the well-known theories in visual arts, the so-called theory of "visual rightness", elucidated by Arnheim. We frame Arnheim's hypothesis as a design mining problem with the goal of learning visual balance from the work of professionals. We collected a dataset of 120K aesthetically highly rated images from a professional photography website. We then computed factors that contribute to visual balance based on the notion of visual saliency. We fitted a mixture of Gaussians to the saliency maps of the images, and obtained the hotspots of the images. Our inferred Gaussians align with Arnheim's hotspots, and confirm his theory. Moreover, the results support the viability of the center of mass, symmetry, and the Rule of Thirds in our dataset.
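To make the balance cues named in the abstract concrete, the following is a minimal sketch of how a center of mass, a left-right symmetry score, and the distance to the nearest Rule of Thirds intersection could be computed from a single saliency map. It is not the authors' implementation: the function names, the normalization of coordinates to [0, 1], and the symmetry score are all assumptions made for illustration, and the mixture-of-Gaussians fitting step is not shown.

```python
# Illustrative sketch only; names and scoring choices are assumptions,
# not taken from the paper.

def center_of_mass(saliency):
    """Return the saliency-weighted centroid (x, y), normalized to [0, 1]."""
    height = len(saliency)
    width = len(saliency[0])
    total = sum(sum(row) for row in saliency)
    if total <= 0:
        raise ValueError("saliency map must contain positive mass")
    # Pixel centers are placed at (index + 0.5) / size.
    cx = sum(v * (x + 0.5) / width
             for row in saliency for x, v in enumerate(row)) / total
    cy = sum(v * (y + 0.5) / height
             for y, row in enumerate(saliency) for v in row) / total
    return cx, cy


def nearest_thirds_point(point):
    """Return the closest Rule of Thirds intersection and its distance."""
    thirds = [(x, y) for x in (1 / 3, 2 / 3) for y in (1 / 3, 2 / 3)]

    def sq_dist(p):
        return (p[0] - point[0]) ** 2 + (p[1] - point[1]) ** 2

    best = min(thirds, key=sq_dist)
    return best, sq_dist(best) ** 0.5


def horizontal_symmetry(saliency):
    """Score left-right symmetry in [0, 1]; 1 means mirror-identical."""
    total = sum(sum(row) for row in saliency)
    if total <= 0:
        raise ValueError("saliency map must contain positive mass")
    # Each mirrored pair is visited twice, so the maximum of diff is 2 * total.
    diff = sum(abs(row[x] - row[-1 - x])
               for row in saliency for x in range(len(row)))
    return 1.0 - diff / (2.0 * total)


if __name__ == "__main__":
    # Hypothetical toy saliency map with all mass in the central pixel.
    toy = [[0, 0, 0],
           [0, 1, 0],
           [0, 0, 0]]
    print(center_of_mass(toy))
    print(horizontal_symmetry(toy))
    print(nearest_thirds_point(center_of_mass(toy)))
```

Normalizing coordinates to the unit square lets images of different sizes be compared directly, which is one plausible choice when aggregating statistics over a large collection.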
Pages: 9