Towards Artistic Image Aesthetics Assessment: a Large-scale Dataset and a New Method

被引:12
|
作者
Yi, Ran [1 ]
Tian, Haoyuan [1 ]
Gu, Zhihao [1 ]
Lai, Yu-Kun [2 ]
Rosin, Paul L. [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Cardiff Univ, Cardiff, Wales
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.02144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image aesthetics assessment (IAA) is a challenging task due to its highly subjective nature. Most of the current studies rely on large-scale datasets (e.g., AVA and AADB) to learn a general model for all kinds of photography images. However, little light has been shed on measuring the aesthetic quality of artistic images, and the existing datasets only contain relatively few artworks. Such a defect is a great obstacle to the aesthetic assessment of artistic images. To fill the gap in the field of artistic image aesthetics assessment (AIAA), we first introduce a large-scale AIAA dataset: Boldbrush Artistic Image Dataset (BAID), which consists of 60,337 artistic images covering various art forms, with more than 360,000 votes from online users. We then propose a new method, SAAN (Style-specific Art Assessment Network), which can effectively extract and utilize style-specific and generic aesthetic information to evaluate artistic images. Experiments demonstrate that our proposed approach outperforms existing IAA methods on the proposed BAID dataset according to quantitative comparisons. We believe the proposed dataset and method can serve as a foundation for future AIAA works and inspire more research in this field. Dataset and code are available at: https://github.com/Dreemurr-T/BAID.git
引用
收藏
页码:22388 / 22397
页数:10
相关论文
共 50 条
  • [31] A large-scale solar dynamics observatory image dataset for computer vision applications
    Ahmet Kucuk
    Juan M. Banda
    Rafal A. Angryk
    Scientific Data, 4
  • [32] MiDaS: a large-scale Minecraft dataset for non-natural image benchmarking
    Torpey, David
    Parkin, Max
    Alter, Jonah
    Klein, Richard
    James, Steven
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (01)
  • [33] I-Nema: a large-scale microscopic image dataset for nematode recognition
    Shenglin Lu
    Sheldon Fung
    Yihao Wang
    Xuequan Lu
    Wanli Ouyang
    Xue Qing
    Hongmei Li
    Neural Computing and Applications, 2025, 37 (4) : 2763 - 2773
  • [34] A large-scale container dataset and a baseline method for container hole localization
    Diao, Yunfeng
    Tang, Xin
    Wang, He
    Taylor, Emma Christophine Florence
    Xiao, Shirui
    Xie, Mengtian
    Cheng, Wenming
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (03) : 577 - 589
  • [35] WHU-OHS: A benchmark dataset for large-scale Hersepctral Image classification
    Li, Jiayi
    Huang, Xin
    Tu, Lilin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 113
  • [36] LogoDet-3K. A Large-scale Image Dataset for Logo Detection
    Wang, Jing
    Min, Weiqing
    Hou, Sujuan
    Ma, Shengnan
    Zheng, Yuanjie
    Jiang, Shuqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [37] A large-scale container dataset and a baseline method for container hole localization
    Yunfeng Diao
    Xin Tang
    He Wang
    Emma Christophine Florence Taylor
    Shirui Xiao
    Mengtian Xie
    Wenming Cheng
    Journal of Real-Time Image Processing, 2022, 19 : 577 - 589
  • [38] The Introduction of a New Risk Index Assessment Method for Large-scale Olympic Venues
    Zhang, Qingsong
    Zhao, Guomin
    THEORY AND PRACTICE OF RISK ANALYSIS AND CRISIS RESPONSE, PROCEEDINGS, 2008, : 761 - +
  • [39] CelebHair: A New Large-Scale Dataset for Hairstyle Recommendation Based on CelebA
    Chen, Yutao
    Zhang, Yuxuan
    Huang, Zhongrui
    Luo, Zhenyao
    Chen, Jinpeng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 323 - 336
  • [40] The Jester Dataset: A Large-Scale Video Dataset of Human Gestures
    Materzynska, Joanna
    Berger, Guillaume
    Bax, Ingo
    Memisevic, Roland
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2874 - 2882