Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration

Cited by: 0
Authors
Lan, Ziwen [1]
Maeda, Keisuke [2]
Ogawa, Takahiro [2]
Haseyama, Miki [2]
Affiliations
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo, Japan
Funding
Japan Society for the Promotion of Science
Keywords
Task analysis; Semantics; Image classification; Correlation; Convolutional neural networks; Visualization; Context modeling; Hierarchical classification; anime illustration; attribute classification; graph convolutional networks; image captioning;
DOI
10.1109/ACCESS.2023.3265728
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this study, we present a hierarchical multi-modal multi-label attribute classification model for anime illustrations using graph convolutional networks (GCNs). We focus on multi-label attribute classification because creators of anime illustrations frequently and deliberately emphasize subtle features of characters and objects. To analyze the connections between attributes, we develop a multi-modal GCN-based model that can use semantic features of anime illustrations. To create features representing this semantic information, we construct a novel captioning framework by combining real-world images with their animated style transformations. Because the attributes of anime illustrations are hierarchical, we also introduce a loss function that considers the attribute hierarchy to improve classification accuracy. The proposed method makes two main contributions: 1) By introducing a GCN with semantic features into the multi-label attribute classification task for anime illustrations, we capture more comprehensive relationships between attributes. 2) By following certain rules to build a hierarchical structure of attributes that appear frequently in anime illustrations, we further capture subordinate relationships between attributes. We demonstrate the effectiveness of the proposed method through experiments.
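The abstract names two ingredients without giving their formulas: graph convolution over attribute relationships and a loss that respects the attribute hierarchy. The sketch below is only an illustration of those general ideas, not the authors' model. It assumes the widely used symmetric normalization D^{-1/2}(A + I)D^{-1/2} for the graph convolution step, and it swaps in a generic hierarchy-violation penalty (a child attribute should not be more probable than its parent) in place of the paper's loss, which this record does not specify. All function names, the weight `lam`, and the example attribute indices are hypothetical.

```python
import math


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so each attribute node keeps its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def gcn_layer(adj_norm, features, weight):
    """One graph convolution step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(adj_norm, features), weight)
    return [[max(0.0, v) for v in row] for row in out]


def bce(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over all attribute labels."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(probs)


def hierarchy_penalty(probs, parent_of):
    """Mean squared violation max(0, p_child - p_parent) over child/parent pairs.

    ASSUMPTION: this penalty is a generic stand-in, not the paper's loss.
    """
    if not parent_of:
        return 0.0
    total = sum(max(0.0, probs[c] - probs[p]) ** 2
                for c, p in parent_of.items())
    return total / len(parent_of)


def hierarchical_loss(probs, targets, parent_of, lam=0.5):
    """Classification loss plus a weighted hierarchy-consistency term."""
    return bce(probs, targets) + lam * hierarchy_penalty(probs, parent_of)


if __name__ == "__main__":
    # Hypothetical attributes: 0 = parent, 1 and 2 = its children.
    parent_of = {1: 0, 2: 0}
    adj = [[0, 1, 1], [1, 0, 0], [1, 0, 0]]
    hidden = gcn_layer(normalize_adjacency(adj),
                       [[1.0], [0.5], [0.2]], [[1.0]])
    print(hidden)
    print(hierarchical_loss([0.9, 0.5, 0.95], [1, 0, 1], parent_of))
```

In this sketch the penalty is zero whenever every child probability stays at or below its parent's, so the total reduces to plain binary cross-entropy for hierarchy-consistent predictions.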
Pages: 35447-35456
Number of pages: 10