Unbiased Scene Graph Generation Using Predicate Similarities

Times cited: 0
Authors
Matsui, Yusuke [1 ]
Ohashi, Misaki [1 ]
Affiliation
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Task analysis; Knowledge transfer; Feature extraction; Visualization; Training; Computer vision; Transfer learning; Bioinformatics; Genomics; Classification algorithms; Scene classification; Scene graph; unbiased generation; predicate similarities; transfer learning; long-tailed distribution; SMOTE;
DOI
10.1109/ACCESS.2024.3424230
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Scene graphs are widely applied in computer vision as a graphical representation of relationships between objects shown in images. However, these applications have not yet reached a practical stage of development owing to biased training caused by long-tailed predicate distributions. In recent years, many studies have tackled this problem. In contrast, relatively few works have considered predicate similarities as a dataset feature that also leads to biased predictions. Because of this feature, infrequent predicates (e.g., "parked on", "covered in") are easily misclassified as closely related frequent predicates (e.g., "on", "in"). Utilizing predicate similarities, we propose a new classification scheme that branches the process into several fine-grained classifiers for groups of similar predicates. The classifiers aim to capture the differences among similar predicates in detail. We also introduce the idea of transfer learning to enhance the features for predicates that lack sufficient training samples to learn descriptive representations. Our target here is to improve the average precision scores even for instances with tail predicates. The results of extensive experiments on the Visual Genome dataset show that the combination of our method and an existing debiasing approach greatly improves performance on tail predicates in the challenging SGCls/SGDet tasks. Nonetheless, the overall performance of the proposed approach does not reach that of the current state of the art, so further analysis remains necessary as future work.
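The branching idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the groups are formed or how the classifiers are built, so everything here is an assumption made for illustration: the grouping in `GROUPS`, the function `predict_predicate`, and the hand-written scores that stand in for model outputs. Only the example predicates are taken from the abstract.

```python
from typing import Dict, List

# Hypothetical groups of similar predicates. The predicate names come from the
# abstract; the grouping itself is an assumption, not the paper's grouping.
GROUPS: Dict[str, List[str]] = {
    "on": ["on", "parked on"],
    "in": ["in", "covered in"],
}


def argmax(scores: Dict[str, float]) -> str:
    """Return the key with the highest score."""
    return max(scores, key=scores.get)


def predict_predicate(
    group_scores: Dict[str, float],
    fine_scores: Dict[str, Dict[str, float]],
) -> str:
    """Two-stage prediction: pick a predicate group first, then let that
    group's fine-grained classifier choose among its similar predicates."""
    group = argmax(group_scores)
    candidates = {p: fine_scores[group][p] for p in GROUPS[group]}
    return argmax(candidates)


# Stand-in scores (assumed values, not model outputs from the paper).
group_scores = {"on": 0.7, "in": 0.3}
fine_scores = {
    "on": {"on": 0.4, "parked on": 0.6},
    "in": {"in": 0.9, "covered in": 0.1},
}

prediction = predict_predicate(group_scores, fine_scores)
print(prediction)
```

In this sketch the infrequent predicate competes only against the members of its own group in the second stage, which is one plausible reading of how a fine-grained classifier could separate "parked on" from "on".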
Pages: 95507-95516
Number of pages: 10