GroupRF: Panoptic Scene Graph Generation with group relation tokens

被引:0
|
作者
Wang, Hongyun [1 ,2 ]
Li, Jiachen [1 ,2 ]
Xiang, Xiang [3 ]
Xie, Qing [1 ,2 ]
Ma, Yanchun [1 ,2 ]
Liu, Yongjian [1 ,2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Hubei, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan 430070, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Panoptic Scene Graph Generation; Multiple relation token; Fine-grained interaction;
D O I
10.1016/j.jvcir.2025.104405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Panoptic Scene Graph Generation (PSG) aims to predict a variety of relations between pairs of objects within an image, and indicate the objects by panoptic segmentation masks instead of bounding boxes. Existing PSG methods attempt to straightforwardly fuse the object tokens for relation prediction, thus failing to fully utilize the interaction between the pairwise objects. To address this problem, we propose a novel framework named Group RelationFormer (GroupRF) to capture the fine-grained inter-dependency among all instances. Our method introduce a set of learnable tokens termed group rln tokens, which exploit fine-grained contextual interaction between object tokens with multiple attentive relations. In the process of relation prediction, we adopt multiple triplets to take advantage of the fine-grained interaction included in group rln tokens. We conduct comprehensive experiments on OpenPSG dataset, which show that our method outperforms the previous state-of-the-art method. Furthermore, we also show the effectiveness of our framework by ablation studies. Our code is available at https://github.com/WHY-student/GroupRF.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Vision Relation Transformer for Unbiased Scene Graph Generation
    Sudhakaran, Gopika
    Dhami, Devendra Singh
    Kersting, Kristian
    Roth, Stefan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21825 - 21836
  • [22] Scene Adaptive Context Modeling and Balanced Relation Prediction for Scene Graph Generation
    Xu, Kai
    Wang, Lichun
    Li, Shuang
    Gao, Tong
    Yin, Baocai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (03)
  • [23] Improving rare relation inferring for scene graph generation using bipartite graph network
    Lu, Jiale
    Chen, Lianggangxu
    Guan, Haoyue
    Lin, Shaohui
    Gu, Chunhua
    Wang, Changbo
    He, Gaoqi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
  • [24] From Easy to Hard: Learning Curricular Shape-Aware Features for Robust Panoptic Scene Graph Generation
    Shi, Hanrong
    Li, Lin
    Xiao, Jun
    Zhuang, Yueting
    Chen, Long
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 489 - 508
  • [25] Haystack: A Panoptic Scene Graph Dataset to Evaluate Rare Predicate Classes
    Lorenz, Julian
    Barthel, Florian
    Kienzle, Daniel
    Lienhart, Rainer
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 62 - 70
  • [26] Attention-Translation-Relation Network for Scalable Scene Graph Generation
    Gkanatsios, Nikolaos
    Pitsikalis, Vassilis
    Koutras, Petros
    Maragos, Petros
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1754 - 1764
  • [27] Scene Graph Generation Based on Node-Relation Context Module
    Lin, Xin
    Li, Yonggang
    Liu, Chunping
    Ji, Yi
    Yang, Jianyu
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 134 - 145
  • [28] Relation-Specific Feature Augmentation for unbiased scene graph generation
    Liu, Zhihong
    Wang, Jianji
    Chen, Hui
    Ma, Yongqiang
    Zheng, Nanning
    PATTERN RECOGNITION, 2025, 157
  • [29] Unconditional Scene Graph Generation
    Garg, Sarthak
    Dhamo, Helisa
    Farshad, Azade
    Musatian, Sabrina
    Navab, Nassir
    Tombari, Federico
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 16342 - 16351
  • [30] Iterative Scene Graph Generation
    Khandelwal, Siddhesh
    Sigal, Leonid
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,