GroupRF: Panoptic Scene Graph Generation with group relation tokens

被引:0
|
作者
Wang, Hongyun [1 ,2 ]
Li, Jiachen [1 ,2 ]
Xiang, Xiang [3 ]
Xie, Qing [1 ,2 ]
Ma, Yanchun [1 ,2 ]
Liu, Yongjian [1 ,2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Hubei, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan 430070, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Panoptic Scene Graph Generation; Multiple relation token; Fine-grained interaction;
D O I
10.1016/j.jvcir.2025.104405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Panoptic Scene Graph Generation (PSG) aims to predict a variety of relations between pairs of objects within an image, and indicate the objects by panoptic segmentation masks instead of bounding boxes. Existing PSG methods attempt to straightforwardly fuse the object tokens for relation prediction, thus failing to fully utilize the interaction between the pairwise objects. To address this problem, we propose a novel framework named Group RelationFormer (GroupRF) to capture the fine-grained inter-dependency among all instances. Our method introduce a set of learnable tokens termed group rln tokens, which exploit fine-grained contextual interaction between object tokens with multiple attentive relations. In the process of relation prediction, we adopt multiple triplets to take advantage of the fine-grained interaction included in group rln tokens. We conduct comprehensive experiments on OpenPSG dataset, which show that our method outperforms the previous state-of-the-art method. Furthermore, we also show the effectiveness of our framework by ablation studies. Our code is available at https://github.com/WHY-student/GroupRF.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Relation Detection with Transformers for Panoptic Scene Graph Generation
    Liu, Chang
    Yan, Wenchao
    Chen, Shilin
    Huang, Liqun
    Huang, Xiaotao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT IV, 2025, 15034 : 223 - 238
  • [2] Panoptic Scene Graph Generation
    Yang, Jingkang
    Ang, Yi Zhe
    Guo, Zujin
    Zhou, Kaiyang
    Zhang, Wayne
    Liu, Ziwei
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 178 - 196
  • [3] Panoptic Video Scene Graph Generation
    Yang, Jingkang
    Peng, Wenxuan
    Li, Xiangtai
    Guo, Zujin
    Chen, Liangyu
    Li, Bo
    Ma, Zheng
    Zhou, Kaiyang
    Zhang, Wayne
    Loy, Chen Change
    Liu, Ziwei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18675 - 18685
  • [4] Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation
    Wang, Jinghao
    Wen, Zhengyu
    Li, Xiangtai
    Guo, Zujin
    Yang, Jingkang
    Liu, Ziwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10452 - 10465
  • [5] Focusing on Flexible Masks: A Novel Framework for Panoptic Scene Graph Generation with Relation Constraints
    Yang, Jiarui
    Wang, Chuan
    Liu, Zeming
    Wu, Jiahong
    Wang, Dongsheng
    Yang, Liang
    Cao, Xiaochun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4209 - 4218
  • [6] 4D Panoptic Scene Graph Generation
    Yang, Jingkang
    Cen, Jun
    Peng, Wenxuan
    Liu, Shuai
    Hong, Fangzhou
    Li, Xiangtai
    Zhou, Kaiyang
    Chen, Qifeng
    Liu, Ziwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36, NEURIPS 2023, 2023,
  • [7] TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
    Zhao, Chengyang
    Shen, Yikang
    Chen, Zhenfang
    Ding, Mingyu
    Gan, Chuang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2827 - 2838
  • [8] Panoptic Scene Graph Generation with Semantics-Prototype Learning
    Li, Li
    Ji, Wei
    Wu, Yiming
    Li, Mengze
    Qin, You
    Wei, Lina
    Zimmermann, Roger
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3145 - 3153
  • [9] A Fair Ranking and New Model for Panoptic Scene Graph Generation
    Lorenz, Julian
    Pest, Alexander
    Kienzle, Daniel
    Ludwig, Katja
    Lienhart, Rainer
    COMPUTER VISION - ECCV 2024, PT LXI, 2025, 15119 : 148 - 164
  • [10] DOMAIN-WISE INVARIANT LEARNING FOR PANOPTIC SCENE GRAPH GENERATION
    Li, Li
    Qin, You
    Ji, Wei
    Zhou, Yuxiao
    Zimmermann, Roger
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3165 - 3169