GroupRF: Panoptic Scene Graph Generation with group relation tokens

被引:0
|
作者
Wang, Hongyun [1 ,2 ]
Li, Jiachen [1 ,2 ]
Xiang, Xiang [3 ]
Xie, Qing [1 ,2 ]
Ma, Yanchun [1 ,2 ]
Liu, Yongjian [1 ,2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Hubei, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan 430070, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Panoptic Scene Graph Generation; Multiple relation token; Fine-grained interaction;
D O I
10.1016/j.jvcir.2025.104405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Panoptic Scene Graph Generation (PSG) aims to predict a variety of relations between pairs of objects within an image, and indicate the objects by panoptic segmentation masks instead of bounding boxes. Existing PSG methods attempt to straightforwardly fuse the object tokens for relation prediction, thus failing to fully utilize the interaction between the pairwise objects. To address this problem, we propose a novel framework named Group RelationFormer (GroupRF) to capture the fine-grained inter-dependency among all instances. Our method introduce a set of learnable tokens termed group rln tokens, which exploit fine-grained contextual interaction between object tokens with multiple attentive relations. In the process of relation prediction, we adopt multiple triplets to take advantage of the fine-grained interaction included in group rln tokens. We conduct comprehensive experiments on OpenPSG dataset, which show that our method outperforms the previous state-of-the-art method. Furthermore, we also show the effectiveness of our framework by ablation studies. Our code is available at https://github.com/WHY-student/GroupRF.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Adversarial Attacks on Scene Graph Generation
    Zhao, Mengnan
    Zhang, Lihe
    Wang, Wei
    Kong, Yuqiu
    Yin, Baocai
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3210 - 3225
  • [42] Scene Graph Generation With Hierarchical Context
    Ren, Guanghui
    Ren, Lejian
    Liao, Yue
    Liu, Si
    Li, Bo
    Han, Jizhong
    Yan, Shuicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 909 - 915
  • [43] Heterogeneous Learning for Scene Graph Generation
    He, Yunqing
    Ren, Tongwei
    Tang, Jinhui
    Wu, Gangshan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4704 - 4713
  • [44] Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
    Dong, Xingning
    Gan, Tian
    Song, Xuemeng
    Wu, Jianlong
    Cheng, Yuan
    Nie, Liqiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19405 - 19414
  • [45] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
    Kurosawa, Ikuto
    Kobayashi, Tetsunori
    Hayashi, Yoshihiko
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
  • [46] Graph-LSTM with Global Attribute for Scene Graph Generation
    Shao, Tong
    Wu, Dapeng Oliver
    Journal of Physics: Conference Series, 2021, 2003 (01)
  • [47] Dynamic Gated Graph Neural Networks for Scene Graph Generation
    Khademi, Mahmoud
    Schulte, Oliver
    COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 : 669 - 685
  • [48] Atom correlation based graph propagation for scene graph generation
    Lin, Bingqian
    Zhu, Yi
    Liang, Xiaodan
    PATTERN RECOGNITION, 2022, 122
  • [49] Dynamic Interactive Relation Capturing via Scene Graph Learning for Robotic Surgical Report Generation
    Wang, Hongqiu
    Jin, Yueming
    Zhu, Lei
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2702 - 2709
  • [50] Unbiased Heterogeneous Scene Graph Generation with Relation-Aware Message Passing Neural Network
    Yoon, Kanghoon
    Kim, Kibum
    Moon, Jinyoung
    Park, Chanyoung
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3285 - 3294