Decomposed Prototype Learning for Few-Shot Scene Graph Generation

被引:0
|
作者
Li, Xingchen [1 ,2 ]
Xiao, Jun [1 ]
Chen, Guikun [1 ]
Feng, Yinfu [3 ]
Yang, Yi [1 ]
Liu, An-an [4 ]
Chen, Long [5 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] China Mobile Zhejiang Innovat Res Inst, Hangzhou, Peoples R China
[3] Alibaba Grp, Guangzhou, Peoples R China
[4] Tianjin Univ, Tianjin, Peoples R China
[5] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Scene Graph Generation (SGG); Few-Shot Learning; Prompt Learning; Prototype Learning;
D O I
10.1145/3700877
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today's scene graph generation (SGG) models typically require abundant manual annotations to learn new predicate types. Therefore, it is difficult to apply them to real-world applications with massive uncommon predicate categories whose annotations are hard to collect. In this article, we focus on Few-Shot SGG (FSSGG), which encourages SGG models to be able to quickly transfer previous knowledge and recognize unseen predicates well with only a few examples. However, current methods for FSSGG are hindered by the high intra-class variance of predicate categories in SGG: On one hand, each predicate category commonly has multiple semantic meanings under different contexts. On the other hand, the visual appearance of relation triplets with the same predicate differs greatly under different subject-object compositions. Such great variance of inputs makes it hard to learn generalizable representation for each predicate category with current few-shot learning (FSL) methods. However, we found that this intra-class variance of predicates is highly related to the composed subjects and objects. To model the intra-class variance of predicates with subject-object context, we propose a novel Decomposed Prototype Learning (DPL) model for FSSGG. Specifically, we first construct a decomposable prototype space to capture diverse semantics and visual patterns of subjects and objects for predicates by decomposing them into multiple prototypes. Afterwards, we integrate these prototypes with different weights to generate query-adaptive predicate representation with more reliable semantics for each query sample. We conduct extensive experiments and compare with various baseline methods to show the effectiveness of our method.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Subspace prototype learning for few-Shot remote sensing scene classification
    Wang, Wuli
    Xing, Lei
    Ren, Peng
    Jiang, Yumeng
    Wang, Ge
    Liu, Baodi
    SIGNAL PROCESSING, 2023, 208
  • [2] Prototype Completion for Few-Shot Learning
    Zhang, Baoquan
    Li, Xutao
    Ye, Yunming
    Feng, Shanshan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12250 - 12268
  • [3] Prototype Reinforcement for Few-Shot Learning
    Xu, Liheng
    Xie, Qian
    Jiang, Baoqing
    Zhang, Jiashuo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4912 - 4916
  • [4] TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning
    Zhang, Chenrui
    Lyu, Xiaoqing
    Tang, Zhi
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1641 - 1649
  • [5] Prototype Calibration with Feature Generation for Few-Shot Remote Sensing Image Scene Classification
    Zeng, Qingjie
    Geng, Jie
    Huang, Kai
    Jiang, Wen
    Guo, Jun
    REMOTE SENSING, 2021, 13 (14)
  • [6] Few-shot learning with representative global prototype
    Liu, Yukun
    Shi, Daming
    Lin, Hexiu
    NEURAL NETWORKS, 2024, 180
  • [7] Decomposed Meta-Learning for Few-Shot Sequence Labeling
    Ma, Tingting
    Wu, Qianhui
    Jiang, Huiqiang
    Lin, Jieru
    Karlsson, Borje F.
    Zhao, Tiejun
    Lin, Chin-Yew
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1980 - 1993
  • [8] Graph Few-shot Learning with Attribute Matching
    Wang, Ning
    Luo, Minnan
    Ding, Kaize
    Zhang, Lingling
    Li, Jundong
    Zheng, Qinghua
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1545 - 1554
  • [9] Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation
    Lu, Bin
    Gan, Xiaoying
    Yang, Lina
    Zhang, Weinan
    Fu, Luoyi
    Wang, Xinbing
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1152 - 1161
  • [10] Prototype Relationship Optimization Network for Few-Shot Learning
    Wang, Dengzhong
    Zhong, Yuan
    Ma, Yunfei
    Guo, Chunsheng
    IEEJ Transactions on Electrical and Electronic Engineering, 2024,