Lifelong Learning for Human-Object Interaction Detection

被引:0
|
作者
Sun, Bo [1 ]
Lu, Sixu [2 ]
He, Jun [1 ]
Yu, Lejun [2 ]
机构
[1] Beijing Normal Univ, Sch Artificial Intelligence, Coll Educ Future, Beijing, Zhuhai, Peoples R China
[2] Beijing Normal Univ, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
human-object interaction detection; lifelong learning; contrastive learning; object detection; incremental learning;
D O I
10.1109/ICICN56848.2022.10006558
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human-Object Interaction (HOI) Detection is a critical task in scene understanding, which aims to detect the triplet<human, object, interaction> in images or videos. Existing methods solve this problem under a strong assumption that all triplets that are to be detected would be available during training stage. However, in real scene, new HOIs may be introduced continuously, which requires the trained model to have the ability to identify new classes without forgetting old ones. Due to the limitations of storage, computing resources and the privacy of data, it is impractical to train the model from scratch using old and new data every time. In this paper, we propose a new HOI detection task scenario called Lifelong Learning Human-Object Interaction Detection (LL-HOI) which is more natural than the existing closed-world one and solve this problem in an incremental and contrastive learning manner (Fig. 1). Our method is composed of two stages according to under incremental setting or not: 1) identify humans, objects and actions in HOIs using backbone detector and contrastive learning and 2) incrementally learn new HOI classes without forgetting previously learned ones. Besides, to address the catastrophic forgetting problem, we propose a Feature Replay Network (FRN) based on contrastive learning to adaptively process the images conditioned on the incremental process. Extensive experiments on HICO-DET and HOI-W datasets demonstrate the effectiveness and superiority of our method on lifelong human-object interaction detection.
引用
下载
收藏
页码:582 / 587
页数:6
相关论文
共 50 条
  • [1] Learning Human-Object Interaction Detection using Interaction Points
    Wang, Tiancai
    Yang, Tong
    Danelljan, Martin
    Khan, Fahad Shahbaz
    Zhang, Xiangyu
    Sun, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4115 - 4124
  • [2] Relational Context Learning for Human-Object Interaction Detection
    Kim, Sanghyun
    Jung, Deunsol
    Cho, Minsu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2925 - 2934
  • [3] Affordance Transfer Learning for Human-Object Interaction Detection
    Hou, Zhi
    Yu, Baosheng
    Qiao, Yu
    Peng, Xiaojiang
    Tao, Dacheng
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 495 - 504
  • [4] Learning Human-Object Interaction Detection via Deformable Transformer
    Cai, Shuang
    Ma, Shiwei
    Gu, Dongzhou
    2021 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2021, 12076
  • [5] Human-Object Interaction Detection: An Overview
    Wang J.
    Shuai H.
    Li Y.
    Cheng W.
    IEEE Consumer Electronics Magazine, 2024, 13 (06) : 1 - 14
  • [6] A Survey of Human-Object Interaction Detection
    Gong X.
    Zhang Z.
    Liu L.
    Ma B.
    Wu K.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704
  • [7] Compositional Learning in Transformer-Based Human-Object Interaction Detection
    Zhuang, Zikun
    Qian, Ruihao
    Xie, Chi
    Liang, Shuang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1038 - 1043
  • [8] From detection to understanding: A survey on representation learning for human-object interaction
    Luo, Tianlun
    Guan, Steven
    Yang, Rui
    Smith, Jeremy
    NEUROCOMPUTING, 2023, 543
  • [9] Improving Human-Object Interaction Detection via Virtual Image Learning
    Fang, Shuman
    Liu, Shuai
    Li, Jie
    Jiang, Guannan
    Lin, Xianming
    Ji, Rongrong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5455 - 5463
  • [10] An Improved Human-Object Interaction Detection Network
    Gao, Song
    Wang, Hongyu
    Song, Jilai
    Xu, Fang
    Zou, Fengshan
    PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 192 - 196