Structured LSTM for Human-Object Interaction Detection and Anticipation

被引:0
|
作者
Anh Minh Truong [1 ]
Yoshitaka, Atsuo [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding human activities is one of the important tasks in. computer vision. Although a lot of efforts have been made, recognizing complex human activities such as human-object interactions remains challenging. In general, human-object interactions can be considered as a temporal sequence with the transition in relationships of humans and objects over the time. Recently, many studies have shown sequential learning power of Long Short-Term Memory (ISM) for long-term temporal dependency problems. In this work, we focus on the problem of modeling spatio-temporal relationships between objects and humans with LSTM for detecting and anticipating human-object interactions in daily life. Instead of considering only how human pose and human-object relations change during human activity, we also take the impact of human activity on the state of objects and the relationships between objects into account for labeling human-object interaction. We evaluated our method on a challenging human-object interaction dataset consisting of 120 videos with different high-level activities, sub-activities and object affordances, The experimental results showed the significant improvements in both detecting and anticipating interaction activities.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Survey of Human-Object Interaction Detection
    Gong X.
    Zhang Z.
    Liu L.
    Ma B.
    Wu K.
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704
  • [2] Human-Object Interaction Detection: An Overview
    Wang, Jia
    Shuai, Hong-Han
    Li, Yung-Hui
    Cheng, Wen-Huang
    [J]. IEEE Consumer Electronics Magazine, 2024, 13 (06) : 56 - 72
  • [3] An Improved Human-Object Interaction Detection Network
    Gao, Song
    Wang, Hongyu
    Song, Jilai
    Xu, Fang
    Zou, Fengshan
    [J]. PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 192 - 196
  • [4] Human-object interaction detection with missing objects
    Kogashi, Kaen
    Wu, Yang
    Nobuhara, Shohei
    Nishino, Ko
    [J]. IMAGE AND VISION COMPUTING, 2021, 113
  • [5] Distance Matters in Human-Object Interaction Detection
    Wang, Guangzhi
    Guo, Yangyang
    Wong, Yongkang
    Kankanhalli, Mohan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4546 - 4554
  • [6] Agglomerative Transformer for Human-Object Interaction Detection
    Tu, Danyang
    Sun, Wei
    Zhai, Guangtao
    Shen, Wei
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21557 - 21567
  • [7] Diagnosing Rarity in Human-object Interaction Detection
    Kilickaya, Mert
    Smeulders, Arnold
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3956 - 3960
  • [8] Parallel Queries for Human-Object Interaction Detection
    Chen, Junwen
    Yanai, Keiji
    [J]. PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [9] Lifelong Learning for Human-Object Interaction Detection
    Sun, Bo
    Lu, Sixu
    He, Jun
    Yu, Lejun
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 582 - 587
  • [10] Human-Object Interaction Detection with Missing Objects
    Kogashi, Kaen
    Wu, Yang
    Nobuhara, Shohei
    Nishino, Ko
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,