Three-stream network with context convolution module for human-object interaction detection

被引:4
|
作者
Siadari, Thomhert S. [1 ,2 ]
Han, Mikyong [2 ]
Yoon, Hyunjin [1 ,2 ]
机构
[1] Univ Sci & Technol, ETRI Sch, ICT Major, Daejeon, South Korea
[2] Elect & Telecommun Res Inst, City & Transportat ICT Res Dept, Daejeon, South Korea
关键词
context convolution module; deep learning; HOI detection; human-object interactions; three-stream network;
D O I
10.4218/etrij.2019-0230
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human-object interaction (HOI) detection is a popular computer vision task that detects interactions between humans and objects. This task can be useful in many applications that require a deeper understanding of semantic scenes. Current HOI detection networks typically consist of a feature extractor followed by detection layers comprising small filters (eg, 1 x 1 or 3 x 3). Although small filters can capture local spatial features with a few parameters, they fail to capture larger context information relevant for recognizing interactions between humans and distant objects owing to their small receptive regions. Hence, we herein propose a three-stream HOI detection network that employs a context convolution module (CCM) in each stream branch. The CCM can capture larger contexts from input feature maps by adopting combinations of large separable convolution layers and residual-based convolution layers without increasing the number of parameters by using fewer large separable filters. We evaluate our HOI detection method using two benchmark datasets, V-COCO and HICO-DET, and demonstrate its state-of-the-art performance.
引用
收藏
页码:230 / 238
页数:9
相关论文
共 50 条
  • [1] Multi-stream Network for Human-object Interaction Detection
    Wang, Chang
    Sun, Jinyu
    Ma, Shiwei
    Lu, Yuqiu
    Liu, Wang
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (08)
  • [2] Three-stream interaction decoder network for RGB-thermal salient object detection
    Huo, Fushuo
    Zhu, Xuegui
    Li, Bingheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [3] An Improved Human-Object Interaction Detection Network
    Gao, Song
    Wang, Hongyu
    Song, Jilai
    Xu, Fang
    Zou, Fengshan
    [J]. PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 192 - 196
  • [4] Relational Context Learning for Human-Object Interaction Detection
    Kim, Sanghyun
    Jung, Deunsol
    Cho, Minsu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2925 - 2934
  • [5] Parallel disentangling network for human-object interaction detection
    Cheng, Yamin
    Duan, Hancong
    Wang, Chen
    Chen, Zhijun
    [J]. PATTERN RECOGNITION, 2024, 146
  • [6] Semantic Inference Network for Human-Object Interaction Detection
    Liu, Hongyi
    Mo, Lisha
    Ma, Huimin
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 518 - 529
  • [7] Hierarchical Reasoning Network for Human-Object Interaction Detection
    Gao, Yiming
    Kuang, Zhanghui
    Li, Guanbin
    Zhang, Wayne
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (8306-8317) : 8306 - 8317
  • [8] HOKEM: HUMAN AND OBJECT KEYPOINT-BASED EXTENSION MODULE FOR HUMAN-OBJECT INTERACTION DETECTION
    Ito, Yoshiki
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 271 - 275
  • [9] ERNet: An Efficient and Reliable Human-Object Interaction Detection Network
    Lim, JunYi
    Baskaran, Vishnu Monn
    Lim, Joanne Mun-Yee
    Wong, KokSheik
    See, John
    Tistarelli, Massimo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 964 - 979
  • [10] Polysemy Deciphering Network for Robust Human-Object Interaction Detection
    Zhong, Xubin
    Ding, Changxing
    Qu, Xian
    Tao, Dacheng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1910 - 1929