Multi-stream neural network fused with local information and global information for HOI detection

被引:0
|
作者
Limin Xia
Rui Li
机构
[1] Central South University,School of Automation
来源
Applied Intelligence | 2020年 / 50卷
关键词
Human-object interactions; Global contextual information; Local region information; Information fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Human-Object Interaction (HOI) Detection is a new genre of human-centric visual relationship detection task, which is significant to deep understanding of visual scenes. Due to the complexity of the visual scene in the image, HOI detection is still a challenging task, the most critical part of which is feature extraction and representation. Some existing approaches rely solely on local region information for HOI detection without using global contextual information, but global contextual information contributes to this task in some HOI categories. Other approaches incorporate global contextual information for HOI detection while losing local region information. In this work, we propose a multi-stream neural network architecture composed of three special module that employs both local region information and global contextual information for HOI detection. This model can detect not only the HOI categories based on local region information but also on global contextual information. Our model more fully considers all HOI categories in the dataset. Compared with other existing approaches, the proposed model shows improved performance on V-COCO and HICO-DET benchmark datasets, especially when predicting rare HOI categories.
引用
收藏
页码:4495 / 4505
页数:10
相关论文
共 50 条
  • [1] Multi-stream neural network fused with local information and global information for HOI detection
    Xia, Limin
    Li, Rui
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4495 - 4505
  • [2] Multi-stream Information-Based Neural Network for Mammogram Mass Segmentation
    Li, Zhilin
    Deng, Zijian
    Chen, Li
    Gui, Yu
    Cai, Zhigang
    Liao, Jianwei
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 267 - 278
  • [3] An analysis of information segregation in parallel streams of a multi-stream convolutional neural network
    Tamura, Hiroshi
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [4] Combining Information from Multi-Stream Features Using Deep Neural Network in Speech Recognition
    Zhou, Pan
    Dai, Lirong
    Liu, Qingfeng
    Jiang, Hui
    [J]. PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 557 - +
  • [5] Research on Grey Modeling for Multi-stream Information
    Liu, Xin
    Dai, Jin
    Zhou, Weijie
    [J]. JOURNAL OF GREY SYSTEM, 2016, 28 (04): : 127 - 137
  • [6] Employing heterogeneous information in a multi-stream framework
    Christensen, H
    Lindberg, B
    Andersen, O
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1571 - 1574
  • [7] A Multi-Stream Recurrent Neural Network for Social Role Detection in Multiparty Interactions
    Zhang, Lingyu
    Radke, Richard J.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (03) : 554 - 567
  • [8] Low-light image enhancement network based on multi-stream information supplement
    Yang, Yong
    Hu, Wei
    Huang, Shuying
    Tu, Wei
    Wan, Weiguo
    [J]. MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2022, 33 (03) : 711 - 723
  • [9] Low-light image enhancement network based on multi-stream information supplement
    Yong Yang
    Wei Hu
    Shuying Huang
    Wei Tu
    Weiguo Wan
    [J]. Multidimensional Systems and Signal Processing, 2022, 33 : 711 - 723
  • [10] MULTI-STREAM REGION PROPOSAL NETWORK FOR PEDESTRIAN DETECTION
    Lei, Jianjun
    Chen, Yue
    Peng, Bo
    Huang, Qingming
    Ling, Nam
    Hou, Chunping
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,