A Spatio-Temporal CRF for Human Interaction Understanding

被引:30
|
作者
Wang, Zhenhua [1 ]
Liu, Sheng [2 ]
Zhang, Jianhua [3 ]
Chen, Shengyong [2 ,4 ]
Guan, Qiu [2 ]
机构
[1] Zhejiang Univ Technol, Sch Comp Sci, Hangzhou 310014, Zhejiang, Peoples R China
[2] Zhejiang Univ Technol, Dept Comp Sci, Hangzhou 310014, Zhejiang, Peoples R China
[3] Zhejiang Univ Technol, Coll Comp Sci, Hangzhou 310014, Zhejiang, Peoples R China
[4] Tianjin Univ Technol, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional random fields (CRFs); human action recognition (HAR); interaction; video understanding; ACTION RECOGNITION;
D O I
10.1109/TCSVT.2016.2539699
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A better understanding of human interactions in videos can be achieved by simultaneously considering the coarse interactions between people, the action of each individual, and the activity of all people as a whole. We divide the recognition task into two stages. The first stage discriminates interactions and noninteractions, actions and activities based on local image information, while during the second stage, actions and activities are recognized in a global manner based on the local recognition results. A conditional random field (CRF) is designed to model human interactions in the spatio-temporal space. Different from most existing global models which cover either action or activity variables only, our model covers them both by considering the interactions between different types of variables. The graph structure of the CRF is predicted by a model learned from training data, which is different from traditional graph construction methods that typically rely on human heuristics. We learn the parameters of the CRF via structured support vector machine. We propose an efficient inference algorithm to tackle the estimation of labels in long videos containing many people. Our model admits both semantic-level understanding of human interactions in videos and competitive action and activity recognition performance.
引用
收藏
页码:1647 / 1660
页数:14
相关论文
共 50 条
  • [1] Overtaking Vehicle Detection Using A Spatio-temporal CRF
    Zhang, Xuetao
    Jiang, Peilin
    Wang, Fei
    2014 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2014, : 338 - 343
  • [2] Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning
    Fan, Lifeng
    Wang, Wenguan
    Huang, Siyuan
    Tang, Xinyu
    Zhu, Song-Chun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5723 - 5732
  • [3] Understanding the Human Brain Via its Spatio-temporal Properties
    Wolfson, Ouri
    26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 85 - 88
  • [4] Understanding Spatio-Temporal Urban Processes
    Rocha, Lais M. A.
    Bessa, Aline
    Chirigati, Fernando
    OFriel, Eugene
    Moro, Mirella M.
    Freire, Juliana
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 563 - 572
  • [5] UNDERSTANDING THE SPATIO-TEMPORAL PATTERN OF TWEETS
    Li, Yue
    Shan, Jie
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2013, 79 (09): : 769 - 773
  • [6] Understanding the spatio-temporal pattern of tweets
    1600, American Society for Photogrammetry and Remote Sensing, 5410 Grosvenor Lane, Suite 210, Bethesda, MD 20814-2160, United States (79):
  • [7] Learning Bag of Spatio-Temporal Features for Human Interaction Recognition
    Slimani, Khadidja Nour El Houda
    Benezeth, Yannick
    Souami, Feryel
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [8] GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory Prediction
    Wang, Chengxin
    Cai, Shaofeng
    An, Gary
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3449 - 3458
  • [9] Human Interaction Recognition Using Improved Spatio-Temporal Features
    Sivarathinabala, M.
    Abirami, S.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 191 - 199
  • [10] Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network
    Xing, Hao
    Burschka, Darius
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 5195 - 5201