Exploiting Human Pose and Scene Information for Interaction Detection

被引:1
|
作者
Waheed, Manahil [1 ]
Chelloug, Samia Allaoua [2 ]
Shorfuzzaman, Mohammad [3 ]
Alsufyani, Abdulmajeed [3 ]
Jalal, Ahmad [1 ]
Alnowaiser, Khaled [4 ]
Park, Jeongmin [5 ]
机构
[1] Air Univ, Dept Comp Sci, Islamabad 44000, Pakistan
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia
[3] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, Taif 21944, Saudi Arabia
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Engn, Al Kharj 11942, Saudi Arabia
[5] Korea Polytech Univ, Dept Comp Engn, 237, Siheung si, Gyeonggi, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 74卷 / 03期
关键词
Artificial intelligence; daily activities; human interactions; human; pose information; image foresting transform; scene feature information; ACTIONLET ENSEMBLE; ACTION RECOGNITION; CLASSIFICATION;
D O I
10.32604/cmc.2023.033769
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying human actions and interactions finds its use in many areas, such as security, surveillance, assisted living, patient monitoring, rehabilitation, sports, and e-learning. This wide range of applications has attracted many researchers to this field. Inspired by the existing recognition systems, this paper proposes a new and efficient human-object interaction recognition (HOIR) model which is based on modeling human pose and scene feature information. There are different aspects involved in an interaction, including the humans, the objects, the various body parts of the human, and the background scene. The main objectives of this research include critically examining the importance of all these elements in determining the interaction, estimating human pose through image foresting transform (IFT), and detecting the performed interactions based on an optimized multi-feature vector. The proposed methodology has six main phases. The first phase involves preprocessing the images. During preprocessing stages, the videos are converted into image frames. Then their contrast is adjusted, and noise is removed. In the second phase, the human-object pair is detected and extracted from each image frame. The third phase involves the identification of key body parts of the detected humans using IFT. The fourth phase relates to three different kinds of feature extraction techniques. Then these features are combined and optimized during the fifth phase. The optimized vector is used to classify the interactions in the last phase. The MSR Daily Activity 3D dataset has been used to test this model and to prove its efficiency. The proposed system obtains an average accuracy of 91.7% on this dataset.
引用
收藏
页码:5853 / 5870
页数:18
相关论文
共 50 条
  • [1] Exploiting Scene Graphs for Human-Object Interaction Detection
    He, Tao
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15964 - 15973
  • [2] Exploiting colour information for better scene text detection and recognition
    Fraz, Muhammad
    Sarfraz, M. Saquib
    Edirisinghe, Eran A.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) : 153 - 167
  • [3] Exploiting colour information for better scene text detection and recognition
    Muhammad Fraz
    M. Saquib Sarfraz
    Eran A. Edirisinghe
    International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 153 - 167
  • [4] Exploiting Temporal Information for 3D Human Pose Estimation
    Hossain, Mir Rayat Imtiaz
    Little, James J.
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
  • [5] Using human pose information for handgun detection
    Velasco-Mata, Alberto
    Ruiz-Santaquiteria, Jesus
    Vallez, Noelia
    Deniz, Oscar
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (24): : 17273 - 17286
  • [6] Using human pose information for handgun detection
    Alberto Velasco-Mata
    Jesus Ruiz-Santaquiteria
    Noelia Vallez
    Oscar Deniz
    Neural Computing and Applications, 2021, 33 : 17273 - 17286
  • [7] Scene specific people detection by simple human interaction
    Munaro, Matteo
    Cenedese, Angelo
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [8] Interact-Pose Datasets for 2D Human Pose Estimation in Multi-person Interaction Scene
    Jiang, Yifei
    Gao, Hao
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT II, 2022, 1701 : 211 - 223
  • [9] Exploiting Scene Cues for Dropped Object Detection
    Lopez-Mendez, Adolfo
    Monay, Florent
    Odobez, Jean-Marc
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 14 - 21
  • [10] Driver Fatigue Detection Method Based on Human Pose Information Entropy
    Li, Taiguo
    Zhang, Tiance
    Zhang, Yingzhi
    Yang, Liben
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022