Exploiting Human Pose and Scene Information for Interaction Detection

被引：1

作者：

Waheed, Manahil ^{[1
]}

Chelloug, Samia Allaoua ^{[2
]}

Shorfuzzaman, Mohammad ^{[3
]}

Alsufyani, Abdulmajeed ^{[3
]}

Jalal, Ahmad ^{[1
]}

Alnowaiser, Khaled ^{[4
]}

Park, Jeongmin ^{[5
]}

机构：

[1] Air Univ, Dept Comp Sci, Islamabad 44000, Pakistan

[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia

[3] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, Taif 21944, Saudi Arabia

[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Engn, Al Kharj 11942, Saudi Arabia

[5] Korea Polytech Univ, Dept Comp Engn, 237, Siheung si, Gyeonggi, South Korea

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 74卷 / 03期

关键词：

Artificial intelligence; daily activities; human interactions; human; pose information; image foresting transform; scene feature information; ACTIONLET ENSEMBLE; ACTION RECOGNITION; CLASSIFICATION;

D O I：

10.32604/cmc.2023.033769

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Identifying human actions and interactions finds its use in many areas, such as security, surveillance, assisted living, patient monitoring, rehabilitation, sports, and e-learning. This wide range of applications has attracted many researchers to this field. Inspired by the existing recognition systems, this paper proposes a new and efficient human-object interaction recognition (HOIR) model which is based on modeling human pose and scene feature information. There are different aspects involved in an interaction, including the humans, the objects, the various body parts of the human, and the background scene. The main objectives of this research include critically examining the importance of all these elements in determining the interaction, estimating human pose through image foresting transform (IFT), and detecting the performed interactions based on an optimized multi-feature vector. The proposed methodology has six main phases. The first phase involves preprocessing the images. During preprocessing stages, the videos are converted into image frames. Then their contrast is adjusted, and noise is removed. In the second phase, the human-object pair is detected and extracted from each image frame. The third phase involves the identification of key body parts of the detected humans using IFT. The fourth phase relates to three different kinds of feature extraction techniques. Then these features are combined and optimized during the fifth phase. The optimized vector is used to classify the interactions in the last phase. The MSR Daily Activity 3D dataset has been used to test this model and to prove its efficiency. The proposed system obtains an average accuracy of 91.7% on this dataset.

引用

页码：5853 / 5870

页数：18

共 50 条

[1] Exploiting Scene Graphs for Human-Object Interaction Detection
He, Tao
Gao, Lianli
Song, Jingkuan
Li, Yuan-Fang
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15964 - 15973
[2] Exploiting colour information for better scene text detection and recognition
Fraz, Muhammad
Sarfraz, M. Saquib
Edirisinghe, Eran A.
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) : 153 - 167
[3] Exploiting colour information for better scene text detection and recognition
Muhammad Fraz
M. Saquib Sarfraz
Eran A. Edirisinghe
International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 153 - 167
[4] Exploiting Temporal Information for 3D Human Pose Estimation
Hossain, Mir Rayat Imtiaz
Little, James J.
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 69 - 86
[5] Using human pose information for handgun detection
Velasco-Mata, Alberto
Ruiz-Santaquiteria, Jesus
Vallez, Noelia
Deniz, Oscar
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (24): : 17273 - 17286
[6] Using human pose information for handgun detection
Alberto Velasco-Mata
Jesus Ruiz-Santaquiteria
Noelia Vallez
Oscar Deniz
Neural Computing and Applications, 2021, 33 : 17273 - 17286
[7] Scene specific people detection by simple human interaction
Munaro, Matteo
Cenedese, Angelo
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
[8] Interact-Pose Datasets for 2D Human Pose Estimation in Multi-person Interaction Scene
Jiang, Yifei
Gao, Hao
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT II, 2022, 1701 : 211 - 223
[9] Exploiting Scene Cues for Dropped Object Detection
Lopez-Mendez, Adolfo
Monay, Florent
Odobez, Jean-Marc
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 14 - 21
[10] Driver Fatigue Detection Method Based on Human Pose Information Entropy
Li, Taiguo
Zhang, Tiance
Zhang, Yingzhi
Yang, Liben
JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022

← 1 2 3 4 5 →