Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

Cited by: 2

Authors:

Pang, Bo [1]

Zha, Kaiwen [1]

Lu, Cewu [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

Source:

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018

Keywords:

DOI:

10.1109/CVPRW.2018.00308

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We introduce the first benchmark for a new problem, recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We describe some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of the actions that occur simultaneously in each video. We conduct an in-depth analysis of how current effective action recognition and image captioning models perform on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performance and relatively promising results.


Pages: 2388-2397

Page count: 10

