Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

Cited by: 2

Authors:

Pang, Bo [1]

Zha, Kaiwen [1]

Lu, Cewu [1]

Affiliations:

[1] Shanghai Jiao Tong University, Shanghai, People's Republic of China

Source:

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018


DOI:

10.1109/CVPRW.2018.00308

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We introduce the first benchmark for a new problem, recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We demonstrate some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of simultaneously emerging actions in each video. We conduct an in-depth analysis of how current effective models from action recognition and image captioning perform on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performance and yields relatively promising results.


Pages: 2388-2397

Page count: 10
