Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

被引：2

作者：

Pang, Bo ^{[1
]}

Zha, Kaiwen ^{[1
]}

Lu, Cewu ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

来源：

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018年

关键词：

D O I：

10.1109/CVPRW.2018.00308

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce the first benchmark for a new problem - recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We demonstrate some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of simultaneously emerging actions in each video. We commit an in-depth analysis on the implementation of current effective models in action recognition and image captioning on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performances and receives relatively promising results.

引用

页码：2388 / 2397

页数：10

共 50 条

[1] Three-stream CNNs for action recognition
Wang, Liangliang
Ge, Lianzheng
Li, Ruifeng
Fang, Yajun
PATTERN RECOGNITION LETTERS, 2017, 92 : 33 - 40
[2] Multi-Modal Three-Stream Network for Action Recognition
Khalid, Muhammad Usman
Yu, Jie
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3210 - 3215
[3] Trajectory-aware three-stream CNN for video action recognition
Weng, Zhengkui
Guan, Yepeng
JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
[4] Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN
Shi, Yemin
Tian, Yonghong
Wang, Yaowei
Huang, Tiejun
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1510 - 1520
[5] Visual Scene Induced Three-stream Network for Efficient Action Recognition
He, Jun
Zhao, Xiaochong
Sun, Bo
Yu, Xiaocui
Zhang, Yinghui
2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 550 - 554
[6] Three-Stream Graph Convolutional Networks for Zero-Shot Action Recognition
Wu, Nan
Kawamoto, Kazuhiko
2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 392 - 396
[7] Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks
Wu, Nan
Kawamoto, Kazuhiko
SENSORS, 2021, 21 (11)
[8] Beyond Two-stream: Skeleton-based Three-stream Networks for Action Recognition in Videos
Xu, Jianfeng
Tasaka, Kazuyuki
Yanagihara, Hiromasa
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1567 - 1573
[9] Semantic three-stream network for social relation recognition
Yan, Haibin
Song, Chaohui
PATTERN RECOGNITION LETTERS, 2019, 128 : 78 - 84
[10] Multi-level Three-Stream Convolutional Networks for Video-Based Action Recognition
Lv, Yijing
Zheng, Huicheng
Zhang, Wei
PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 237 - 249

← 1 2 3 4 5 →