Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

被引:2
|
作者
Pang, Bo [1 ]
Zha, Kaiwen [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPRW.2018.00308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the first benchmark for a new problem - recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We demonstrate some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of simultaneously emerging actions in each video. We commit an in-depth analysis on the implementation of current effective models in action recognition and image captioning on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performances and receives relatively promising results.
引用
收藏
页码:2388 / 2397
页数:10
相关论文
共 50 条
  • [21] First-Person Activity Recognition Based on Three-Stream Deep Features
    Kim, Ye-Ji
    Lee, Dong-Gyu
    Lee, Seong-Whan
    2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 297 - 299
  • [22] Three-Stream Convolutional Neural Network with Multi-task and Ensemble Learning for 3D Action Recognition
    Liang, Duohan
    Fan, Guoliang
    Lin, Guangfeng
    Chen, Wanjun
    Pan, Xiaorong
    Zhu, Hong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 934 - 940
  • [23] 3 s-STNet: three-stream spatial–temporal network with appearance and skeleton information learning for action recognition
    Ming Fang
    Siyu Peng
    Yang Zhao
    Haibo Yuan
    Chih-Cheng Hung
    Shuhua Liu
    Neural Computing and Applications, 2023, 35 : 1835 - 1848
  • [24] Human action recognition on depth dataset
    Gao, Zan
    Zhang, Hua
    Liu, Anan A.
    Xu, Guangping
    Xue, Yanbing
    NEURAL COMPUTING & APPLICATIONS, 2016, 27 (07): : 2047 - 2054
  • [25] Human action recognition on depth dataset
    Zan Gao
    Hua Zhang
    Anan A. Liu
    Guangping Xu
    Yanbing Xue
    Neural Computing and Applications, 2016, 27 : 2047 - 2054
  • [26] Improved two-stream model for human action recognition
    Zhao, Yuxuan
    Man, Ka Lok
    Smith, Jeremy
    Siddique, Kamran
    Guan, Sheng-Uei
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [27] 3 s-STNet: three-stream spatial-temporal network with appearance and skeleton information learning for action recognition
    Fang, Ming
    Peng, Siyu
    Zhao, Yang
    Yuan, Haibo
    Hung, Chih-Cheng
    Liu, Shuhua
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1835 - 1848
  • [28] Improved two-stream model for human action recognition
    Yuxuan Zhao
    Ka Lok Man
    Jeremy Smith
    Kamran Siddique
    Sheng-Uei Guan
    EURASIP Journal on Image and Video Processing, 2020
  • [29] Multiple stream deep learning model for human action recognition
    Gu, Ye
    Ye, Xiaofeng
    Sheng, Weihua
    Ou, Yongsheng
    Li, Yongqiang
    IMAGE AND VISION COMPUTING, 2020, 93
  • [30] Towards heart infarction detection via image-based dataset and three-stream fusion framework
    Zhong, Chuyi
    Yang, Dingkang
    Wang, Shunli
    Zhang, Lihua
    COMPUTER COMMUNICATIONS, 2024, 215 : 21 - 28