Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

被引:2
|
作者
Pang, Bo [1 ]
Zha, Kaiwen [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPRW.2018.00308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the first benchmark for a new problem - recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We demonstrate some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of simultaneously emerging actions in each video. We commit an in-depth analysis on the implementation of current effective models in action recognition and image captioning on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performances and receives relatively promising results.
引用
收藏
页码:2388 / 2397
页数:10
相关论文
共 50 条
  • [41] Ensemble Three-Stream RGB-S Deep Neural Network for Human Behavior Recognition Under Intelligent Home Service Robot Environments
    Byeon, Yeong-Hyeon
    Kim, Dohyung
    Lee, Jaeyeon
    Kwak, Keun-Chang
    IEEE ACCESS, 2021, 9 : 73240 - 73250
  • [42] Facial micro-expression recognition using three-stream vision transformer network with sparse sampling and relabeling
    Zhang, He
    Yin, Lu
    Zhang, Hanling
    Wu, Xuesong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3761 - 3771
  • [43] Facial micro-expression recognition using three-stream vision transformer network with sparse sampling and relabeling
    He Zhang
    Lu Yin
    Hanling Zhang
    Xuesong Wu
    Signal, Image and Video Processing, 2024, 18 : 3761 - 3771
  • [44] Aeriform in-action: A novel dataset for human action recognition in aerial videos
    Kapoor, Surbhi
    Sharma, Akashdeep
    Verma, Amandeep
    Singh, Sarbjeet
    PATTERN RECOGNITION, 2023, 140
  • [45] Spatio-Temporal Action Localization for Human Action Recognition in Large Dataset
    Megrhi, Sameh
    Jmal, Marwa
    Beghdadi, Azeddine
    Mseddi, Wided
    VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS 2015, 2015, 9407
  • [46] Calculation of stationary probabilities for a three-stream model of control of the access to the resources of a wireless wideband network with hystereses
    I. I. Tsitovich
    A. V. Chernushevich
    Journal of Communications Technology and Electronics, 2011, 56 : 1543 - 1551
  • [47] Benchmarking a Multimodal and Multiview and Interactive Dataset for Human Action Recognition
    Liu, An-An
    Xu, Ning
    Nie, Wei-Zhi
    Su, Yu-Ting
    Wong, Yongkang
    Kankanhalli, Mohan
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (07) : 1781 - 1794
  • [48] The Johns Hopkins University Multimodal Dataset for Human Action Recognition
    Murray, Thomas S.
    Mendat, Daniel R.
    Pouliquen, Philippe O.
    Andreou, Andreas G.
    RADAR SENSOR TECHNOLOGY XIX; AND ACTIVE AND PASSIVE SIGNATURES VI, 2015, 9461
  • [49] A large-scale fMRI dataset for human action recognition
    Zhou, Ming
    Gong, Zhengxin
    Dai, Yuxuan
    Wen, Yushan
    Liu, Youyi
    Zhen, Zonglei
    SCIENTIFIC DATA, 2023, 10 (01)
  • [50] A large-scale fMRI dataset for human action recognition
    Ming Zhou
    Zhengxin Gong
    Yuxuan Dai
    Yushan Wen
    Youyi Liu
    Zonglei Zhen
    Scientific Data, 10