Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

被引:2
|
作者
Pang, Bo [1 ]
Zha, Kaiwen [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPRW.2018.00308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the first benchmark for a new problem - recognizing human action adverbs (HAA): "Adverbs Describing Human Actions" (ADHA). We demonstrate some key features of ADHA: a semantically complete set of adverbs describing human actions, a set of common, describable human actions, and an exhaustive labelling of simultaneously emerging actions in each video. We commit an in-depth analysis on the implementation of current effective models in action recognition and image captioning on adverb recognition, and the results reveal that such methods are unsatisfactory. Furthermore, we propose a novel three-stream hybrid model to tackle the HAA problem, which achieves better performances and receives relatively promising results.
引用
收藏
页码:2388 / 2397
页数:10
相关论文
共 50 条
  • [1] Three-stream CNNs for action recognition
    Wang, Liangliang
    Ge, Lianzheng
    Li, Ruifeng
    Fang, Yajun
    PATTERN RECOGNITION LETTERS, 2017, 92 : 33 - 40
  • [2] Multi-Modal Three-Stream Network for Action Recognition
    Khalid, Muhammad Usman
    Yu, Jie
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3210 - 3215
  • [3] Trajectory-aware three-stream CNN for video action recognition
    Weng, Zhengkui
    Guan, Yepeng
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
  • [4] Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN
    Shi, Yemin
    Tian, Yonghong
    Wang, Yaowei
    Huang, Tiejun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1510 - 1520
  • [5] Visual Scene Induced Three-stream Network for Efficient Action Recognition
    He, Jun
    Zhao, Xiaochong
    Sun, Bo
    Yu, Xiaocui
    Zhang, Yinghui
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 550 - 554
  • [6] Three-Stream Graph Convolutional Networks for Zero-Shot Action Recognition
    Wu, Nan
    Kawamoto, Kazuhiko
    2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 392 - 396
  • [7] Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks
    Wu, Nan
    Kawamoto, Kazuhiko
    SENSORS, 2021, 21 (11)
  • [8] Beyond Two-stream: Skeleton-based Three-stream Networks for Action Recognition in Videos
    Xu, Jianfeng
    Tasaka, Kazuyuki
    Yanagihara, Hiromasa
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1567 - 1573
  • [9] Semantic three-stream network for social relation recognition
    Yan, Haibin
    Song, Chaohui
    PATTERN RECOGNITION LETTERS, 2019, 128 : 78 - 84
  • [10] Multi-level Three-Stream Convolutional Networks for Video-Based Action Recognition
    Lv, Yijing
    Zheng, Huicheng
    Zhang, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 237 - 249