An Attention-based Activity Recognition for Egocentric Video

被引:40
|
作者
Matsuo, Kenji [1 ]
Yamada, Kentaro [1 ]
Ueno, Satoshi [1 ]
Naito, Sei [1 ]
机构
[1] KDDI R&D Labs Inc, Fujimino, Saitama, Japan
关键词
VISUAL-ATTENTION; MODEL;
D O I
10.1109/CVPRW.2014.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a human activity recognition method from first-person videos, which provides a supplementary method to improve the recognition accuracy. Conventional methods detect objects and derive a user's behavior based on their taxonomy. One of the recent works has achieved accuracy improvement by determining key objects based on hand manipulation. However, such manipulation-based approach has a restriction on applicable scenes and object types because the user's hands don't always present significant information. In contrast, our proposed attention-based approach provides a solution to detect visually salient objects as key objects in a non-contact manner. Experimental results show that the proposed method classifies first-person actions more accurately than the previous method by 6.4 percentage points and its average accuracy reaches 43.3%.
引用
收藏
页码:565 / +
页数:3
相关论文
共 50 条
  • [31] Describing Video With Attention-Based Bidirectional LSTM
    Bin, Yi
    Yang, Yang
    Shen, Fumin
    Xie, Ning
    Shen, Heng Tao
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (07) : 2631 - 2641
  • [32] Residual attention-based LSTM for video captioning
    Xiangpeng Li
    Zhilong Zhou
    Lijiang Chen
    Lianli Gao
    [J]. World Wide Web, 2019, 22 : 621 - 636
  • [33] Residual attention-based LSTM for video captioning
    Li, Xiangpeng
    Zhou, Zhilong
    Chen, Lijiang
    Gao, Lianli
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 621 - 636
  • [34] The effect of using video title in attention-based video summarization
    Li, Changwei
    Yeh, Zhi-Ting
    Gunuganti, Jeshmitha
    Chang, Jia-Bin
    Norouzi, Mehdi
    [J]. 2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [35] STA-HAR: A Spatiotemporal Attention-Based Framework for Human Activity Recognition
    Khaliluzzaman, Md.
    Furquan, Md.
    Khan, Mohammod Sazid Zaman
    Hoque, Md. Jiabul
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2024, 2024
  • [36] An Online Attention-Based Model for Speech Recognition
    Fan, Ruchao
    Zhou, Pan
    Chen, Wei
    Jia, Jia
    Liu, Gang
    [J]. INTERSPEECH 2019, 2019, : 4390 - 4394
  • [37] A Neural Autoregressive Approach to Attention-based Recognition
    Zheng, Yin
    Zemel, Richard S.
    Zhang, Yu-Jin
    Larochelle, Hugo
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 67 - 79
  • [38] A Neural Autoregressive Approach to Attention-based Recognition
    Yin Zheng
    Richard S. Zemel
    Yu-Jin Zhang
    Hugo Larochelle
    [J]. International Journal of Computer Vision, 2015, 113 : 67 - 79
  • [39] Significance of handcrafted features in human activity recognition with attention-based RNN models
    Abraham, Sonia
    James, Rekha K.
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (10) : 1151 - 1163
  • [40] Attention-Based Deep Learning Framework for Human Activity Recognition With User Adaptation
    Buffelli, Davide
    Vandin, Fabio
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (12) : 13474 - 13483