An Attention-based Activity Recognition for Egocentric Video

被引:40
|
作者
Matsuo, Kenji [1 ]
Yamada, Kentaro [1 ]
Ueno, Satoshi [1 ]
Naito, Sei [1 ]
机构
[1] KDDI R&D Labs Inc, Fujimino, Saitama, Japan
关键词
VISUAL-ATTENTION; MODEL;
D O I
10.1109/CVPRW.2014.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a human activity recognition method from first-person videos, which provides a supplementary method to improve the recognition accuracy. Conventional methods detect objects and derive a user's behavior based on their taxonomy. One of the recent works has achieved accuracy improvement by determining key objects based on hand manipulation. However, such manipulation-based approach has a restriction on applicable scenes and object types because the user's hands don't always present significant information. In contrast, our proposed attention-based approach provides a solution to detect visually salient objects as key objects in a non-contact manner. Experimental results show that the proposed method classifies first-person actions more accurately than the previous method by 6.4 percentage points and its average accuracy reaches 43.3%.
引用
收藏
页码:565 / +
页数:3
相关论文
共 50 条
  • [41] A Neural Autoregressive Approach to Attention-based Recognition
    Yin Zheng
    Richard S. Zemel
    Yu-Jin Zhang
    Hugo Larochelle
    [J]. International Journal of Computer Vision, 2015, 113 : 67 - 79
  • [42] Attention-Based Deep Learning Framework for Human Activity Recognition With User Adaptation
    Buffelli, Davide
    Vandin, Fabio
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (12) : 13474 - 13483
  • [43] Attention-based Gender Recognition on Masked Faces
    Carletti, Vincenzo
    Greco, Antonio
    Saggese, Alessia
    Vento, Mario
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 672 - 678
  • [44] Preliminary Investigation of Object-based Activity Recognition Using Egocentric Video Based on Web Knowledge
    Nakatani, Tomoya
    Kuga, Ryohei
    Maekawa, Takuya
    [J]. 17TH INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS MULTIMEDIA (MUM 2018), 2018, : 375 - 381
  • [45] Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition
    Garg, Mallika
    Ghosh, Debashis
    Pradhan, Pyari Mohan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 80 - 84
  • [46] Video Captioning With Attention-Based LSTM and Semantic Consistency
    Gao, Lianli
    Guo, Zhao
    Zhang, Hanwang
    Xu, Xing
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2045 - 2055
  • [47] Attention-Based Video Hashing for Large-Scale Video Retrieval
    Wang, Yingxin
    Nie, Xiushan
    Shi, Yang
    Zhou, Xin
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (03) : 491 - 502
  • [48] Visual attention-based deepfake video forgery detection
    Ganguly, Shreyan
    Mohiuddin, Sk
    Malakar, Samir
    Cuevas, Erik
    Sarkar, Ram
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (04) : 981 - 992
  • [49] Motional foreground attention-based video crowd counting
    Ling, Miaogen
    Pan, Tianhang
    Ren, Yi
    Wang, Ke
    Geng, Xin
    [J]. PATTERN RECOGNITION, 2023, 144
  • [50] Visual attention-based deepfake video forgery detection
    Shreyan Ganguly
    Sk Mohiuddin
    Samir Malakar
    Erik Cuevas
    Ram Sarkar
    [J]. Pattern Analysis and Applications, 2022, 25 : 981 - 992