TwinLSTM: Two-channel LSTM Network for Online Action Detection

被引:4
|
作者
Han, Yunfei [1 ]
Tan, Shan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
关键词
D O I
10.1109/ICPR56361.2022.9956717
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online Action Detection (OAD) has attracted more and more attention in recent years. A network for OAD generally consists of three parts: a frame-level feature extractor, a temporal modeling module, and an action classifier. Most recent OAD networks use a single-channel Recurrent Neural Network (RNN) to capture long-term history information, with spatial and temporal features concatenated as network input. In OAD, spatial features describe object appearance and scene configuration within each frame while temporal features capture motion cues over time. It is crucial to effectively fuse both spatial and temporal features. In this paper, we propose a new framework named TwinLSTM based on two-channel Long Short-Term Memory (LSTM) network for OAD, in which each channel is used to extract and handle either spatial features or temporal features. To more effectively fuse both spatial and temporal features, we design a prediction fusion module (PFM) to utilize hidden states of both channels to obtain more action content, including information interaction and future context prediction. We evaluate TwinLSTM on two challenging datasets: THUMOS14 and HDD. Experiments show that TwinLSTM outperforms existing single-channel models by a significant margin. We also show the effectiveness of PFM through comprehensive ablation studies.
引用
收藏
页码:3310 / 3317
页数:8
相关论文
共 50 条
  • [1] A two-channel optical downconverter for phase detection
    Biernacki, PD
    Nichols, LT
    Enders, DG
    Williams, KJ
    Esman, RD
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 1998, 46 (11) : 1784 - 1787
  • [2] Two-Channel Passive Detection of Cyclostationary Signals
    Horstmann, Stefanie
    Ramirez, David
    Schreier, Peter J.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 2340 - 2355
  • [3] Two-Channel Passive Detection Exploiting Cyclostationarity
    Horstmann, Stefanie
    Ramirez, David
    Schreier, Peter J.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [4] Modeling temporal structure with LSTM for online action detection
    De Geest, Roeland
    Tuytelaars, Tinne
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1549 - 1557
  • [5] An online open circuit faults diagnosis method for converter using the lightweight two-channel deep network
    Zhao, Shaishai
    Chen, Jianfei
    Zhang, Chaolong
    He, Yigang
    MEASUREMENT, 2025, 243
  • [6] Two-channel decentralized integral-action controller design
    Gündes, AN
    Özguler, AB
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (12) : 2084 - 2088
  • [7] Polarized object detection in crabs: a two-channel system
    Ailin Basnak, Melanie
    Perez-Schuster, Veronica
    Hermitte, Gabriela
    Beron de Astrada, Martin
    JOURNAL OF EXPERIMENTAL BIOLOGY, 2018, 221 (10):
  • [8] Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network
    Li, Wenfa
    Xu, Yilong
    Wang, Gongming
    IEEE ACCESS, 2019, 7 : 145944 - 145952
  • [9] Two-channel spectroheliograph
    Nikulin, I.F.
    Pribory i Tekhnika Eksperimenta, 1995, (02): : 148 - 151
  • [10] Research on an Two-Channel ACNN-LSTM Model for Financial Text Sentiment Analysis
    Shi, Hanxiao
    You, Liqiang
    Ren, Mimi
    Li, Xiaojun
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 200 - 205