Attention-based encoder-decoder networks for workflow recognition

被引:0
|
作者
Min Zhang
Haiyang Hu
Zhongjin Li
Jie Chen
机构
[1] Hangzhou Dianzi University,School of Computer Science and Technology
来源
关键词
Workflow recognition; Activity detection; Temporal action localization;
D O I
暂无
中图分类号
学科分类号
摘要
Behavior recognition is a fundamental yet challenging task in intelligent surveillance system, which plays an increasingly important role in the process of “Industry 4.0”. However, monitoring the workflow of both workers and machines in production procedure is quite difficult in complex industrial environments. In this paper, we propose a novel workflow recognition framework to recognize the behavior of working subjects based on the well-designed encoder-decoder structure. Namely, attention-based workflow recognition framework, termed as AWR. To improve the accuracy of workflow recognition, a temporal attention cell (AttCell) is introduced to draw dynamic attention distribution in the last stage of the framework. In addition, a Rough-to-Refine phase localization model is exploited to improve localization accuracy, which can effectively identify the boundaries of a specific phase instance in long untrimmed videos. Comprehensive experiments indicate a 1.4% mAP@IoU= 0.4 boost on THUMOS’14 dataset and a 3.4% mAP@IoU= 0.4 boost on hand-crafted workflow dataset detection challenge compared to the advanced GTAN pipeline respectively. More remarkably, the effectiveness of the workflow recognition system is validated in a real-world production scenario.
引用
收藏
页码:34973 / 34995
页数:22
相关论文
共 50 条
  • [21] Attention-Based Personalized Encoder-Decoder Model for Local Citation Recommendation
    Yang, Libin
    Zhang, Zeqing
    Cai, Xiaoyan
    Dai, Tao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [22] Modeling User Session and Intent with an Attention-based Encoder-Decoder Architecture
    Loyola, Pablo
    Liu, Chen
    Hirate, Yu
    [J]. PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 147 - 151
  • [23] Attention-based encoder-decoder model for answer selection in question answering
    Nie, Yuan-ping
    Han, Yi
    Huang, Jiu-ming
    Jiao, Bo
    Li, Ai-ping
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (04) : 535 - 544
  • [24] An attention-based row-column encoder-decoder model for text recognition in Japanese historical documents
    Ly, Nam Tuan
    Nguyen, Cuong Tuan
    Nakagawa, Masaki
    [J]. PATTERN RECOGNITION LETTERS, 2020, 136 : 134 - 141
  • [25] Multivariate time series forecasting via attention-based encoder-decoder framework
    Du, Shengdong
    Li, Tianrui
    Yang, Yan
    Horng, Shi-Jinn
    [J]. NEUROCOMPUTING, 2020, 388 (388) : 269 - 279
  • [26] Mining Implicit Intention Using Attention-Based RNN Encoder-Decoder Model
    Li, ChenXing
    Du, YaJun
    Wang, SiDa
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 413 - 424
  • [27] Accurate water quality prediction with attention-based bidirectional LSTM and encoder-decoder
    Bi, Jing
    Chen, Zexian
    Yuan, Haitao
    Zhang, Jia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [28] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
    Chen, Zhengyang
    Han, Bing
    Wang, Shuai
    Qian, Yanmin
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649
  • [29] IMPROVED MULTI-STAGE TRAINING OF ONLINE ATTENTION-BASED ENCODER-DECODER MODELS
    Garg, Abhinav
    Gowda, Dhananjaya
    Kumar, Ankur
    Kim, Kwangyoun
    Kumar, Mehul
    Kim, Chanwoo
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 70 - 77
  • [30] Hybrid Attention-Based Encoder-Decoder Fully Convolutional Network for PolSAR Image Classification
    Fang, Zheng
    Zhang, Gong
    Dai, Qijun
    Xue, Biao
    Wang, Peng
    [J]. REMOTE SENSING, 2023, 15 (02)