Attention-based Context Aware Reasoning for Situation Recognition

被引:9
|
作者
Cooray, Thilini [1 ]
Cheung, Ngai-Man [1 ]
Lu, Wei [1 ]
机构
[1] Singapore Univ Technol & Design SUTD, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR42600.2020.00479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Situation Recognition (SR) is a fine-grained action recognition task where the model is expected to not only predict the salient action of the image, but also predict values of all associated semantic roles of the action. Predicting semantic roles is very challenging: a vast variety of possibilities can be the match for a semantic role. Existing work has focused on dependency modelling architectures to solve this issue. Inspired by the success achieved by query-based visual reasoning (e.g., Visual Question Answering), we propose to address semantic role prediction as a query-based visual reasoning problem. However, existing query-based reasoning methods have not considered handling of inter-dependent queries which is a unique requirement of semantic role prediction in SR. Therefore, to the best of our knowledge, we propose the first set of methods to address inter-dependent queries in query-based visual reasoning. Extensive experiments demonstrate the effectiveness of our proposed method which achieves outstanding performance on Situation Recognition task. Furthermore, leveraging query inter-dependency, our methods improve upon a state-of-the-art method that answers queries separately. Our code: https://github.com/thilinicooray/context-aware-reasoning-for-sr
引用
收藏
页码:4735 / 4744
页数:10
相关论文
共 50 条
  • [41] An attention-based network for serial number recognition on banknotes
    Lin, Zhijie
    He, Zhaoshui
    Tan, Beihai
    Shen, Yijiang
    Wang, Peitao
    Liu, Taiheng
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 106
  • [42] TransSketchNet: Attention-Based Sketch Recognition Using Transformers
    Jain, Gaurav
    Chopra, Shivang
    Chopra, Suransh
    Parihar, Anil Singh
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2907 - 2908
  • [43] Attention-based BiLSTM Network for Chinese Simile Recognition
    Guo, Jingjin
    Song, Wei
    Liu, Xianjun
    Liu, Lizhen
    Zhao, Xinlei
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 144 - 147
  • [44] ATTENTION-BASED SPEECH RECOGNITION USING GAZE INFORMATION
    Segawa, Osamu
    Hayashi, Tomoki
    Takeda, Kazuya
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 465 - 470
  • [45] Effective Attention-Based Mechanism for Masked Face Recognition
    Pann, Vandet
    Lee, Hyo Jong
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [46] Towards Understanding Attention-Based Speech Recognition Models
    Qin, Chu-Xiong
    Qu, Dan
    [J]. IEEE ACCESS, 2020, 8 : 24358 - 24369
  • [47] Attention-Based Gated Recurrent Unit for Gesture Recognition
    Khodabandelou, Ghazaleh
    Jung, Pyeong-Gook
    Amirat, Yacine
    Mohammed, Samer
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 18 (02) : 495 - 507
  • [48] Lattice generation in attention-based speech recognition models
    Zapotoczny, Michal
    Pietrzak, Piotr
    Lancucki, Adrian
    Chorowski, Jan
    [J]. INTERSPEECH 2019, 2019, : 2225 - 2229
  • [49] Region Dual Attention-Based Video Emotion Recognition
    Liu, Xiaodong
    Xu, Huating
    Wang, Miao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] Attention-Based Dense LSTM for Speech Emotion Recognition
    Xie, Yue
    Liang, Ruiyu
    Liang, Zhenlin
    Zhao, Li
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (07): : 1426 - 1429