Deep relational self-attention networks for scene graph generation

Cited by: 3
Authors
Li, Ping [1]
Yu, Zhou [1]
Zhan, Yibing [1,2]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou, Peoples R China
[2] JD Explore Acad, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Scene graph generation; Image understanding; Deep neural networks;
DOI
10.1016/j.patrec.2021.12.013
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scene graph generation (SGG) aims to simultaneously detect objects in an image and predict relations among the detected objects. SGG is challenging because it requires modeling the contextualized relationships among objects rather than only considering relationships between paired objects. Most existing approaches address this problem with a CNN or RNN framework, which cannot explicitly and effectively model the dense interactions among objects. In this paper, we exploit the attention mechanism and introduce a relational self-attention (RSA) module to simultaneously model the object and relation contexts. By stacking such RSA modules in depth, we obtain a deep relational self-attention network (RSAN), which is able to characterize complex interactions, thus facilitating the understanding of object and relation semantics. Extensive experiments on the benchmark Visual Genome dataset demonstrate the effectiveness of RSAN. (c) 2021 Elsevier B.V. All rights reserved.
Pages: 200-206
Number of pages: 7
Related papers
50 in total
  • [11] SELF-ATTENTION AND RETRIEVAL ENHANCED NEURAL NETWORKS FOR ESSAY GENERATION
    Wang, Wei
    Zheng, Hai-Tao
    Lin, Zibo
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8199 - 8203
  • [12] Convolutional Self-Attention Networks
    Yang, Baosong
    Wang, Longyue
    Wong, Derek F.
    Chao, Lidia S.
    Tu, Zhaopeng
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4040 - 4045
  • [13] Graph convolutional networks with the self-attention mechanism for adaptive influence maximization in social networks
    Tang, Jianxin
    Song, Shihui
    Du, Qian
    Yao, Yabing
    Qu, Jitao
    COMPLEX & INTELLIGENT SYSTEMS, 2024, : 8383 - 8401
  • [14] Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks
    Lian, Zheng
    Tao, Jianhua
    Liu, Bin
    Huang, Jian
    Yang, Zhanlei
    Li, Rongjun
    INTERSPEECH 2020, 2020, : 2347 - 2351
  • [15] Self-attention generative adversarial networks applied to conditional music generation
    Pedro Lucas Tomaz Neves
    José Fornari
    João Batista Florindo
    Multimedia Tools and Applications, 2022, 81 : 24419 - 24430
  • [16] Learning Scene Representations for Human-assistive Displays Using Self-attention Networks
    Ruiz-Serra, Jaime
    White, Jack
    Petrie, Stephen
    Kameneva, Tatiana
    Mccarthy, Chris
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [17] Global Self-Attention as a Replacement for Graph Convolution
    Hussain, Md Shamim
    Zaki, Mohammed J.
    Subramanian, Dharmashankar
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 655 - 665
  • [18] Original Music Generation using Recurrent Neural Networks with Self-Attention
    Jagannathan, Akash
    Chandrasekaran, Bharathi
    Dutta, Shubham
    Patil, Uma Rameshgouda
    Eirinaki, Magdalini
    2022 FOURTH IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST 2022), 2022, : 56 - 63
  • [19] Self-attention generative adversarial networks applied to conditional music generation
    Tomaz Neves, Pedro Lucas
    Fornari, Jose
    Florindo, Joao Batista
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24419 - 24430
  • [20] Progressive Scene Segmentation Based on Self-Attention Mechanism
    Pan, Yunyi
    Gan, Yuan
    Liu, Kun
    Zhang, Yan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3985 - 3992