Dual Scene Graph Convolutional Network for Motivation Prediction

被引:3
|
作者
Wanyan, Yuyang [1 ,2 ]
Yang, Xiaoshan [1 ,2 ,3 ]
Ma, Xuan [1 ,2 ]
Xu, Changsheng [1 ,2 ,3 ]
机构
[1] Univ Chinese Acad Sci UCAS, Inst Automat, Chinese Acad Sci CASIA, Natl Lab Pattern Recognit, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Motivation prediction; scene graph; graph convolutional network; multi-modalities; INTENTS;
D O I
10.1145/3572914
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Humans can easily infer the motivations behind human actions from only visual data by comprehensively analyzing the complex context information and utilizing abundant life experiences. Inspired by humans' reasoning ability, existing motivation prediction methods have improved image-based deep classification models using the commonsense knowledge learned by pre-trained language models. However, the knowledge learned from public text corpora is probably incompatible with the task-specific data of the motivation prediction, which may impact the model performance. To address this problem, this paper proposes a dual scene graph convolutional network (dual-SGCN) to comprehensively explore the complex visual information and semantic context prior from the image data for motivation prediction. The proposed dual-SGCN has a visual branch and a semantic branch. For the visual branch, we build a visual graph based on scene graph where object nodes and relation edges are represented by visual features. For the semantic branch, we build a semantic graph where nodes and edges are directly represented by the word embeddings of the object and relation labels. In each branch, node-oriented and edge-oriented message passing is adopted to propagate interaction information between different nodes and edges. Besides, a multi-modal interactive attention mechanism is adopted to cooperatively attend and fuse the visual and semantic information. The proposed dual-SGCN is learned in an end-to-end form by a multi-task co-training scheme. In the inference stage, Total Direct Effect is adopted to alleviate the bias caused by the semantic context prior. Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Dual flow fusion graph convolutional network for traffic flow prediction
    Zhao, Yuan
    Li, Mingxin
    Wen, Haoyang
    Zhao, Hui
    Wang, Yongjian
    Wen, Shixi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (08) : 3425 - 3437
  • [2] Smart Lung Tumor Prediction Using Dual Graph Convolutional Neural Network
    Alameen, Abdalla
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 369 - 383
  • [3] Irregular Scene Text Detection Based on a Graph Convolutional Network
    Zhang, Shiyu
    Zhou, Caiying
    Li, Yonggang
    Zhang, Xianchao
    Ye, Lihua
    Wei, Yuanwang
    [J]. SENSORS, 2023, 23 (03)
  • [4] A dual-path dynamic directed graph convolutional network for air quality prediction
    Xiao, Xiao
    Jin, Zhiling
    Wang, Shuo
    Xu, Jing
    Peng, Ziyan
    Wang, Rui
    Shao, Wei
    Hui, Yilong
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 827
  • [5] Bayesian graph convolutional network for traffic prediction
    Fu, Jun
    Zhou, Wei
    Chen, Zhibo
    [J]. NEUROCOMPUTING, 2024, 582
  • [6] Scene-Perception Graph Convolutional Networks for Human Action Prediction
    Tao, Ji'an
    Xu, Lu
    Ma, Xinyan
    Yan, Jianyu
    Mei, Kuizhi
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] Drug Recommendation Model for Graph Embedding Dual Graph Convolutional Network
    Jiang, Yuzhe
    Cheng, Quan
    [J]. Computer Engineering and Applications, 2024, 60 (07) : 315 - 324
  • [8] Dual Attention Graph Convolutional Network for Relation Extraction
    Zhang, Donghao
    Liu, Zhenyu
    Jia, Weiqiang
    Wu, Fei
    Liu, Hui
    Tan, Jianrong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 530 - 543
  • [9] DEDGCN: Dual Evolving Dynamic Graph Convolutional Network
    Zhong, Fengzhe
    Liu, Yan
    Liu, Lian
    Zhang, Guangsheng
    Duan, Shunran
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [10] Dual Cost-sensitive Graph Convolutional Network
    Duan, Yijun
    Liu, Xin
    Jatowt, Adam
    Yu, Hai-tao
    Lynden, Steven
    Kim, Kyoung-Sook
    Matono, Akiyoshi
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,