Entity Dependency Learning Network With Relation Prediction for Video Visual Relation Detection

被引:0
|
作者
Zhang, Guoguang [1 ]
Tang, Yepeng [1 ]
Zhang, Chunjie [1 ]
Zheng, Xiaolong [2 ,3 ,4 ]
Zhao, Yao [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Trajectory; Visualization; Task analysis; Object detection; Encoding; Decoding; Visual relation detection; entity dependency learning; video understanding; GENERATION;
D O I
10.1109/TCSVT.2024.3437437
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video Visual Relation Detection (VidVRD) is a pivotal task in the field of video analysis. It involves detecting object trajectories in videos, predicting potential dynamic relation between these trajectories, and ultimately representing these relationships in the form of <subject, predicate, object> triplets. Correct prediction of relation is vital for VidVRD. Existing methods mostly adopt the simple fusion of visual and language features of entity trajectories as the feature representation for relation predicates. However, these methods do not take into account the dependency information between the relation predication and the subject and object within the triplet. To address this issue, we propose the entity dependency learning network(EDLN), which can capture the dependency information between relation predicates and subjects, objects, and subject-object pairs. It adaptively integrates these dependency information into the feature representation of relation predicates. Additionally, to effectively model the features of the relation existing between various object entities pairs, in the context encoding phase for relation predicate features, we introduce a fully convolutional encoding approach as a substitute for the self-attention mechanism in the Transformer. Extensive experiments on two public datasets demonstrate the effectiveness of the proposed EDLN.
引用
收藏
页码:12425 / 12436
页数:12
相关论文
共 50 条
  • [31] Software Knowledge Entity Relation Extraction with Entity-Aware and Syntactic Dependency Structure Information
    Tang, Mingjing
    Li, Tong
    Wang, Wei
    Zhu, Rui
    Ma, Zifei
    Tang, Yahui
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [32] Joint Entity and Relation Extraction With Set Prediction Networks
    Sui, Dianbo
    Zeng, Xiangrong
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12784 - 12795
  • [33] Relation-wise transformer network and reinforcement learning for visual navigation
    He Y.
    Zhou K.
    Neural Computing and Applications, 2024, 36 (21) : 13205 - 13221
  • [34] Multiple Hypothesis Video Relation Detection
    Di, Donglin
    Shang, Xindi
    Zhang, Weinan
    Yang, Xun
    Chua, Tat-Seng
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), 2019, : 287 - 291
  • [35] A Novel Entity and Relation Joint Interaction Learning Approach for Entity Alignment
    Wu, Di
    Li, Tong
    Zhao, Yiran
    Liu, Junrui
    Tang, Zifang
    Yang, Zhen
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (05) : 821 - 843
  • [36] A Partition Filter Network for Joint Entity and Relation Extraction
    Yan, Zhiheng
    Zhang, Chong
    Fu, Jinlan
    Zhang, Qi
    Wei, Zhongyu
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 185 - 197
  • [37] A Dependency-Based Neural Network for Relation Classification
    Liu, Yang
    Wei, Furu
    Li, Sujian
    Ji, Heng
    Zhou, Ming
    Wang, Houfeng
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 285 - 290
  • [38] Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching
    Zhang, Kun
    Wu, Le
    Lv, Guangyi
    Wang, Meng
    Chen, Enhong
    Ruan, Shulan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14411 - 14419
  • [39] Integration of Relation Filtering and Multi-Task Learning in GlobalPointer for Entity and Relation Extraction
    Liu, Bin
    Tao, Jialin
    Chen, Wanyuan
    Zhang, Yijie
    Chen, Min
    He, Lei
    Tang, Dan
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [40] MRN: Moment Relation Network for Natural Language Video Localization with Transfer Learning
    Jiang, Siyu
    Wu, Guobin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (07)