Entity Dependency Learning Network With Relation Prediction for Video Visual Relation Detection

被引:0
|
作者
Zhang, Guoguang [1 ]
Tang, Yepeng [1 ]
Zhang, Chunjie [1 ]
Zheng, Xiaolong [2 ,3 ,4 ]
Zhao, Yao [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Trajectory; Visualization; Task analysis; Object detection; Encoding; Decoding; Visual relation detection; entity dependency learning; video understanding; GENERATION;
D O I
10.1109/TCSVT.2024.3437437
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video Visual Relation Detection (VidVRD) is a pivotal task in the field of video analysis. It involves detecting object trajectories in videos, predicting potential dynamic relation between these trajectories, and ultimately representing these relationships in the form of <subject, predicate, object> triplets. Correct prediction of relation is vital for VidVRD. Existing methods mostly adopt the simple fusion of visual and language features of entity trajectories as the feature representation for relation predicates. However, these methods do not take into account the dependency information between the relation predication and the subject and object within the triplet. To address this issue, we propose the entity dependency learning network(EDLN), which can capture the dependency information between relation predicates and subjects, objects, and subject-object pairs. It adaptively integrates these dependency information into the feature representation of relation predicates. Additionally, to effectively model the features of the relation existing between various object entities pairs, in the context encoding phase for relation predicate features, we introduce a fully convolutional encoding approach as a substitute for the self-attention mechanism in the Transformer. Extensive experiments on two public datasets demonstrate the effectiveness of the proposed EDLN.
引用
收藏
页码:12425 / 12436
页数:12
相关论文
共 50 条
  • [41] DGRNet: A Dual-Level Graph Relation Network for Video Object Detection
    Qi, Qiang
    Hou, Tianxiang
    Lu, Yang
    Yan, Yan
    Wang, Hanzi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4128 - 4141
  • [42] 3-D Relation Network for visual relation recognition in videos
    Cao, Qianwen
    Huang, Heyan
    Shang, Xindi
    Wang, Boran
    Chua, Tat-Seng
    NEUROCOMPUTING, 2021, 432 : 91 - 100
  • [43] Joint Entity and Relation Extraction Based on Reinforcement Learning
    Zhou, Xin
    Liu, Luping
    Luo, Xiaodong
    Chen, Haiqiang
    Qing, Linbo
    He, Xiaohai
    IEEE ACCESS, 2019, 7 : 125688 - 125699
  • [44] Bootstrapping Joint Entity and Relation Extraction with Reinforcement Learning
    Xia, Min
    Cheng, Xiang
    Su, Sen
    Kuang, Ming
    Li, Gang
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 418 - 432
  • [45] Automatically learning linguistic structures for entity relation extraction
    Yang, Weizhe
    Chen, Yanping
    Xu, Jinling
    Qin, Yongbin
    Chen, Ping
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [46] Knowledge Graph Embedding by Learning to Connect Entity with Relation
    Huang, Zichao
    Li, Bo
    Yin, Jian
    WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 400 - 414
  • [47] Joint Learning of Named Entity Recognition and Relation Extraction
    Xu, Qiuyan
    Li, Fang
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1978 - 1982
  • [48] Learning Entity and Relation Embeddings for Knowledge Graph Completion
    Lin, Yankai
    Liu, Zhiyuan
    Sun, Maosong
    Liu, Yang
    Zhu, Xuan
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2181 - 2187
  • [49] Entity and Entity Type Enhanced Capsule Network for Distant Supervision Relation Extraction
    Heng, Hongjun
    Li, Renjie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [50] Dependency Relation Based Detection of Lexicalized User Goals
    Duan, Ruixue
    Wang, Xiaojie
    Hu, Rile
    Tian, Jilei
    UBIQUITOUS INTELLIGENCE AND COMPUTING, 2010, 6406 : 167 - +