Efficient and Effective Multi-Modal Queries Through Heterogeneous Network Embedding

被引:0
|
作者
Chi Thang Duong [1 ]
Thanh Tam Nguyen [2 ]
Yin, Hongzhi [3 ]
Weidlich, Matthias [4 ]
Mai, Thai Son [5 ]
Aberer, Karl [1 ]
Quoc Viet Hung Nguyen [6 ]
机构
[1] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City 70000, Vietnam
[3] Univ Queensland, St Lucia, Qld 4072, Australia
[4] Humboldt Univ, D-10117 Berlin, Germany
[5] Queens Univ Belfast, Belfast BT7 1NN, Antrim, North Ireland
[6] Griffith Univ, Nathan, Qld 4111, Australia
关键词
Information retrieval; Data models; Semantics; Videos; Games; Task analysis; Heterogeneous networks; Query embedding; graph embedding; heterogeneous information network; INFORMATION; RETRIEVAL; FUSION;
D O I
10.1109/TKDE.2021.3052871
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The heterogeneity of today's Web sources requires information retrieval (IR) systems to handle multi-modal queries. Such queries define a user's information needs by different data modalities, such as keywords, hashtags, user profiles, and other media. Recent IR systems answer such a multi-modal query by considering it as a set of separate uni-modal queries. However, depending on the chosen operationalisation, such an approach is inefficient or ineffective. It either requires multiple passes over the data or leads to inaccuracies since the relations between data modalities are neglected in the relevance assessment. To mitigate these challenges, we present an IR system that has been designed to answer genuine multi-modal queries. It relies on a heterogeneous network embedding, so that features from diverse modalities can be incorporated when representing both, a query and the data over which it shall be evaluated. By embedding a query and the data in the same vector space, the relations across modalities are made explicit and exploited for more accurate query evaluation. At the same time, multi-modal queries are answered with a single pass over the data. An experimental evaluation using diverse real-world and synthetic datasets illustrates that our approach returns twice the amount of relevant information compared to baseline techniques, while scaling to large multi-modal databases.
引用
收藏
页码:5307 / 5320
页数:14
相关论文
共 50 条
  • [1] Fusing heterogeneous information for multi-modal attributed network embedding
    Yang Jieyi
    Zhu Feng
    Dong Yihong
    Qian Jiangbo
    [J]. Applied Intelligence, 2023, 53 : 22328 - 22347
  • [2] Fusing heterogeneous information for multi-modal attributed network embedding
    Yang, Jieyi
    Zhu, Feng
    Dong, Yihong
    Qian, Jiangbo
    [J]. APPLIED INTELLIGENCE, 2023, 53 (19) : 22328 - 22347
  • [3] Multi-source and Multi-modal Deep Network Embedding for Cross-network Node Classification
    Yang, Hongwei
    He, Hui
    Zhang, Weizhe
    Wang, Yan
    Jing, Lin
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (06)
  • [4] Architecture of the multi-modal organizational research and production heterogeneous network (MORPHnet)
    Aiken, RJ
    Carlson, RA
    Foster, IT
    Kuhfuss, TC
    Stevens, RL
    Winkler, L
    [J]. INTELLIGENT NETWORKS AND INTELLIGENCE IN NETWORKS, 1997, : 229 - 258
  • [5] Integrated Heterogeneous Graph Attention Network for Incomplete Multi-modal Clustering
    Wang, Yu
    Yao, Xinjie
    Zhu, Pengfei
    Li, Weihao
    Cao, Meng
    Hu, Qinghua
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 3847 - 3866
  • [6] Heterogeneous-Grained Multi-Modal Graph Network for Outfit Recommendation
    Xu, Rucong
    Wang, Jianfeng
    Li, Yun
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1788 - 1799
  • [7] Multi-modal network Protocols
    Balan, RK
    Akella, A
    Seshan, S
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2002, 32 (01) : 60 - 60
  • [8] Multi-modal Interactive Video Retrieval with Temporal Queries
    Heller, Silvan
    Arnold, Rahel
    Gasser, Ralph
    Gsteiger, Viktor
    Parian-Scherb, Mahnaz
    Rossetto, Luca
    Sauter, Loris
    Spiess, Florian
    Schuldt, Heiko
    [J]. MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 493 - 498
  • [9] LCEMH: Label Correlation Enhanced Multi-modal Hashing for efficient multi-modal retrieval
    Zheng, Chaoqun
    Zhu, Lei
    Zhang, Zheng
    Duan, Wenjun
    Lu, Wenpeng
    [J]. INFORMATION SCIENCES, 2024, 659
  • [10] Corporate Relative Valuation Using Heterogeneous Multi-Modal Graph Neural Network
    Yang, Yang
    Yang, Jia-Qi
    Bao, Ran
    Zhan, De-Chuan
    Zhu, Hengshu
    Gao, Xiao-Ru
    Xiong, Hui
    Yang, Jian
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 211 - 224