Multi-View Representation Learning via View-Aware Modulation

被引:0
|
作者
Wang, Ren [1 ]
Sun, Haoliang [1 ]
Nie, Xiushan [2 ]
Lin, Yuxiu [3 ]
Xi, Xiaoming [2 ]
Yin, Yilong [1 ]
机构
[1] Shandong Univ, Jinan, Shandong, Peoples R China
[2] Shandong Jianzhu Univ, Jinan, Shandong, Peoples R China
[3] Shandong Univ Finance & Econ, Jinan, Shandong, Peoples R China
关键词
multi-view; representation learning; feature fusion; modulation;
D O I
10.1145/3581783.3612494
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view (representation) learning derives an entity's representation from its multiple observable views to facilitate various downstream tasks. The most challenging topic is how to model unobserved entities and their relationships to specific views. To this end, this work proposes a novel multi-view learning method using a View-Aware parameter Modulation mechanism, termed VAM. The key idea is to use trainable parameters as proxies for unobserved entities and views, such that modeling entity-view relationships is converted into modeling the relationship between proxy parameters. Specifically, we first build a set of trainable parameters to learn a mapping from multi-view data to the unified representation as the entity proxy. Then we learn a prototype for each view and design a Modulation Parameter Generator (MPG) that learns a set of view-aware scale and shift parameters from prototypes to modulate the entity proxy and obtain view proxies. By constraining the representativeness, uniqueness, and simplicity of the proxies and proposing an entity-view contrastive loss, parameters are alternatively updated. We end up with a set of discriminative prototypes, view proxies, and an entity proxy that are flexible enough to yield robust representations for out-of-sample entities. Extensive experiments on five datasets show that the results of our VAM outperform existing methods in both classification and clustering tasks.
引用
收藏
页码:3876 / 3886
页数:11
相关论文
共 50 条
  • [1] UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification
    Li, Yapeng
    Luo, Yong
    Du, Bo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5129 - 5143
  • [2] Uncertainty-Aware Multi-View Representation Learning
    Geng, Yu
    Han, Zongbo
    Zhang, Changqing
    Hu, Qinghua
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7545 - 7553
  • [3] Multi-view representation learning for multi-view action recognition
    Hao, Tong
    Wu, Dan
    Wang, Qian
    Sun, Jin-Sheng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 453 - 460
  • [4] Decoupled representation for multi-view learning
    Sun, Shiding
    Wang, Bo
    Tian, Yingjie
    [J]. PATTERN RECOGNITION, 2024, 151
  • [5] Multi-view representation learning and understanding
    [J]. Multimedia Tools and Applications, 2021, 80 : 22865 - 22865
  • [6] Comprehensive Multi-view Representation Learning
    Zheng, Qinghai
    Zhu, Jihua
    Li, Zhongyu
    Tian, Zhiqiang
    Li, Chen
    [J]. INFORMATION FUSION, 2023, 89 : 198 - 209
  • [7] On Deep Multi-View Representation Learning
    Wang, Weiran
    Arora, Raman
    Livescu, Karen
    Bilmes, Jeff
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1083 - 1092
  • [8] A Survey of Multi-View Representation Learning
    Li, Yingming
    Yang, Ming
    Zhang, Zhongfei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (10) : 1863 - 1883
  • [9] Multi-view representation learning and understanding
    Zhou, Tao
    Zhang, Yu
    Thung, Kim-Han
    Adeli, Ehsan
    Rekik, Islem
    Zhao, Qibin
    Zhang, Changqing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22865 - 22865
  • [10] Robust Multi-View Representation Learning
    Venkatesan, Sibi
    Miller, James K.
    Dubrawski, Artur
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13939 - 13940