Double-Matched Matrix Decomposition for Multi-View Data

被引:1
|
作者
Yuan, Dongbang [1 ]
Gaynanova, Irina [1 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
Data integration; Dimension reduction; Matrix factorization; Multi-block data; Principal component analysis; COMPONENT ANALYSIS; ALGORITHMS; NUMBER; JOINT;
D O I
10.1080/10618600.2022.2067860
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of extracting joint and individual signals from multi-view data, that is, data collected from different sources on matched samples. While existing methods for multi-view data decomposition explore single matching of data by samples, we focus on double-matched multi-view data (matched by both samples and source features). Our motivating example is the miRNA data collected from both primary tumor and normal tissues of the same subjects; the measurements from two tissues are thus matched both by subjects and by miRNAs. Our proposed double-matched matrix decomposition allows us to simultaneously extract joint and individual signals across subjects, as well as joint and individual signals across miRNAs. Our estimation approach takes advantage of double-matching by formulating a new type of optimization problem with explicit row space and column space constraints, for which we develop an efficient iterative algorithm. Numerical studies indicate that taking advantage of double-matching leads to superior signal estimation performance compared to existing multi-view data decomposition based on single-matching. We apply our method to miRNA data as well as data from the English Premier League soccer matches and find joint and individual multi-view signals that align with domain-specific knowledge. Supplementary materials for this article are available online.
引用
收藏
页码:1114 / 1126
页数:13
相关论文
共 50 条
  • [31] Federated Multi-view Matrix Factorization for Personalized Recommendations
    Flanagan, Adrian
    Oyomno, Were
    Grigorievskiy, Alexander
    Tan, Kuan E.
    Khan, Suleiman A.
    Ammad-Ud-Din, Muhammad
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 324 - 347
  • [32] Parallel Multi-View Graph Matrix Completion for Large Input Matrix
    Koohi, Arezou
    Homayoun, Houman
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 337 - 341
  • [33] Robust multi-view data clustering with multi-view capped-norm K-means
    Huang, Shudong
    Ren, Yazhou
    Xu, Zenglin
    NEUROCOMPUTING, 2018, 311 : 197 - 208
  • [34] Modeling and maintaining multi-view data warehouses
    Stanoi, L
    Agrawal, D
    El Abbadi, A
    CONCEPTUAL MODELING - ER'99, 1999, 1728 : 161 - 175
  • [35] Multi-view Similarity Learning of Manifold Data
    Wang, Rui-rui
    Chen, Si-bao
    Luo, Bin
    Zhang, Jian
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 631 - 643
  • [36] Breaking down multi-view clustering: A comprehensive review of multi-view approaches for complex data structures
    Haris, Muhammad
    Yusoff, Yusliza
    Zain, Azlan Mohd
    Khattak, Abid Saeed
    Hussain, Syed Fawad
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [37] Imputation of missing values in multi-view data
    van Loon, Wouter
    de Vos, Frank
    de Vos, Frank
    Koini, Marisa
    Schmidt, Reinhold
    de Rooij, Mark
    INFORMATION FUSION, 2024, 111
  • [38] Multi-View Concept Learning for Data Representation
    Guan, Ziyu
    Zhang, Lijun
    Peng, Jinye
    Fan, Jianping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (11) : 3016 - 3028
  • [39] Robust integrative biclustering for multi-view data
    Zhang, Weijie
    Wendt, Christine
    Bowler, Russel
    Hersh, Craig P.
    Safo, Sandra E.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2022, 31 (11) : 2201 - 2216
  • [40] A survey on representation learning for multi-view data
    Qin, Yalan
    Zhang, Xinpeng
    Yu, Shui
    Feng, Guorui
    NEURAL NETWORKS, 2025, 181