Flexible Low-Rank Statistical Modeling with Missing Data and Side Information

Cited by: 16
Authors
Fithian, William [1]
Mazumder, Rahul [2, 3]
Affiliations
[1] Univ Calif Berkeley, Dept Stat, 301 Evans Hall, Berkeley, CA 94720 USA
[2] MIT, Sloan Sch Management, Operat Res Ctr, Bldg E62-583,100 Main St, Cambridge, MA 02142 USA
[3] MIT, Ctr Stat, Bldg E62-583,100 Main St, Cambridge, MA 02142 USA
Keywords
Matrix completion; nuclear norm regularization; matrix factorization; convex optimization; missing data; minimization; algorithms; shrinkage; values; norm
DOI
10.1214/18-STS642
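Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract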
We explore a general statistical framework for low-rank modeling of matrix-valued data, based on convex optimization with a generalized nuclear norm penalty. We study several related problems: the usual low-rank matrix completion problem with flexible loss functions arising from generalized linear models; reduced-rank regression and multi-task learning; and generalizations of both problems where side information about rows and columns is available, in the form of features or smoothing kernels. We show that our approach encompasses maximum a posteriori estimation arising from Bayesian hierarchical modeling with latent factors, and discuss ramifications of the missing-data mechanism in the context of matrix completion. While the above problems can be naturally posed as rank-constrained optimization problems, which are nonconvex and computationally difficult, we show how to relax them via generalized nuclear norm regularization to obtain convex optimization problems. We discuss algorithms drawing inspiration from modern convex optimization methods to address these large-scale convex optimization tasks. Finally, we illustrate our flexible approach in problems arising in functional data reconstruction and ecological species distribution modeling.
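As a rough illustration of the convex relaxation the abstract describes, the sketch below applies singular-value soft-thresholding (the proximal operator of the ordinary nuclear norm) inside a Soft-Impute-style iteration for matrix completion. It is not taken from the paper: it assumes squared-error loss, the plain rather than generalized nuclear norm, and no side information, and the names `svd_soft_threshold`, `soft_impute`, `lam`, and `mask` are hypothetical.

```python
import numpy as np


def svd_soft_threshold(Z, lam):
    """Shrink the singular values of Z by lam (prox of lam * nuclear norm)."""
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    s_shrunk = np.maximum(s - lam, 0.0)  # singular values below lam become 0
    return (U * s_shrunk) @ Vt


def soft_impute(X, mask, lam, n_iters=200, tol=1e-6):
    """Illustrative Soft-Impute-style iteration (assumed squared-error loss).

    X    : 2-D array; values where mask is False are ignored.
    mask : boolean array, True where an entry of X is observed.
    lam  : nuclear norm penalty weight (larger values give lower rank).
    """
    M = np.zeros_like(X, dtype=float)
    for _ in range(n_iters):
        # Fill unobserved entries with the current estimate, then shrink.
        filled = np.where(mask, X, M)
        M_new = svd_soft_threshold(filled, lam)
        # Stop when the relative change in the estimate is small.
        if np.linalg.norm(M_new - M) <= tol * max(np.linalg.norm(M), 1.0):
            M = M_new
            break
        M = M_new
    return M


# Small demo on synthetic data (all values here are made up for illustration).
rng = np.random.default_rng(0)
truth = rng.normal(size=(30, 2)) @ rng.normal(size=(2, 20))  # rank-2 matrix
observed = rng.random(truth.shape) < 0.6                     # ~60% observed
estimate = soft_impute(truth, observed, lam=0.5)
print("rank of estimate:", np.linalg.matrix_rank(estimate, tol=1e-6))
```

The penalty weight `lam` trades fit against rank: each iteration zeroes every singular value below `lam`, so the estimate is low-rank without imposing an explicit (nonconvex) rank constraint.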
Pages: 238-260
Number of pages: 23