Flexible Low-Rank Statistical Modeling with Missing Data and Side Information

被引:16
|
作者
Fithian, William [1 ]
Mazumder, Rahul [2 ,3 ]
机构
[1] Univ Calif Berkeley, Dept Stat, 301 Evans Hall, Berkeley, CA 94720 USA
[2] MIT, Sloan Sch Management, Operat Res Ctr, Bldg E62-583,100 Main St, Cambridge, MA 02142 USA
[3] MIT, Ctr Stat, Bldg E62-583,100 Main St, Cambridge, MA 02142 USA
关键词
Matrix completion; nuclear norm regularization; matrix factorization; convex optimization; missing data; MATRIX COMPLETION; MINIMIZATION; ALGORITHMS; SHRINKAGE; VALUES; NORM;
D O I
10.1214/18-STS642
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We explore a general statistical framework for low-rank modeling of matrix-valued data, based on convex optimization with a generalized nuclear norm penalty. We study several related problems: the usual low-rank matrix completion problem with flexible loss functions arising from generalized linear models; reduced-rank regression and multi-task learning; and generalizations of both problems where side information about rows and columns is available, in the form of features or smoothing kernels. We show that our approach encompasses maximum a posteriori estimation arising from Bayesian hierarchical modeling with latent factors, and discuss ramifications of the missing-data mechanism in the context of matrix completion. While the above problems can be naturally posed as rank-constrained optimization problems, which are nonconvex and computationally difficult, we show how to relax them via generalized nuclear norm regularization to obtain convex optimization problems. We discuss algorithms drawing inspiration from modern convex optimization methods to address these large scale convex optimization computational tasks. Finally, we illustrate our flexible approach in problems arising in functional data reconstruction and ecological species distribution modeling.
引用
收藏
页码:238 / 260
页数:23
相关论文
共 50 条
  • [21] STATISTICAL INFERENCE BASED ON ROBUST LOW-RANK DATA MATRIX APPROXIMATION
    Feng, Xingdong
    He, Xuming
    ANNALS OF STATISTICS, 2014, 42 (01): : 190 - 210
  • [22] Convolutional Low-Rank Tensor Representation for Structural Missing Traffic Data Imputation
    Li, Ben-Zheng
    Zhao, Xi-Le
    Chen, Xinyu
    Ding, Meng
    Liu, Ryan Wen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 18847 - 18860
  • [23] Rank Determination for Low-Rank Data Completion
    Ashraphijuo, Morteza
    Wang, Xiaodong
    Aggarwal, Vaneet
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [24] Rank determination for low-rank data completion
    1600, Microtome Publishing (18):
  • [25] Low-Rank Tensor Completion Method for Implicitly Low-Rank Visual Data
    Ji, Teng-Yu
    Zhao, Xi-Le
    Sun, Dong-Lin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1162 - 1166
  • [26] Modeling appearances with low-rank SVM
    Wolf, Lior
    Jhuang, Hueihan
    Hazan, Tamir
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 990 - +
  • [27] MATRIX COMPLETION UNDER LOW-RANK MISSING MECHANISM
    Mao, Xiaojun
    Wong, Raymond K. W.
    Chen, Song Xi
    STATISTICA SINICA, 2021, 31 (04) : 2005 - 2030
  • [28] Statistical mechanics of low-rank tensor decomposition
    Kadmon, Jonathan
    Ganguli, Surya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [29] Statistical mechanics of low-rank tensor decomposition
    Kadmon, Jonathan
    Ganguli, Surya
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2019, 2019 (12):
  • [30] Missing Data Reconstruction for Remote Sensing Images With Weighted Low-Rank Tensor Model
    Cheng, Qing
    Yuan, Qiangqiang
    Ng, Michael Kwok-Po
    Shen, Huanfeng
    Zhang, Liangpei
    IEEE ACCESS, 2019, 7 : 142339 - 142352