On Inferences from Completed Data

被引:0
|
作者
Haddock, Jamie
Molitor, Denali
Needell, Deanna
Sambandam, Sneha
Song, Joy
Sun, Simon
机构
基金
美国国家科学基金会;
关键词
MATRIX;
D O I
10.1109/sampta45681.2019.9030885
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Matrix completion has become an extremely important technique as data scientists are routinely faced with large, incomplete datasets on which they wish to perform statistical inferences. We investigate how error introduced via matrix completion affects statistical inference. Furthermore, we prove recovery error bounds which depend upon the matrix recovery error for several common statistical inferences. We consider matrix recovery via nuclear norm minimization and a variant, l(1)-regularized nuclear norm minimization for data with a structured sampling pattern. Finally, we run a series of numerical experiments on synthetic data and real patient surveys from MyLymeData, which illustrate the relationship between inference recovery error and matrix recovery error. These results indicate that exact matrix recovery is often not necessary to achieve small inference recovery error.
引用
收藏
页数:5
相关论文
共 50 条