DISTRIBUTED ESTIMATION AND INFERENCE FOR SEMIPARAMETRIC BINARY RESPONSE MODELS

被引:0
|
作者
Chen, Xi [1 ]
Jing, Wenbo [1 ]
Liu, Weidong [2 ]
Zhang, Yichen [3 ]
机构
[1] NYU, Stern Sch Business, New York, NY 10012 USA
[2] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai, Peoples R China
[3] Purdue Univ, Mitchell E Daniels Jr Sch Business, W Lafayette, IN USA
来源
ANNALS OF STATISTICS | 2024年 / 52卷 / 03期
关键词
Distributed inference; semiparametric inference; binary response model; maximum score estimator; divide and conquer; DIVIDE-AND-CONQUER; QUANTILE REGRESSION; SUPEREFFICIENCY; EFFICIENCY; FRAMEWORK;
D O I
10.1214/24-AOS2376
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The development of modern technology has enabled data collection of unprecedented size, which poses new challenges to many statistical estimation and inference problems. This paper studies the maximum score estimator of a semiparametric binary choice model under a distributed computing environment without prespecifying the noise distribution. An intuitive divideand-conquer estimator is computationally expensive and restricted by a nonregular constraint on the number of machines, due to the highly nonsmooth nature of the objective function. We propose (1) a one-shot divide-and-conquer estimator after smoothing the objective to relax the constraint, and (2) a multiround estimator to completely remove the constraint via iterative smoothing. We specify an adaptive choice of kernel smoother with a sequentially shrinking bandwidth to achieve the superlinear improvement of the optimization error over multiple iterations. The improved statistical accuracy per iteration is derived, and a quadratic convergence up to the optimal statistical error rate is established. We further provide two generalizations to handle the heterogeneity of data sets and high-dimensional problems where the parameter of interest is sparse.
引用
收藏
页码:922 / 947
页数:26
相关论文
共 50 条