Lack-of-fit checking for parametric models is essential in reducing misspecification. However, for massive data sets which are increasingly prevalent, classical tests become prohibitively costly in computation and their feasibility is questionable even with modern parallel computing platforms. Building on the divide and conquer strategy, we propose a new nonparametric testing method, that is fast to compute and easy to implement with only one tuning parameter determined by a given time budget. Under mild conditions, we show that the proposed test statistic is asymptotically equivalent to that based on the whole data. Benefiting from using the sample-splitting idea for choosing the smoothing parameter, the proposed test is able to retain the type-I error rate pretty well with asymptotic distributions and achieves adaptive rate-optimal detection properties. Its advantage relative to existing methods is also demonstrated in numerical simulations and a data illustration. (C) 2018 Elsevier B.V. All rights reserved.
机构:
Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USAPurdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Zhe, Shandian
Xu, Zenglin
论文数: 0引用数: 0
h-index: 0
机构:
Univ Elect Sci & Technol China, Big Data Res Ctr, Sch Comp Sci & Engn, Chengdu, Peoples R ChinaPurdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Xu, Zenglin
Chu, Xinqi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Illinois, Dept Elect & Comp Engn, Urbana, IL USAPurdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Chu, Xinqi
Qi, Yuan
论文数: 0引用数: 0
h-index: 0
机构:
Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USAPurdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Qi, Yuan
Park, Youngja
论文数: 0引用数: 0
h-index: 0
机构:
IBM Thomas J Watson Res Ctr, Ossining, NY USAPurdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA