Distributed inference for degenerate U-statistics

被引:2
|
作者
Atta-Asiamah, Ernest [1 ]
Yuan, Mingao [1 ]
机构
[1] North Dakota State Univ, Dept Stat, Fargo, ND 58108 USA
来源
STAT | 2019年 / 8卷 / 01期
关键词
big data analysis; degenerate U-statistics; divide-and-conquer; hypothesis test; EMPIRICAL LIKELIHOOD; BOOTSTRAP;
D O I
10.1002/sta4.234
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In many hypothesis testing problems, the test statistics are degenerate U-statistics. One of the challenges in practice is the computation of U-statistics for large dataset. In this paper, we aim to reduce the computation complexity of degenerate U-statistics by using the divide-and-conquer method. Specifically, we partition the full n data points into k(n) even disjoint groups, compute the U-statistics on each group, and combine them by averaging. In this way, the running time is reduced to O(nmknm-1), where m is the order of the U-statistics. Besides, we study the optimal test rate of the divide-and-conquer methods. For degenerate U-statistics, the optimal rate is n , the same as the optimal test rate for the nondegenerate case. The simulation and real data confirm that the proposed method has high power and faster running time.
引用
收藏
页数:8
相关论文
共 50 条