Private and Non-private Uniformity Testing for Ranking Data

被引:0
|
作者
Busa-Fekete, Robert [1 ]
Fotakis, Dimitris [2 ]
Zampetakis, Manolis [3 ]
机构
[1] Google Res, New York, NY 10011 USA
[2] Natl Tech Univ Athens, Athens, Greece
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of uniformity testing for statistical data that consists of rankings over m items, where the alternative class is restricted to Mallows models. Testing ranking data is challenging because of the size of the large domain that is factorial in m, therefore the tester needs to take advantage of some structure of the alternative class. We show that uniform distribution can be distinguished from Mallows model with O(m-(1/2)) samples based on simple pairwise statistics, which allows us to test uniformity using only two samples, if m is large enough. We also consider uniformity testing with central and local differential privacy (DP) constraints. We present a central DP algorithm that requires O(max{1/epsilon 0, 1/root m}), where epsilon(0) is the privacy budget parameter. Interestingly, our uniformity testing algorithm is straightforward to apply to the local DP scenario, since it works with binary statistics that is extracted from the ranking data. We carry out large-scale experiments, including m = 10, 000, to show that our uniformity testing algorithms scale gracefully with m.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Utility-preserving differentially private data releases via individual ranking microaggregation
    Sanchez, David
    Domingo-Ferrer, Josep
    Martinez, Sergio
    Soria-Comas, Jordi
    INFORMATION FUSION, 2016, 30 : 1 - 14
  • [32] Private Data Synthesis from Decentralized Non-IID Data
    Saleem, Muhammad Usama
    Fan, Liyue
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [33] Locally Private Hypothesis Testing
    Sheffet, Or
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [34] Testing disjointness of private datasets
    Kiayias, A
    Mitrofanova, A
    FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, 2005, 3570 : 109 - 124
  • [35] CONSUMER TESTING IN THE PRIVATE SECTOR
    SATTER, HL
    ASSOCIATION OF FOOD & DRUG OFFICIALS QUARTERLY BULLETIN, 1982, 46 (02): : 127 - 131
  • [36] Differentially Private ANOVA Testing
    Campbell, Zachary
    Bray, Andrew
    Ritz, Anna
    Groce, Adam
    2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, : 281 - 285
  • [37] Pricing private data
    Vasilis Gkatzelis
    Christina Aperjis
    Bernardo A. Huberman
    Electronic Markets, 2015, 25 : 109 - 123
  • [38] Pricing private data
    Gkatzelis, Vasilis
    Aperjis, Christina
    Huberman, Bernardo A.
    ELECTRONIC MARKETS, 2015, 25 (02) : 109 - 123
  • [39] Risk Ranking of Malaysian Public Private Partnership Projects
    Sarvari, Hadi
    Valipuor, Alireza
    Yahaya, Nordin
    Noor, Norhazilan Md
    STRUCTURAL, ENVIRONMENTAL, COASTAL AND OFFSHORE ENGINEERING, 2014, 567 : 613 - 618
  • [40] Bounds on the sample complexity for private learning and private data release
    Beimel, Amos
    Brenner, Hai
    Kasiviswanathan, Shiva Prasad
    Nissim, Kobbi
    MACHINE LEARNING, 2014, 94 (03) : 401 - 437