An improved asymptotic test for the Jaccard similarity index for binary data

被引:8
|
作者
Koeneman, Scott H. H. [1 ]
Cavanaugh, Joseph E. E. [1 ]
机构
[1] Univ Iowa, Coll Publ Hlth, Dept Biostat, 145 N Riverside Dr, Iowa City, IA 52242 USA
关键词
Association measures; Bernoulli distribution; Binary data; Jaccard index; Multinomial distribution;
D O I
10.1016/j.spl.2022.109375
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For paired binary data, we propose a new asymptotic test of independence for the Jaccard index. As demonstrated, the test offers marked improvements in maintaining nominal Type I error rates, and exhibits higher power when these error rates are comparable. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] On the Jaccard similarity test
    Ivchenko G.I.
    Honov S.A.
    Journal of Mathematical Sciences, 1998, 88 (6) : 789 - 794
  • [2] New Similarity Correlation Functions for Sets and Binary Data based on Jaccard Similarity Measure
    Batyrshin, Ildar
    Rudas, Imre
    18TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS, SACI 2024, 2024, : 145 - 149
  • [3] DISTRIBUTIONAL PROPERTIES OF JACCARD INDEX OF SIMILARITY
    MCCORMICK, WP
    LYONS, NI
    HUTCHESON, K
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1992, 21 (01) : 51 - 68
  • [4] The probabilistic basis of Jaccard's index of similarity
    Real, R
    Vargas, JM
    SYSTEMATIC BIOLOGY, 1996, 45 (03) : 380 - 385
  • [5] Jaccard/Tanimoto similarity test and estimation methods
    Chung, Neo Christopher
    Miasojedow, Blażej
    Startek, Michal
    Gambin, Anna
    arXiv, 2019,
  • [6] A modification of the Jaccard-Tanimoto similarity index for diverse selection of chemical compounds using binary strings
    Fligner, MA
    Verducci, JS
    Blower, PE
    TECHNOMETRICS, 2002, 44 (02) : 110 - 119
  • [7] Improving Jaccard Index for Measuring Similarity in Collaborative Filtering
    Lee, Soojung
    INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 799 - 806
  • [8] On the Jaccard Index Similarity Measure in Ranking Fuzzy Numbers
    Ramli, Nazirah
    Mohamad, Daud
    MATEMATIKA, 2009, 25 (02) : 157 - 165
  • [9] Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data
    Chung, Neo Christopher
    Miasojedow, Blazej
    Startek, Michal
    Gambin, Anna
    BMC BIOINFORMATICS, 2019, 20 (Suppl 15)
  • [10] Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data
    Neo Christopher Chung
    BłaŻej Miasojedow
    Michał Startek
    Anna Gambin
    BMC Bioinformatics, 20