A New Measure of the Cluster Hypothesis

被引:0
|
作者
Smucker, Mark D. [1 ]
Allan, James [2 ]
机构
[1] Univ Waterloo, Dept Management Sci, Waterloo, ON N2L 3G1, Canada
[2] Univ Massachusetts Amherst, Ctr Intelligent Informat Retrieval, Dept Comp Sci, Amherst, MA 01003 USA
来源
关键词
Cluster hypothesis; nearest neighbor test; relevant document networks; normalized mean reciprocal distance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have found that the nearest neighbor (NN) test is an insufficient measure of the cluster hypothesis. The NN test is a local measure of the cluster hypothesis. Designers of new document-to-document similarity measures may incorrectly report effective clustering of relevant documents if they use the NN test alone. Utilizing a measure from net;work analysis, we present a new, global measure of the cluster hypothesis: normalized mean reciprocal distance. When used together with a, local measure, such as the NN test, this new global measure allows researchers to better measure the cluster hypothesis.
引用
收藏
页码:281 / +
页数:2
相关论文
共 50 条
  • [1] New tests of the cluster entropy floor hypothesis
    McCarthy, IG
    Babul, A
    Balogh, ML
    Holder, GP
    [J]. GALAXY EVOLUTION: THEORY AND OBSERVATIONS, 2003, 17 : 315 - 316
  • [2] Cluster ensemble selection based on a new cluster stability measure
    Alizadeh, Hosein
    Minaei-Bidgoli, Behrouz
    Parvin, Hamid
    [J]. INTELLIGENT DATA ANALYSIS, 2014, 18 (03) : 389 - 408
  • [3] New hypothesis distinctiveness measure for better ellipse extraction
    Wang, Cuilan
    Newman, Timothy S.
    Cao, Chunguang
    [J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2007, 4633 : 176 - 186
  • [4] ON INNATENESS: THE CLUTTER HYPOTHESIS AND THE CLUSTER HYPOTHESIS
    Mameli, Matteo
    [J]. JOURNAL OF PHILOSOPHY, 2008, 105 (12): : 719 - 736
  • [5] Hypothesis for a new method to measure the dynamic patterns of tissue injury
    Dioguardi, Nicola
    [J]. MEDICAL HYPOTHESES, 2011, 77 (06) : 1022 - 1027
  • [6] A new cluster validity measure for bioinformatics relational datasets
    Popescu, Mihail
    Bezdek, James C.
    Keller, James M.
    Havens, Timothy C.
    Huband, Jacalyn M.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 726 - +
  • [7] A New Measure of Cluster Validity Using Line Symmetry
    Chou, Chien-Hsing
    Hsieh, Yi-Zeng
    Su, Mu-Chun
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (02) : 443 - 461
  • [8] A new cluster validity index using maximum cluster spread based compactness measure
    Wani, M. Arif
    Riyaz, Romana
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2016, 9 (02) : 179 - 204
  • [9] Cluster goodness: A new measure of performance for cluster formation in the design of cellular manufacturing systems
    Nair, GJ
    Narendran, TT
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 1997, 48 (01) : 49 - 61
  • [10] A new cluster validity measure and its application to image compression
    Chou, CH
    Su, MC
    Lai, E
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2004, 7 (02) : 205 - 220