A new estimator of intrinsic dimension based on the multipoint Morisita index

被引:13
|
作者
Golay, Jean [1 ]
Kanevski, Mikhail [1 ]
机构
[1] Univ Lausanne, Fac Geosci & Environm, Inst Earth Surface Dynam, CH-1015 Lausanne, Switzerland
关键词
Intrinsic dimension; Multipoint Morisita index; Fractal dimension; Multifractality; Dimensionality reduction; FEATURE-SELECTION; GENERALIZED DIMENSIONS;
D O I
10.1016/j.patcog.2015.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The size of datasets has been increasing rapidly both in terms of number of variables and number of events. As a result, the empty space phenomenon and the curse of dimensionality complicate the extraction of useful information. But, in general, data lie on non-linear manifolds of much lower dimension than that of the spaces in which they are embedded. In many pattern recognition tasks, learning these manifolds is a key issue and it requires the knowledge of their true intrinsic dimension. This paper introduces a new estimator of intrinsic dimension based on the multipoint Morisita index. It is applied to both synthetic and real datasets of varying complexities and comparisons with other existing estimators are carried out. The proposed estimator turns out to be fairly robust to sample size and noise, unaffected by edge effects, able to handle large datasets and computationally efficient. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4070 / 4081
页数:12
相关论文
共 50 条
  • [1] Unsupervised feature selection based on the Morisita estimator of intrinsic dimension
    Golay, Jean
    Kanevski, Mikhail
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 125 - 134
  • [2] Feature selection for regression problems based on the Morisita estimator of intrinsic dimension
    Golay, Jean
    Leuenberger, Michael
    Kanevski, Mikhail
    [J]. PATTERN RECOGNITION, 2017, 70 : 126 - 138
  • [3] The multipoint Morisita index for the analysis of spatial patterns
    Golay, Jean
    Kanevski, Mikhail
    Orozco, Carmen D. Vega
    Leuenberger, Michael
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2014, 406 : 191 - 202
  • [4] The generalized ratios intrinsic dimension estimator
    Francesco Denti
    Diego Doimo
    Alessandro Laio
    Antonietta Mira
    [J]. Scientific Reports, 12
  • [5] The generalized ratios intrinsic dimension estimator
    Denti, Francesco
    Doimo, Diego
    Laio, Alessandro
    Mira, Antonietta
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [6] OPTIMIZED INTRINSIC DIMENSION ESTIMATOR USING NEAREST NEIGHBOR GRAPHS
    Sricharan, Kumar
    Raich, Raviv
    Hero, Alfred O., III
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5418 - 5421
  • [7] A New Approach for Interpreting the Morisita Index of Aggregation through Quadrat Size
    Hayes, James J.
    Castillo, Oscar
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (10):
  • [8] A New Regression-Based Tail Index Estimator
    Nicolau, Joao
    Rodrigues, Paulo M. M.
    [J]. REVIEW OF ECONOMICS AND STATISTICS, 2019, 101 (04) : 667 - 680
  • [9] A New Estimator for a Tail Index
    V. Paulauskas
    [J]. Acta Applicandae Mathematica, 2003, 79 : 55 - 67
  • [10] A new estimator for a tail index
    Paulauskas, V
    [J]. ACTA APPLICANDAE MATHEMATICAE, 2003, 79 (1-2) : 55 - 67