Standard data mining procedures are sensitive to the presence of outlying measurements in the data. Therefore, robust data mining procedures are highly desirable, which are resistant to outliers. This work has the aim to propose new robust classification procedures for high-dimensional data and algorithms for their efficient computation. Particularly, we use the idea of implicit weights assigned to individual observation to propose several robust regularized versions of linear discriminant analysis (LDA), suitable for data with the number of variables exceeding the number of observations. The approach is based on a regularized version of the minimum weighted covariance determinant (MWCD) estimator and represents a unique attempt to combine regularization and high robustness, allowing to down-weight outlying observations. Classification performance of new methods is illustrated on real fMRI data acquired in neuroscience research.
机构:
Nankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Xinjiang, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Wang, Jiyang
Liang, Wanfeng
论文数: 0引用数: 0
h-index: 0
机构:
Dongbei Univ Finance & Econ, Sch Data Sci & Artificial Intelligence, Dalian 116025, Liaoning, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Liang, Wanfeng
Li, Lijie
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Li, Lijie
Wu, Yue
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
Wu, Yue
Ma, Xiaoyan
论文数: 0引用数: 0
h-index: 0
机构:
Ningxia Univ, Sch Math & Stat, Yinchuan 750021, Ningxia, Peoples R ChinaNankai Univ, Sch Stat & Data Sci, Tianjin 300071, Peoples R China