Missing Value Imputation Using Correlation Coefficient

被引:1
|
作者
Manna, Sweta [1 ]
Pati, Soumen Kumar [2 ]
机构
[1] Maulana Abul Kalam Azad Univ Technol, Dept Comp Sci & Engn, Nadia, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Bioinformat, Nadia, W Bengal, India
关键词
Microarray data; Missing value imputation; Similarity measurement; Correlation coefficient;
D O I
10.1007/978-981-15-2449-3_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing values of microarray dataset are imputed with the help of gene expression sample values. The process by which missing values are calculated is the mean of gene expression sample values and then discretized the sample values. Those discretized values are used to find the similarities between gene expressions with missing value-related genes and genes with no missing values. The gene from without missing values which is most similar of each missing value-related gene is selected, and Pearson's correlation coefficient of the identified gene with all no missing valuerelated genes is calculated. Now, the genes which have higher correlation coefficient with respect to a threshold value are identified. At last, the missing position of the gene is imputed with the mean expression values of the no missing value-related genes which are selected based on correlation coefficient values.
引用
收藏
页码:551 / 558
页数:8
相关论文
共 50 条
  • [1] Missing value imputation using genetic algorithm
    Hengpraphrom, Kairijng
    Wlchian, Sageemas Na
    Meesad, Phayijng
    ICIC Express Letters, 2011, 5 (02): : 355 - 360
  • [2] Optimization of Missing Value Imputation using Reinforcement Programming
    Rachmawan, Irene Erlyn Wina
    Barakbah, Ali Ridho
    2015 International Electronics Symposium (IES), 2015, : 128 - 133
  • [3] A Review On Missing Value Estimation Using Imputation Algorithm
    Armina, Roslan
    Zain, Azlan Mohd
    Ali, Nor Azizah
    Sallehuddin, Roselina
    6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL MATHEMATICS (ICCSCM 2017), 2017, 892
  • [4] Imputation methods of missing data for estimating the population mean using simple random sampling with known correlation coefficient
    Al-Omari, Amer Ibrahim
    Bouza, Carlos N.
    Herrera, Carmelo
    QUALITY & QUANTITY, 2013, 47 (01) : 353 - 365
  • [5] Imputation methods of missing data for estimating the population mean using simple random sampling with known correlation coefficient
    Amer Ibrahim Al-Omari
    Carlos N. Bouza
    Carmelo Herrera
    Quality & Quantity, 2013, 47 : 353 - 365
  • [6] Missing value imputation on missing completely at random data using multilayer perceptrons
    Silva-Ramirez, Esther-Lydia
    Pino-Mejias, Rafael
    Lopez-Coello, Manuel
    Cubiles-de-la-Vega, Maria-Dolores
    NEURAL NETWORKS, 2011, 24 (01) : 121 - 129
  • [7] Impact of Missing Data on Correlation Coefficient Values: Deletion and Imputation Methods for Data Preparation
    Shantal, Mohammed
    Othman, Zalinda
    Abu Bakar, Azuraliza
    MALAYSIAN JOURNAL OF FUNDAMENTAL AND APPLIED SCIENCES, 2023, 19 (06): : 1052 - 1067
  • [8] Missing value imputation using unsupervised machine learning techniques
    Raja, P. S.
    Thangavel, K.
    SOFT COMPUTING, 2020, 24 (06) : 4361 - 4392
  • [9] Missing value imputation using unsupervised machine learning techniques
    P. S. Raja
    K. Thangavel
    Soft Computing, 2020, 24 : 4361 - 4392
  • [10] Missing value imputation method based on correlation analysis and Gaussian mixture model
    Zhang, Jie
    Chang, Yuqing
    Wang, Ran
    Wang, Fuli
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024,