A new imputation method for incomplete binary data

被引:8
|
作者
Subasi, Munevver Mine [2 ]
Subasi, Ersoy [1 ]
Anthony, Martin [3 ]
Hammer, Peter L.
机构
[1] Rutgers Ctr Operat Res, RUTCOR, Piscataway, NJ 08854 USA
[2] Florida Inst Technol, Dept Math Sci, Melbourne, FL 32901 USA
[3] Univ London London Sch Econ & Polit Sci, Dept Math, London WC2A 2AE, England
关键词
Imputation; Boolean similarity measure; MISSING DATA MECHANISM; LOGICAL ANALYSIS; LIKELIHOOD; REGRESSION; MODELS;
D O I
10.1016/j.dam.2011.01.024
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In data analysis problems where the data are represented by vectors of real numbers, it is often the case that some of the data-points will have "missing values", meaning that one or more of the entries of the vector that describes the data-point is not observed. In this paper, we propose a new approach to the imputation of missing binary values. The technique we introduce employs a "similarity measure" introduced by Anthony and Hammer (2006) [1]. We compare experimentally the performance of our technique with ones based on the usual Hamming distance measure and multiple imputation. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1040 / 1047
页数:8
相关论文
共 50 条
  • [1] A general method for elicitation, imputation, and sensitivity analysis for incomplete repeated binary data
    Tompsett, Daniel
    Sutton, Stephen
    Seaman, Shaun R.
    White, Ian R.
    [J]. STATISTICS IN MEDICINE, 2020, 39 (22) : 2921 - 2935
  • [2] A Hybrid Method for Incomplete Data Imputation
    Zhao, Liang
    Chen, Zhikui
    Yang, Zhennan
    Hu, Yueming
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1725 - 1730
  • [3] Analysis of incomplete longitudinal binary data using multiple imputation
    Li, Xiaoming
    Mehrotra, Devan V.
    Barnard, John
    [J]. STATISTICS IN MEDICINE, 2006, 25 (12) : 2107 - 2124
  • [4] A comparison of multiple imputation methods for incomplete longitudinal binary data
    Yamaguchi, Yusuke
    Misumi, Toshihiro
    Maruo, Kazushi
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2018, 28 (04) : 645 - 667
  • [5] Robustness of a multivariate normal approximation for imputation of incomplete binary data
    Bernaards, Coen A.
    Belin, Thomas R.
    Schafer, Joseph L.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (06) : 1368 - 1382
  • [6] An Imputation-Based Method for Fuzzy Clustering of Incomplete Data
    Soni, S.
    Sharma, I.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 616 - 621
  • [7] A Correlation Based Imputation Method for Incomplete Traffic Accident Data
    Deb, Rupam
    Liew, Alan Wee-Chung
    Oh, Erwin
    [J]. PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 905 - 912
  • [8] An improved imputation method for incomplete GxE trial data for asparagus
    Nichols, MA
    Godfrey, AR
    Wood, GJR
    Qiao, CG
    Ganesalingam, S
    [J]. PROCEEDINGS OF THE TENTH INTERNATIONAL ASPARAGUS SYMPOSIUM, 2002, (589): : 111 - 116
  • [9] Marginal analysis of incomplete longitudinal binary data: A cautionary note on LOCF imputation
    Cook, RJ
    Zeng, LL
    Yi, GY
    [J]. BIOMETRICS, 2004, 60 (03) : 820 - 828
  • [10] Multiple imputation for high-dimensional mixed incomplete continuous and binary data
    He, Ren
    Belin, Thomas
    [J]. STATISTICS IN MEDICINE, 2014, 33 (13) : 2251 - 2262