A Visual-Interactive Idiom to Diagnose Missing Data Mechanisms

被引:1
|
作者
do Amor Divino Lima, Rodrigo Santos [1 ]
Oliveira de Araujo, Tiago Davi [1 ]
Resque dos Santos, Carlos Gustavo [1 ]
Meiguins, Bianchi Serique [1 ]
机构
[1] Fed Univ Para, PPGCC, LABVIS, Belem, Para, Brazil
关键词
missing values; data preprocessing; exploratory data analysis; IMPUTATION; REGRESSION;
D O I
10.1109/IV51561.2020.00027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With vast amounts of data, comes vast numbers of problems. The process of collecting data is far from perfect, either due to human factors or technological errors, which can lead to inaccuracies and uncertainties in the data. One such issue is missing data: the absence of information. Several methods can deal with missing values, but to choose the correct approach, it is necessary to diagnose the missing data mechanisms, which describe how the distribution of missingness in a given data variable correlates to other variables. This diagnosis can be made with statistical tests or data visualization techniques. However, statistical tests provide an uncertainty estimation that is often misinterpreted, and the visualizations readily available in data analysis packages have some scalability issues, such as cognitive overload and lack of screen space. Thus, this paper proposes a visual-interactive idiom for diagnosing missing data mechanisms. The proposed solution consists of a set of visual encodings and two derived metrics that synthesizes the missing data mechanisms and the uncertainty associated with this synthesis. We present the concepts behind the visual encodings, derived metrics, and interactions of the idiom.
引用
收藏
页码:109 / 113
页数:5
相关论文
共 50 条
  • [31] Missing data, part 2. Missing data mechanisms: Missing completely at random, missing at random, missing not at random, and why they matter
    Tra My Pham
    Pandis, Nikolaos
    White, Ian R.
    [J]. AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2022, 162 (01) : 138 - 139
  • [32] Missing data, part 2. Missing data mechanisms: Missing completely at random, missing at random, missing not at random, and why they matter
    Tra My Pham
    Pandis, Nikolaos
    White, Ian R.
    [J]. AMERICAN JOURNAL OF OPHTHALMOLOGY, 2022, 162 (01) : 138 - 139
  • [33] Diagnose the mild cognitive impairment by constructing Bayesian network with missing data
    Sun, Yan
    Tang, Yiyuan
    Ding, Shuxue
    Lv, Shipin
    Cui, Yifen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 442 - 449
  • [34] I4TSPS: a Visual-Interactive Web System for Industrial Time-Series Pre-processing
    Villalobos, Kevin
    Vadillo, Jon
    Diez, Borja
    Calvo, Borja
    Illarramendi, Arantza
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2012 - 2018
  • [35] Sensitivity analyses for trials with missing data, assuming missing not at random mechanisms
    Baptiste Leurent
    Mike Crawford
    Hazel Gilbert
    Richard Morris
    Mike Sweeting
    Irwin Nazareth
    [J]. Trials, 14 (Suppl 1)
  • [36] Interactive visual mechanisms for exploring source code evolution
    Telea, Alexandru
    Voinea, Lucian
    [J]. 3RD IEEE INTERNATIONAL WORKSHOP ON VISUALIZING SOFTWARE FOR UNDERSTANDING AND ANALYSIS, PROCEEEDINGS, 2005, : 52 - 57
  • [37] INTERACTIVE VISUAL DATA ABSTRACTION IN A DECLARATIVE VISUAL PROGRAMMING LANGUAGE
    BURNETT, MM
    AMBLER, AL
    [J]. JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 1994, 5 (01): : 29 - 60
  • [38] SEQUENTIAL IDENTIFICATION OF NONIGNORABLE MISSING DATA MECHANISMS
    Sadinle, Mauricio
    Reiter, Jerome P.
    [J]. STATISTICA SINICA, 2018, 28 (04) : 1741 - 1759
  • [39] Interactive Visual Analysis of Police Records Data
    Ning, Xinyu
    Sun, Guodao
    Jin, Lu
    Ding, Weijie
    Liang, Ronghua
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (07): : 1064 - 1076
  • [40] Visual transformation for interactive spatiotemporal data mining
    Yang Cai
    Richard Stumpf
    Timothy Wynne
    Michelle Tomlinson
    Daniel Sai Ho Chung
    Xavier Boutonnier
    Matthias Ihmig
    Rafael Franco
    Nathaniel Bauernfeind
    [J]. Knowledge and Information Systems, 2007, 13 : 119 - 142