Examining data visualization pitfalls in scientific publications

被引:0
|
作者
Vinh T Nguyen
Kwanghee Jung
Vibhuti Gupta
机构
[1] TNU – University of Information and Communication Technology,Department of Information Technology
[2] Texas Tech University,Department of Educational Psychology, Leadership, and Counseling
[3] Meharry Medical College,Department of Computer Science and Data Science
关键词
Data visualization; Graphical representations; Misinformation; Visual encodings; Association rule mining; Word cloud; Cochran’s Q test; McNemar’s test;
D O I
暂无
中图分类号
学科分类号
摘要
Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran’s Q test and McNemar’s test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.
引用
收藏
相关论文
共 50 条
  • [31] RESEARCH ISSUES IN DATA MODELING FOR SCIENTIFIC VISUALIZATION
    NIELSON, GM
    BRUNET, P
    GROSS, M
    HAGEN, H
    KLIMENKO, SV
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1994, 14 (02) : 70 - 73
  • [32] Tricubic Interpolation in Scientific Data Visualization Problems
    Kondybayeva, A. B.
    Solodov, S., V
    2019 WAVE ELECTRONICS AND ITS APPLICATION IN INFORMATION AND TELECOMMUNICATION SYSTEMS (WECONF), 2019,
  • [33] SCIENTIFIC VISUALIZATION TECHNIQUES FOR DISCRETE SCALAR DATA
    Yervilla Herrera, Heikel
    Reyes Lopez, Yaidel
    Viamontes Esquivel, Alcides
    Recarey Morfa, Carlos A.
    REVISTA INTERNACIONAL DE METODOS NUMERICOS PARA CALCULO Y DISENO EN INGENIERIA, 2010, 26 (01): : 39 - 46
  • [34] Parallel processing speeds visualization of scientific data
    Staley, Stephanie
    Bahrami, Ali
    Scientific Computing and Instrumentation, 2003, 20 (03): : 32 - 36
  • [35] Linking Performance Data into Scientific Visualization Tools
    Huck, Kevin A.
    Potter, Kristin
    Jacobsen, Doug W.
    Childs, Hank
    Malony, Allen D.
    2014 FIRST WORKSHOP ON VISUAL PERFORMANCE ANALYSIS (VPA), 2014, : 50 - 57
  • [36] A collaborative framework for scientific data analysis and visualization
    Ekanayake, Jaliya
    Pallickara, Shrideep
    Fox, Geoffrey
    PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON COLLABORATIVE TECHNOLOGIES AND SYSTEMS: CTS 2008, 2008, : 339 - 346
  • [37] Publications The pitfalls of free access
    不详
    BIOFUTUR, 2012, (336) : 10 - 10
  • [38] Preface: Visualization and data analytics for scientific discovery
    Childs, Hank
    Cappello, Franck
    PARALLEL COMPUTING, 2016, 55 : 1 - 1
  • [39] Narrative scientific data visualization in an immersive environment
    Liu, Richen
    Wang, Hailong
    Zhang, Chuyu
    Chen, Xiaojian
    Wang, Lijun
    Ji, Genlin
    Zhao, Bin
    Mao, Zhiwei
    Yang, Dan
    BIOINFORMATICS, 2021, 37 (14) : 2033 - 2041
  • [40] Examining inequality in scientific production: a focus on critical care publications and global economic disparities
    Daltro-Oliveira, Renato
    Quintairos, Amanda
    Santos, Laura I. Oliveira
    Salluh, Jorge I. Figueira
    Nassar Jr, Antonio P.
    INTENSIVE CARE MEDICINE, 2024, : 1538 - 1540