Examining data visualization pitfalls in scientific publications

被引:0
|
作者
Vinh T Nguyen
Kwanghee Jung
Vibhuti Gupta
机构
[1] TNU – University of Information and Communication Technology,Department of Information Technology
[2] Texas Tech University,Department of Educational Psychology, Leadership, and Counseling
[3] Meharry Medical College,Department of Computer Science and Data Science
关键词
Data visualization; Graphical representations; Misinformation; Visual encodings; Association rule mining; Word cloud; Cochran’s Q test; McNemar’s test;
D O I
暂无
中图分类号
学科分类号
摘要
Data visualization blends art and science to convey stories from data via graphical representations. Considering different problems, applications, requirements, and design goals, it is challenging to combine these two components at their full force. While the art component involves creating visually appealing and easily interpreted graphics for users, the science component requires accurate representations of a large amount of input data. With a lack of the science component, visualization cannot serve its role of creating correct representations of the actual data, thus leading to wrong perception, interpretation, and decision. It might be even worse if incorrect visual representations were intentionally produced to deceive the viewers. To address common pitfalls in graphical representations, this paper focuses on identifying and understanding the root causes of misinformation in graphical representations. We reviewed the misleading data visualization examples in the scientific publications collected from indexing databases and then projected them onto the fundamental units of visual communication such as color, shape, size, and spatial orientation. Moreover, a text mining technique was applied to extract practical insights from common visualization pitfalls. Cochran’s Q test and McNemar’s test were conducted to examine if there is any difference in the proportions of common errors among color, shape, size, and spatial orientation. The findings showed that the pie chart is the most misused graphical representation, and size is the most critical issue. It was also observed that there were statistically significant differences in the proportion of errors among color, shape, size, and spatial orientation.
引用
收藏
相关论文
共 50 条
  • [21] Lexicon Visualization Library and Java']JavaScript for Scientific Data Visualization
    Tanyalcin, Ibrahim
    Al Assaf, Carla
    Ferte, Julien
    Ancien, Francois
    Khan, Taushif
    Smits, Guillaume
    Rooman, Marianne
    Vranken, Wim
    COMPUTING IN SCIENCE & ENGINEERING, 2018, 20 (01) : 50 - 65
  • [22] Semantic Annotation of Data Processing Pipelines in Scientific Publications
    Mesbah, Sepideh
    Fragkeskos, Kyriakos
    Lofi, Christoph
    Bozzon, Alessandro
    Houben, Geert-Jan
    SEMANTIC WEB ( ESWC 2017), PT I, 2017, 10249 : 321 - 336
  • [23] Improving Scientific Publications and Public Trust by Data Access
    Donald F Klein
    Neuropsychopharmacology, 2002, 26 : 696 - 697
  • [24] SPedia: A Semantics Based Repository of Scientific Publications Data
    Aslam, Muhammad Ahtisham
    Aljohani, Naif Radi
    WEB-AGE INFORMATION MANAGEMENT, PT I, 2016, 9658 : 479 - 490
  • [25] Improving scientific publications and public trust by data access
    Klein, DF
    NEUROPSYCHOPHARMACOLOGY, 2002, 26 (05) : 696 - 697
  • [26] Publications in Scientific Events as a Data Source for Scientometric Analysis
    Coimbra, Fernanda Silva
    Rodrigues Dias, Thiago Magela
    DATA AND INFORMATION IN ONLINE ENVIRONMENTS, DIONE 2022, 2022, 452 : 49 - 59
  • [27] Measuring the impact of clinical data in terms of data citations by scientific publications
    Bai, Yongmei
    Du, Jian
    18TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI2021), 2021, : 71 - 80
  • [28] Visualization of Dynamic Adaptive Resolution Scientific Data
    Foulks, Andrew
    Bergeron, R. Daniel
    Vohr, Samuel H.
    VISUALIZATION AND DATA ANALYSIS 2011, 2011, 7868
  • [29] Visualization of Multi-Variate Scientific Data
    Fuchs, R.
    Hauser, H.
    COMPUTER GRAPHICS FORUM, 2009, 28 (06) : 1670 - 1690
  • [30] 'SCIENTIFIC DATA VISUALIZATION: FOCUS ON (POSTER) PRESENTATION
    Boers, Maarten
    ANNALS OF THE RHEUMATIC DISEASES, 2019, 78 : 27 - 27