Uncovering Data Landscapes through Data Reconnaissance and Task Wrangling

被引:0
|
作者
Crisan, Anamaria [1 ]
Munzner, Tamara [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Human-centered computing; Visualization; Visualization design and evaluation methods; DESIGN; VISUALIZATION; REFLECTIONS;
D O I
10.1109/visual.2019.8933542
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Domain experts are inundated with new and heterogeneous types of data and require better and more specific types of data visualization systems to help them. In this paper, we consider the data landscape that domain experts seek to understand, namely the set of datasets that are either currently available or could be obtained. Experts need to understand this landscape to triage which data analysis projects might be viable, out of the many possible research questions that they could pursue. We identify data reconnaissance and task wrangling as processes that experts undertake to discover and identify sources of data that could be valuable for some specific analysis goal. These processes have thus far not been formally named or defined by the research community. We provide formal definitions of data reconnaissance and task wrangling and describe how they relate to the data landscape that domain experts must uncover. We propose a conceptual framework with a four-phase cycle of acquire, view, assess, and pursue that occurs within three distinct chronological stages, which we call fog and friction, informed data ideation, and demarcation of final data. Collectively, these four phases embedded within three temporal stages delineate an expert's progressively evolving understanding of the data landscape. We describe and provide concrete examples of these processes within the visualization community through an initial systematic analysis of previous design studies, identifying situations where there is evidence that they were at play. We also comment on the response of domain experts to this framework, and suggest design implications stemming from these processes to motivate future research directions. As technological changes will only keep adding unknown terrain to the data landscape, data reconnaissance and task wrangling are important processes that need to be more widely understood and supported by the data visualization tools. By articulating a concrete understanding of this challenge and its implications, our work impacts the design and evaluation of data visualization systems.
引用
收藏
页码:46 / 50
页数:5
相关论文
共 50 条
  • [1] Big data: Data wrangling
    Goldston, David
    [J]. NATURE, 2008, 455 (7209) : 15 - 15
  • [2] Big data: Data wrangling
    David Goldston
    [J]. Nature, 2008, 455 : 15 - 15
  • [3] Data Context Informed Data Wrangling
    Koehler, Martin
    Bogatu, Alex
    Civili, Cristina
    Konstantinou, Nikolaos
    Abel, Edward
    Fernandes, Alvaro A. A.
    Keane, John
    Libkin, Leonid
    Paton, Norman W.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 956 - 963
  • [4] Fairness in Data Wrangling
    Mazilu, Lacramioara
    Paton, Norman W.
    Konstantinou, Nikolaos
    Fernandes, Alvaro A. A.
    [J]. 2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 341 - 348
  • [5] Data Wrangling: Making data useful again
    Ender, Florian
    Piringer, Harald
    [J]. IFAC PAPERSONLINE, 2015, 48 (01): : 111 - +
  • [6] Wrangling Categorical Data in R
    McNamara, Amelia
    Horton, Nicholas J.
    [J]. AMERICAN STATISTICIAN, 2018, 72 (01): : 97 - 104
  • [7] Data wrangling practices and collaborative interactions with aggregated data
    Shiyan Jiang
    Jennifer Kahn
    [J]. International Journal of Computer-Supported Collaborative Learning, 2020, 15 : 257 - 281
  • [8] Data Wrangling in Database Systems: Purging of Dirty Data
    Azeroual, Otmane
    [J]. DATA, 2020, 5 (02) : 1 - 9
  • [9] Data wrangling practices and collaborative interactions with aggregated data
    Jiang, Shiyan
    Kahn, Jennifer
    [J]. INTERNATIONAL JOURNAL OF COMPUTER-SUPPORTED COLLABORATIVE LEARNING, 2020, 15 (03) : 257 - 281
  • [10] Introducing Data Science to Undergraduates through Big Data: Answering Questions by Wrangling and Profiling a Yelp Dataset
    Jensen, Scott
    [J]. PROCEEDINGS OF THE 50TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2017, : 1033 - 1042