My Facts Are not Your Facts: Data Wrangling as a Socially Negotiated Process, A Case Study in a Multisite Manufacturing Company

Cited by: 0
Authors
Eckert, Claudia [1]
Isaksson, Ola [2]
Hane-Hagstrom, Malin [3]
Eckert, Calandra [4]
Affiliations
[1] Open Univ, Sch Engn & Innovat, Walton Hall,Kents Hill, Milton Keynes MK7 6AA, Bucks, England
[2] Chalmers Univ Technol, Div Prod Dev, Dept Ind & Mat Sci, Chalmersplatsen 4, SE-41296 Gothenburg, Sweden
[3] Volvo Powertrain, Gropegardsgatan 2, SE-41715 Gothenburg, Sweden
[4] Ludwig Maximilians Univ Munchen, Geschwister Scholl Pl 1, D-80539 Munchen, Germany
Keywords
big data and analytics; human-computer interfaces; interactions; information management; manufacturing planning; BIG DATA; STATISTICAL LITERACY; DESIGN; COMMUNICATION; SCIENCE;
DOI
10.1115/1.4055953
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The conditions under which data wrangling is undertaken have a profound impact on the quality of the results of the data wrangling and analysis. This paper presents the results of an analysis of the sociotechnical aspects of a data wrangling activity in a large, multi-site global manufacturer. This activity was technically demanding, as operational data from multiple sources and formats needed to be integrated, but it also involved interaction with multiple stakeholders in different parts of the world, each with their own ways of collecting and structuring the data. The data had been captured previously for a different purpose. The clients were not aware that the data followed a different logic at the various sites and in some cases needed to be manually extracted and interpreted. The paper describes the data wrangling process and analyses the assumptions, goals, and biases of the different stakeholders. The analysis raises questions and insights about how data can be trusted and suggests that human intervention with data along the data wrangling process is often unintentional, tacit, and easily overlooked. It is suggested that contextual factors, such as data quality and the assessment of consequences when acting or making decisions on the new data set, be given greater attention during the specification of data wrangling assignments. The paper concludes with recommendations for data wrangling practitioners.
Pages: 12