Assessing Completeness of IoT Data: A Novel Probabilistic Approach

被引:0
|
作者
Klier, Mathias [1 ]
Moestue, Lars [1 ]
Obermeier, Andreas [1 ]
Widmann, Torben [1 ]
机构
[1] Univ Ulm, Inst Business Analyt, Helmholtz Str 22, D-89081 Ulm, Germany
关键词
Data quality; Data quality assessment; Completeness; Internet of Things; Probability-based metric; INDUSTRY; 4.0; INFORMATION QUALITY; INTERNET; THINGS; IDENTIFICATION; METHODOLOGY; CHALLENGES; FRAMEWORK; OUTLIERS; IMPROVE;
D O I
10.1007/s12599-024-00889-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet of Things (IoT) is one of the driving forces behind Industry 4.0 and has the potential to improve the entire value chain, especially in the context of industrial manufacturing. However, results derived from IoT data are only viable if a high level of data quality is maintained. Thereby, completeness is especially critical, as incomplete data is one of the most common and costly data quality defects in the IoT context. Nevertheless, existing approaches for assessing the completeness of IoT data are limited in their applicability because they assume a known number of real-world entities or that the real-world entities appear in regular patterns. Thus, they cannot handle the uncertainty regarding the number of real-world entities typically present in the IoT context. Against this background, the paper proposes a novel, probability-based metric that addresses these issues and provides interpretable metric values representing the probability that an IoT database is complete. This probability is assessed based on the detection of outliers regarding the deviation between the estimated number of real-world entities and the number of digital entities. The evaluation with IoT data from a German car manufacturer demonstrates that the provided metric values are useful and informative and can discriminate well between complete and incomplete IoT data. The metric has the potential to reduce the cost, time, and effort associated with incomplete IoT data, providing tangible benefits in real-world applications.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] A probabilistic approach for assessing the crosswind stability of ground vehicles
    Proppe, Carsten
    Wetzel, Christian
    [J]. VEHICLE SYSTEM DYNAMICS, 2010, 48 : 411 - 428
  • [42] A Probabilistic Approach to Assessing and Predicting the Failure of Notched Components
    Muniz-Calvente, Miguel
    Venta-Vinuela, Lucas
    Alvarez-Vazquez, Adrian
    Fernandez Fernandez, Pelayo
    Lamela Rey, Maria Jesus
    Fernandez Canteli, Alfonso
    [J]. MATERIALS, 2019, 12 (24)
  • [43] Indoor localization by a novel probabilistic approach
    Fang, Shih-Hau
    Lin, Pochiang
    Lin, Tsung-Nan
    [J]. 2007 IEEE 8TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, VOLS 1 AND 2, 2007, : 221 - 224
  • [44] Probabilistic approach to assessing and monitoring settlements caused by tunneling
    Camos, Caries
    Spackova, Olga
    Straub, Daniel
    Molins, Climent
    [J]. TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2016, 51 : 313 - 325
  • [45] A probabilistic XML approach to data integration
    van Keulen, M
    de Keijzer, A
    Alink, W
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 459 - 470
  • [46] A probabilistic approach for SeaWinds data assimilation
    Portabella, M
    Stoffelen, A
    [J]. QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2004, 130 (596) : 127 - 152
  • [47] A Probabilistic Approach for Missing Data Imputation
    Arefin, Muhammed Nazmul
    Masum, Abdul Kadar Muhammad
    [J]. COMPLEXITY, 2024, 2024
  • [48] A Completeness Theorem for Probabilistic Regular Expressions
    Rozowski, Wojciech
    Silva, Alexandra
    [J]. PROCEEDINGS OF THE 39TH ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, LICS 2024, 2024,
  • [49] A probabilistic approach for cleaning RFID data
    Ziekow, Holger
    Ivantysynova, Lenka
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 106 - +
  • [50] A probabilistic XML approach to data integration
    [J]. Van Keulen, M. (m.vankeulen@utwente.nl), IEEE Computer Society; The Database Society of Japan, DBSJ; Information Processing Society of Japan, IPSJ; Institute of Electronics, Info. Commun. Engineers, IEICE (Institute of Electrical and Electronics Engineers Computer Society):