Automated diagnosis of data-model conflicts using metadata

被引:6
|
作者
Chen, RO [1 ]
Altman, RB [1 ]
机构
[1] Stanford Univ, Sch Med, Stanford, CA 94305 USA
关键词
D O I
10.1136/jamia.1999.0060374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The authors describe a methodology for helping computational biologists diagnose discrepancies they encounter between experimental data and the predictions of scientific models. The authors call these discrepancies data-model conflicts. They have built a prototype system to help scientists resolve these conflicts in a more systematic, evidence-based manner. In computational biology, data-model conflicts are the result of complex computations in which data and models are transformed and evaluated. Increasingly, the data, models, and tools employed in these computations come from diverse and distributed resources, contributing to a widening gap between the scientist and the original context in which these resources were produced. This contextual rift can contribute to the misuse of scientific data or tools and amplifies the problem of diagnosing data-model conflicts. The authors' hypothesis is that systematic collection of metadata about a computational process can help bridge the contextual rift and provide information for supporting automated diagnosis of these conflicts. The methodology involves three major steps. First, the authors decompose the data-model evaluation process into abstract functional components. Next, they use this process decomposition to enumerate the possible causes of the data-model conflict and direct the acquisition of diagnostically relevant metadata. Finally, they use evidence statically and dynamically generated from the metadata collected to identify the most likely causes of the given conflict. They describe how these methods are implemented in a knowledge-based system called GRENDEL and show how GRENDEL can be used to help diagnose conflicts between experimental data and computationally built structural models of the 30S ribosomal subunit.
引用
收藏
页码:374 / 392
页数:19
相关论文
共 50 条
  • [21] Fault Diagnosis of Pumped Storage Units-A Novel Data-Model Hybrid-Driven Strategy
    Bai, Jie
    Che, Chuanqiang
    Liu, Xuan
    Wang, Lixin
    He, Zhiqiang
    Xie, Fucai
    Dou, Bingjie
    Guo, Haonan
    Ma, Ruida
    Zou, Hongbo
    PROCESSES, 2024, 12 (10)
  • [22] Automated curation of spatial metadata in environmental monitoring data
    Mutlu, Ilhan
    Hackermueller, Joerg
    Schor, Jana
    ECOLOGICAL INFORMATICS, 2025, 86
  • [23] Diverse Ensemble Evolution: Curriculum Data-Model Marriage
    Zhou, Tianyi
    Wang, Shengjie
    Bilmes, Jeff A.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Data-model comparison of the Younger Dryas event: Discussion
    Renssen, H
    Isarin, RFB
    CANADIAN JOURNAL OF EARTH SCIENCES, 2001, 38 (03) : 477 - 478
  • [25] Fire at high latitudes: Data-model comparisons and their consequences
    Kantzas, Euripides
    Lomas, Mark
    Quegan, Shaun
    GLOBAL BIOGEOCHEMICAL CYCLES, 2013, 27 (03) : 677 - 691
  • [26] SEMANTIC DATA-MODEL REQUIREMENTS AND REALIZATION WITH A RELATIONAL DATA-STRUCTURE
    GRABOWSKI, H
    EIGNER, M
    COMPUTER-AIDED DESIGN, 1979, 11 (03) : 158 - 168
  • [27] On the identification of a Pliocene time slice for data-model comparison
    Haywood, Alan M.
    Dolan, Aisling M.
    Pickering, Steven J.
    Dowsett, Harry J.
    McClymont, Erin L.
    Prescott, Caroline L.
    Salzmann, Ulrich
    Hill, Daniel J.
    Hunter, Stephen J.
    Lunt, Daniel J.
    Pope, James O.
    Valdes, Paul J.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2013, 371 (2001):
  • [28] A conceptual view on data-model driven reverse engineering
    Borja, V
    Harding, JA
    Bell, R
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2001, 39 (04) : 667 - 687
  • [30] Data-model coupling driven stress field measurements
    Guangbo Wang
    Jian Zhao
    Jiahui Liu
    Dong Zhao
    Theoretical & Applied Mechanics Letters, 2024, 14 (04) : 280 - 290