DataFoundry: Information management for scientific data

被引:21
|
作者
Critchlow, T [1 ]
Fidelis, K [1 ]
Ganesh, M [1 ]
Musick, R [1 ]
Slezak, T [1 ]
机构
[1] Univ Calif Lawrence Livermore Natl Lab, Ctr Appl Sci Comp, Livermore, CA 94550 USA
关键词
databases; data warehouses; informatics; integration; meta-data;
D O I
10.1109/4233.826859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data warehouses and data marts have been successfully applied to a multitude of commercial business applications. They have proven to be invaluable tools by integrating information from distributed, heterogeneous sources and summarizing this data for use throughout the enterprise. Although the need for information dissemination is as vital in science as in business, working warehouses in this community are scarce because traditional warehousing techniques do not transfer to scientific environments. There are two primary reasons for this difficulty, First, schema integration is more difficult for scientific databases than for business sources, because of the complexity of the concepts and the associated relationships. While this difference has not yet been fully explored, it is an important consideration when determining how to integrate autonomous sources. Second, scientific data sources have highly dynamic data representations (schemata), When a data source participating in a warehouse changes its schema, both the mediator transferring data to the warehouse and the warehouse itself need to be updated to reflect these modifications. The cost of repeatedly performing these updates in a traditional warehouse, as is required in a dynamic environment, is prohibitive. This paper discusses these issues within the context of the DataFoundry project, an ongoing research effort at Lawrence Livermore National Laboratory. DataFoundry utilizes a unique integration strategy to identify corresponding instances while maintaining differences between data from different sources, and a novel architecture and an extensive meta-data infrastructure, which reduce the cost of maintaining a warehouse.
引用
收藏
页码:52 / 57
页数:6
相关论文
共 50 条
  • [1] The role of scientific data management systems in handling laboratory information
    Goffredo, ME
    [J]. AMERICAN LABORATORY, 1999, 31 (06) : 65 - +
  • [2] Scientific information management
    不详
    [J]. NACHRICHTEN AUS CHEMIE TECHNIK UND LABORATORIUM, 1995, 43 (04): : 449 - +
  • [3] Sustainability in the management of scientific information
    Medina, Miguel Angel
    [J]. SUSTAINABILITY SCIENCE, 2021, 16 (01) : 329 - 336
  • [4] Scientific evidence: information management
    不详
    [J]. REVISTA DE SAUDE PUBLICA, 2009, 43 (06):
  • [5] Organization of biomedical data for collaborative scientific research: A research information management system
    Myneni, Sahiti
    Patel, Vimla L.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2010, 30 (03) : 256 - 264
  • [6] Scientific interdisciplinarity in the management of scientific-technological information
    Valiente Marquez, Jorge Felix
    Rodriguez Gomez, Ariel
    Quesada Pollero, Mirta de la Caridad
    Perera Cumerma, Leopoldo Fernando
    [J]. AVANCES, 2022, 24 (04): : 398 - 416
  • [7] Space systems for research of the earth - Basis for monitoring and management of scientific and applied information data - Reception and use of scientific and applied information data in automatic space complexes and systems
    Kozlov, DI
    Makarov, VP
    [J]. AUTOMATIC CONTROL IN AEROSPACE 1998, 1999, : 445 - 453
  • [8] Scientific Visualizer in personal information management
    Osinski, Grzegorz
    Osinska, Veslava
    Camacho, Brian
    [J]. 2017 TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2017, : 269 - 272
  • [9] INFORMATION MANAGEMENT - KEYSTONE OF THE SCIENTIFIC METHOD
    LISTON, DM
    HALLIDAY, TC
    [J]. PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1985, 22 : 271 - 277
  • [10] Active management of scientific data
    Plale, B
    Gannon, D
    Alameda, J
    Wilhelmson, B
    Hampton, S
    Rossi, A
    Droegemeier, K
    [J]. IEEE INTERNET COMPUTING, 2005, 9 (01) : 27 - 34