Metadata handling for Big Data projects

被引:0
|
作者
Golosova, M. [1 ]
Aulov, V [1 ]
Kaida, A. [2 ]
机构
[1] Kurchatov Inst, Natl Res Ctr, 1 Pl Kurchatova, Moscow, Russia
[2] Natl Res Tomsk Polytech Univ, 30 Lenina Ave, Tomsk, Russia
基金
俄罗斯科学基金会;
关键词
D O I
10.1088/1742-6596/1117/1/012007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Metadata is information about information. In business, industry or long living scientific experiments metadata grow and evolve with the project lifecycle. It leads to changes in the structure of the metadata, and with time it becomes complex, sophisticated and fluid, so that even simple lookup request appears to be complicated enough to require special tools. Another issue is that metadata can be produced and stored in different ways - paper or digital documents and tables, or databases, or something very specific - depending on the initial capabilities and requirements to its utilization. Due to this, to have a holistic view of the project one often has to perform so called multi source requests, aggregating information from a number of different sources. This kind of requests is not easy to implement, and can hardly be used for online services due to the significant execution time. This paper describes a possible solution by suggesting a method of metadata integration organization and providing an example of its application to information infrastructure of a HEP experiment.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Metadata handling: A video perspective
    Madhwacharyula, Chitra L.
    Davis, Marc
    Clips-Imag, Philippe Mulhem
    Kankanhalli, Mohan S.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2006, 2 (04) : 358 - 388
  • [42] Handling Metadata in a Neurophysiology Laboratory
    Zehl, Lyuba
    Jaillet, Florent
    Stoewer, Adrian
    Grewe, Jan
    Sobolev, Andrey
    Wachtler, Thomas
    Brochier, Thomas G.
    Riehle, Alexa
    Denker, Michael
    Gruen, Sonja
    FRONTIERS IN NEUROINFORMATICS, 2016, 10
  • [43] Uniformly handling metadata registries
    Jeong, D
    Kim, YG
    Park, SH
    Baik, DK
    SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS, 2005, 3647 : 81 - 91
  • [44] Industrializing Data Integration Projects using a Metadata Driven Assembly Line
    Maier, Albert
    Oberhofer, Martin
    Schwarz, Thomas
    IT-INFORMATION TECHNOLOGY, 2012, 54 (03): : 114 - 122
  • [45] An Intelligent System for Cost Data Handling in Construction Projects
    Martinez-Rojas, Maria
    Marin, Nicolas
    Molina, Carlos
    Amparo Vila, Ma
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 359 - 366
  • [46] From Big Data to Big Projects: a Step-by-step Roadmap
    Mousanif, Hajar
    Sabah, Hasna
    Douiji, Yasmina
    Sayad, Younes Oulad
    2014 INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD), 2014, : 373 - 378
  • [47] A proteomics sample metadata representation for multiomics integration and big data analysis
    Dai, Chengxin
    Fullgrabe, Anja
    Pfeuffer, Julianus
    Solovyeva, Elizaveta M.
    Deng, Jingwen
    Moreno, Pablo
    Kamatchinathan, Selvakumar
    Kundu, Deepti Jaiswal
    George, Nancy
    Fexova, Silvie
    Gruening, Bjoern
    Foell, Melanie Christine
    Griss, Johannes
    Vaudel, Marc
    Audain, Enrique
    Locard-Paulet, Marie
    Turewicz, Michael
    Eisenacher, Martin
    Uszkoreit, Julian
    Van den Bossche, Tim
    Schwammle, Veit
    Webel, Henry
    Schulze, Stefan
    Bouyssie, David
    Jayaram, Savita
    Duggineni, Vinay Kumar
    Samaras, Patroklos
    Wilhelm, Mathias
    Choi, Meena
    Wang, Mingxun
    Kohlbacher, Oliver
    Brazma, Alvis
    Papatheodorou, Irene
    Bandeira, Nuno
    Deutsch, Eric W.
    Vizcaino, Juan Antonio
    Bai, Mingze
    Sachsenberg, Timo
    Levitsky, Lev I.
    Perez-Riverol, Yasset
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [48] Managing metadata for digital projects
    Ma, Jin
    LIBRARY COLLECTIONS ACQUISITIONS & TECHNICAL SERVICES, 2006, 30 (1-2): : 3 - 17
  • [49] A proteomics sample metadata representation for multiomics integration and big data analysis
    Chengxin Dai
    Anja Füllgrabe
    Julianus Pfeuffer
    Elizaveta M. Solovyeva
    Jingwen Deng
    Pablo Moreno
    Selvakumar Kamatchinathan
    Deepti Jaiswal Kundu
    Nancy George
    Silvie Fexova
    Björn Grüning
    Melanie Christine Föll
    Johannes Griss
    Marc Vaudel
    Enrique Audain
    Marie Locard-Paulet
    Michael Turewicz
    Martin Eisenacher
    Julian Uszkoreit
    Tim Van Den Bossche
    Veit Schwämmle
    Henry Webel
    Stefan Schulze
    David Bouyssié
    Savita Jayaram
    Vinay Kumar Duggineni
    Patroklos Samaras
    Mathias Wilhelm
    Meena Choi
    Mingxun Wang
    Oliver Kohlbacher
    Alvis Brazma
    Irene Papatheodorou
    Nuno Bandeira
    Eric W. Deutsch
    Juan Antonio Vizcaíno
    Mingze Bai
    Timo Sachsenberg
    Lev I. Levitsky
    Yasset Perez-Riverol
    Nature Communications, 12
  • [50] Handling big data: research challenges and future directions
    I. Anagnostopoulos
    S. Zeadally
    E. Exposito
    The Journal of Supercomputing, 2016, 72 : 1494 - 1516