Efficient techniques to explore and rank paths in life science data sources

被引:0
|
作者
Lacroix, Zoé [1 ]
Raschid, Louiqa [2 ]
Vidal, Maria-Esther [3 ]
机构
[1] Arizona State University, Tempe, AZ 85287-6106, United States
[2] University of Maryland, College Park, MD 20742, United States
[3] Universidad Simon Bolivar, Caracas 1080, Venezuela
基金
英国医学研究理事会; 澳大利亚国家健康与医学研究理事会; 美国国家卫生研究院; 英国惠康基金;
关键词
Semantics - Polynomial approximation;
D O I
10.1007/978-3-540-24745-6_13
中图分类号
学科分类号
摘要
Life science data sources represent a complex link-driven federation of publicly available Web accessible sources. A fundamental need for scientists today is the ability to completely explore all relationships between scientific classes, e.g., genes and citations, that may be retrieved from various data sources. A challenge to such exploration is that each path between data sources potentially has different domain specific semantics and yields different benefit to the scientist. Thus, it is important to efficiently explore paths so as to generate paths with the highest benefits. In this paper, we explore the search space of paths that satisfy queries expressed as regular expressions. We propose an algorithm ESearch that runs in polynomial time in the size of the graph when the graph is acyclic. We present expressions to determine the benefit of a path based on metadata (statistics). We develop a heuristic search OnlyBestXX%. Finally, we compare OnlyBestXX% and ESearch. © Springer-Verlag 2004.
引用
收藏
页码:187 / 202
相关论文
共 50 条
  • [31] Qualifying Explore and Exploit for Efficient Data Dissemination in Emergency Adhoc Networks
    Kolios, Panayiotis
    Pitsillides, Andreas
    Mokryn, Osnat
    Papadaki, Katerina
    2014 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2014, : 303 - 307
  • [32] Data science in healthcare: techniques, challenges and opportunities
    Devi, Pushpa
    Bansal, Kishori Lal
    HEALTH AND TECHNOLOGY, 2024, 14 (04) : 623 - 634
  • [33] Preface: Some Advanced Techniques in Data Science
    Shi, Yong
    Tian, Yingjie
    Annals of Data Science, 2015, 2 (01) : 1 - 3
  • [34] Data Science Techniques for Sustainable Dairy Management
    Fauvel, Kevin
    Masson, Veronique
    Faverdin, Philippe
    Termier, Alexandre
    ERCIM NEWS, 2018, (113): : 29 - 30
  • [35] Usage of Visualization Techniques in Data Science Workflows
    Schmidt, Johanna
    IVAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 3: IVAPP, 2020, : 309 - 316
  • [36] On Scalable Data Mining Techniques for Earth Science
    Goetz, Markus
    Richerzhagen, Matthias
    Bodenstein, Christian
    Cavallaro, Gabriele
    Glock, Philipp
    Riedel, Morris
    Benediktsson, Jon Atli
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 2188 - 2197
  • [37] Energy-efficient data gathering techniques using multiple paths for providing resilience to node failures in wireless sensor networks
    Department of Computer Engineering, Kyungpook National University, Daegu, Korea, Republic of
    J. Commun., 2006, 3 (21-28):
  • [38] A MODIFIED LOG RANK TEST EFFICIENT FOR HIGHLY STRATIFIED DATA
    TSIATIS, AA
    SCHOENFELD, D
    BIOMETRICS, 1985, 41 (01) : 319 - 319
  • [39] Combating the Life Science Data Avalanche
    Hock, Lindsay
    R&D MAGAZINE, 2015, 57 (04): : 14 - 15
  • [40] Efficient Estimation for Rank-Based Regression with Clustered Data
    Fu, Liya
    Wang, You-Gan
    BIOMETRICS, 2012, 68 (04) : 1074 - 1082