Analysis of Metabolomics Datasets with High-Performance Computing and Metabolite Atlases

被引:37
|
作者
Yao, Yushu [1 ,2 ]
Sun, Terence [1 ,2 ]
Wang, Tony [1 ,2 ]
Ruebel, Oliver [1 ,2 ]
Northen, Trent [3 ]
Bowen, Benjamin P. [3 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Natl Energy Res Sci Comp Ctr NERSC, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Div Life Sci, Berkeley, CA 94720 USA
关键词
SciDB; metabolite atlas; metabolomics; data analysis; I[!text type='Python']Python[!/text; !text type='Python']Python[!/text; LC/MS; MS/MS; biology;
D O I
10.3390/metabo5030431
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Even with the widespread use of liquid chromatography mass spectrometry (LC/MS) based metabolomics, there are still a number of challenges facing this promising technique. Many, diverse experimental workflows exist; yet there is a lack of infrastructure and systems for tracking and sharing of information. Here, we describe the Metabolite Atlas framework and interface that provides highly-efficient, web-based access to raw mass spectrometry data in concert with assertions about chemicals detected to help address some of these challenges. This integration, by design, enables experimentalists to explore their raw data, specify and refine features annotations such that they can be leveraged for future experiments. Fast queries of the data through the web using SciDB, a parallelized database for high performance computing, make this process operate quickly. By using scripting containers, such as IPython or Jupyter, to analyze the data, scientists can utilize a wide variety of freely available graphing, statistics, and information management resources. In addition, the interfaces facilitate integration with systems biology tools to ultimately link metabolomics data with biological models.
引用
收藏
页码:431 / 442
页数:12
相关论文
共 50 条
  • [1] Dealing with the Unknown: Metabolomics and Metabolite Atlases
    Bowen, Benjamin P.
    Northen, Trent R.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2010, 21 (09) : 1471 - 1476
  • [2] Improving Data Analysis Performance for High-Performance Computing with Integrating Statistical Metadata in Scientific Datasets
    Liu, Jialin
    Chen, Yong
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1292 - 1295
  • [3] Graph analysis with high-performance computing
    Hendrickson, Bruce
    Berry, JonatHan W.
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (02) : 14 - 19
  • [4] High-performance computing for surface modelling and analysis
    Clematis, A
    Coda, A
    Falcidieno, B
    Spagnuolo, M
    [J]. VISUAL COMPUTER, 2000, 16 (01): : 62 - 78
  • [5] Data Analysis and Visualization in High-Performance Computing
    Szczepariski, Amy F.
    Huang, Jian
    Baer, Troy
    Mack, Yashema C.
    Ahern, Sean
    [J]. COMPUTER, 2013, 46 (05) : 84 - 92
  • [6] High-performance computing for surface modelling and analysis
    Andrea Clematis
    Andrea Coda
    Bianca Falcidieno
    Michela Spagnuolo
    [J]. The Visual Computer, 2000, 16 : 62 - 78
  • [7] High-Performance Computing
    Bungartz, Hans-Joachim
    [J]. IT-INFORMATION TECHNOLOGY, 2013, 55 (03): : 83 - 85
  • [9] High-performance computing
    Holland, CJ
    Peterkin, RE
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2004, 6 (06) : 8 - 11
  • [10] HIGH-PERFORMANCE COMPUTING
    KOCHER, B
    [J]. COMMUNICATIONS OF THE ACM, 1990, 33 (01) : 3 - 3