Large-scale open bioinformatics data resources

被引:0
|
作者
Stupka, E [1 ]
机构
[1] Natl Univ Singapore, Inst Mol & Cell Biol, Singapore 117609, Singapore
关键词
bioinformatics; database; genome; integration; microarray; open source; proteome; transcriptome;
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The data explosion in bioinformatics is relentless. More and more genomes are being sequenced and many new types of datasets are being generated in large-scale projects. Integration and true open access to the data are still difficult issues, although they are gradually being addressed. Notably, certain fields have good standardization and interoperability, while others lag behind. This review summarizes the latest developments in genome and sequences databases, transcriptomics data (ESTs, ORESTES, full-length cDNAs), proteomics data (protein databases, protein structures, family and domain classification) as well as loosely integrated fields, such as microarray experiments, mutation databases and databases of regulatory regions and elements. The review attempts to resist simply summarizing what data are available, and aims to provide a critical look at some of the integration and access issues associated with several of these resources.
引用
收藏
页码:265 / 274
页数:10
相关论文
共 50 条
  • [1] Efficient bioinformatics approaches for large-scale data analysis
    Hautaniemi, S.
    [J]. FEBS JOURNAL, 2011, 278 : 27 - 27
  • [2] Large-Scale Machine Learning and Optimization for Bioinformatics Data Analysis
    Cheng, Jianlin
    [J]. ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [3] Bioinformatics Strategies for the Analysis and Integration of Large-Scale Multiomics Data
    Tesi, Niccolo
    van der Lee, Sven
    Hulsman, Marc
    Holstege, Henne
    Reinders, Marcel
    [J]. JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2023, 78 (04): : 659 - 662
  • [4] A generic bioinformatics pipeline to integrate large-scale trait data with large phylogenies
    Fernando, Pasan C.
    Jackson, Laura M.
    Zeng, Erliang
    Mabee, Paula M.
    Balhoff, James P.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 2235 - 2237
  • [5] Challenges in large-scale bioinformatics projects
    Morrison-Smith, Sarah
    Boucher, Christina
    Sarcevic, Aleksandra
    Noyes, Noelle
    O'Brien, Catherine
    Cuadros, Nazaret
    Ruiz, Jaime
    [J]. HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2022, 9 (01):
  • [6] Challenges in large-scale bioinformatics projects
    Sarah Morrison-Smith
    Christina Boucher
    Aleksandra Sarcevic
    Noelle Noyes
    Catherine O’Brien
    Nazaret Cuadros
    Jaime Ruiz
    [J]. Humanities and Social Sciences Communications, 9
  • [7] Optimization of hadoop cluster for analyzing large-scale sequence data in bioinformatics
    Toth, Adam
    Karimi, Ramin
    [J]. ANNALES MATHEMATICAE ET INFORMATICAE, 2019, 50 : 187 - 202
  • [8] Intelligent mining of large-scale bio-data: Bioinformatics applications
    Hashemi, Farahnaz Sadat Golestan
    Ismail, Mohd Razi
    Yusop, Mohd Rafii
    Hashemi, Mahboobe Sadat Golestan
    Shahraki, Mohammad Hossein Nadimi
    Rastegari, Hamid
    Miah, Gous
    Aslani, Farzad
    [J]. BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2018, 32 (01) : 10 - 29
  • [9] Anytime Large-Scale Analytics of Linked Open Data
    Soulet, Arnaud
    Suchanek, Fabian M.
    [J]. SEMANTIC WEB - ISWC 2019, PT I, 2019, 11778 : 576 - 592
  • [10] pmparser and PMDB: resources for large-scale, open studies of the biomedical literature
    Schoenbachler, Joshua L.
    Hughey, Jacob J.
    [J]. PEERJ, 2021, 9