Big data in genomic research for big questions with examples from covid-19 and other zoonoses

被引:1
|
作者
Wassenaar, Trudy M. [1 ]
Ussery, David W. [2 ]
Rosel, Adriana Cabal [3 ]
机构
[1] Mol Microbiol & Genom Consultants, Tannenstr 7, D-55576 Zotzenheim, Germany
[2] Univ Arkansas Med Sci, Dept Biomed Informat, 4301 W Markham St, Little Rock, AR 72205 USA
[3] Austrian Agcy Hlth & Food Safety, Inst Med Microbiol & Hyg, Div Publ Hlth, Wahringerstr 25a, A-1096 Vienna, Austria
基金
美国国家科学基金会;
关键词
omics; genomics; zoonoses; COVID-19; Salmonella; scientific publishing; big data; SALMONELLA-ENTERICA; MICROBIOME; COLI;
D O I
10.1093/jambio/lxac055
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Omics research inevitably involves the collection and analysis of big data, which can only be handled by automated approaches. Here we point out that the analysis of big data in the field of genomics dictates certain requirements, such as specialized software, quality control of input data, and simplification for visualization of the results. The latter results in a loss of information, as is exemplified for phylogenetic trees. Clear communication of big data analyses can be enhanced by novel visualization strategies. The interpretation of findings is sometimes hampered when dedicated analytical tools are not fully understood by microbiologists, while the researchers performing these analyses may not have a full overview of the biology of the microbes under study. These issues are illustrated here, using SARS-Cov-2 and Salmonella enterica as zoonotic examples. Whereas in scientific communications jargon should be avoided or explained, nomenclature to group similar organisms and distinguish these from more distant relatives is not only essential, but also influences the interpretation of results. Unfortunately, changes in taxonomically accepted names are now so frequent that they hamper rather than assist research, as is illustrated with difficulties of microbiome studies. Nomenclature to group viral isolates, as is done for SARS-Cov2, is also not without difficulties. Some weaknesses in current omics research stem from poor quality of data or biased databases, and problems can be magnified by machine learning approaches. Moreover, the overall opus of scientific publications can now be considered "big data", as is illustrated by the avalanche of COVID-19-related publications. The peer-review model of scientific publishing is only barely coping with this novel situation, resulting in retractions and the publication of bogus works. The avalanche of scientific publications that originated from the current pandemic can obstruct literature searches, and this will unfortunately continue over time.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Mobile Big Data in the fight against COVID-19
    Benjamins, Richard
    Vos, Jeanine
    Verhulst, Stefaan
    DATA & POLICY, 2022, 4
  • [22] The Role of Big Data and Machine Learning in COVID-19
    Ababneh, Mustafa
    Aljarrah, Aayat
    Karagozlu, Damla
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2020, 11 (02) : 1 - 20
  • [23] Combat COVID-19 with artificial intelligence and big data
    Lin, Leesa
    Hou, Zhiyuan
    JOURNAL OF TRAVEL MEDICINE, 2020, 27 (05)
  • [24] Big Questions and Big Data: A Reply from the Collaboratory
    Hofmeester, Karin
    Moll-Murata, Christine
    INTERNATIONAL REVIEW OF SOCIAL HISTORY, 2017, 62 (01) : 123 - 130
  • [25] Leveraging BIG Data from BIG Databases to Answer BIG Questions
    Whittier, Joanna
    Sievert, Nick
    Loftus, Andrew
    Defilippi, Julie M.
    Krogman, Rebecca M.
    Ojala, Jeffrey
    Litts, Thom
    Kopaska, Jeff
    Eiden, Nicole
    FISHERIES, 2016, 41 (07) : 417 - 419
  • [26] Directions for research and training in plant omics: Big Questions and Big Data
    Argueso, Cristiana T.
    Assmann, Sarah M.
    Birnbaum, Kenneth D.
    Chen, Sixue
    Dinneny, Jose R.
    Doherty, Colleen J.
    Eveland, Andrea L.
    Friesner, Joanna
    Greenlee, Vanessa R.
    Law, Julie A.
    Marshall-Colon, Amy
    Mason, Grace Alex
    O'Lexy, Ruby
    Peck, Scott C.
    Schmitz, Robert J.
    Song, Liang
    Stern, David
    Varagona, Marguerite J.
    Walley, Justin W.
    Williams, Cranos M.
    PLANT DIRECT, 2019, 3 (04)
  • [27] Big data, privacy and COVID-19 – learning from humanitarian expertise in data protection
    Andrej Zwitter
    Oskar J. Gstrein
    Journal of International Humanitarian Action, 2020, 5 (1)
  • [28] Characteristics of COVID-19 and Research Progresses on Genetic Engineering Vaccine Based on Big Data
    Wei, Qixing
    JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022
  • [29] Impact of COVID-19 on pornography use: Evidence from big data analyses
    Lau, Way Kwok-Wai
    Ngan, Lionel Ho-Man
    Chan, Randolph Chun-Ho
    Wu, William Ka-Kei
    Lau, Benson Wui-Man
    PLOS ONE, 2021, 16 (12):
  • [30] A COVID-19 Vaccine: Big Strides Come with Big Challenges
    Mellet, Juanita
    Pepper, Michael S.
    VACCINES, 2021, 9 (01) : 1 - 14