Prevention and Control of Pathogens Based on Big-Data Mining and Visualization Analysis

被引:0
|
作者
Chen, Cui-Xia [1 ,2 ]
Sun, Li-Na [3 ]
Hou, Xue-Xin [3 ]
Du, Peng-Cheng [4 ]
Wang, Xiao-Long [5 ]
Du, Xiao-Chen [6 ]
Yu, Yu-Fei [1 ,2 ]
Cai, Rui-Kun [1 ,2 ]
Yu, Lei [1 ,2 ]
Li, Tian-Jun [1 ,2 ]
Luo, Min-Na [1 ,2 ]
Shen, Yue [1 ,2 ]
Lu, Chao [1 ,2 ]
Li, Qian [1 ,2 ]
Zhang, Chuan [1 ,2 ]
Gao, Hua-Fang [1 ,2 ]
Ma, Xu [1 ,2 ]
Lin, Hao [7 ]
Cao, Zong-Fu [1 ,2 ]
机构
[1] Natl Res Inst Family Planning, Beijing, Peoples R China
[2] Natl Ctr Human Genet Resources, Beijing, Peoples R China
[3] Natl Inst Communicable Dis Control & Prevent, Beijing, Peoples R China
[4] Bejing Ditan Hosp, Beijing, Peoples R China
[5] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China
[6] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[7] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu, Peoples R China
关键词
big data mining; visualization; pathogen identification; genome analysis; virulence; drug-resistance; ANTIBIOTIC-RESISTANCE GENES; RESPIRATORY-TRACT; GENOME SEQUENCE; MICROBIAL GENOMES; PAN-GENOME; SP NOV; PLATFORM; CORE; TOOL; ANNOTATION;
D O I
10.3389/fmolb.2020.626595
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Morbidity and mortality caused by infectious diseases rank first among all human illnesses. Many pathogenic mechanisms remain unclear, while misuse of antibiotics has led to the emergence of drug-resistant strains. Infectious diseases spread rapidly and pathogens mutate quickly, posing new threats to human health. However, with the increasing use of high-throughput screening of pathogen genomes, research based on big data mining and visualization analysis has gradually become a hot topic for studies of infectious disease prevention and control. In this paper, the framework was performed on four infectious pathogens (Fusobacterium, Streptococcus, Neisseria, and Streptococcus salivarius) through five functions: 1) genome annotation, 2) phylogeny analysis based on core genome, 3) analysis of structure differences between genomes, 4) prediction of virulence genes/factors with their pathogenic mechanisms, and 5) prediction of resistance genes/factors with their signaling pathways. The experiments were carried out from three angles: phylogeny (macro perspective), structure differences of genomes (micro perspective), and virulence and drug-resistance characteristics (prediction perspective). Therefore, the framework can not only provide evidence to support the rapid identification of new or unknown pathogens and thus plays a role in the prevention and control of infectious diseases, but also help to recommend the most appropriate strains for clinical and scientific research. This paper presented a new genome information visualization analysis process framework based on big data mining technology with the accommodation of the depth and breadth of pathogens in molecular level research.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Visualization analysis of big data research based on Citespace
    Wang, Weihong
    Lu, Chang
    SOFT COMPUTING, 2020, 24 (11) : 8173 - 8186
  • [32] The Big Data Analysis and Visualization of Mass Messages under "Smart Government Affairs" Based on Text Mining
    Wang, Donghong
    Guo, Jiliang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [33] Big-data Integration Methodologies for Effective Management and Data Mining of Petroleum Digital Ecosystems
    Nimmagadda, Shastri L.
    Dreher, Heinz V.
    2013 7TH IEEE INTERNATIONAL CONFERENCE ON DIGITAL ECOSYSTEMS AND TECHNOLOGIES (DEST), 2013, : 148 - 153
  • [34] A review on sentiment discovery and analysis of educational big-data
    Han, Zhongmei
    Wu, Jiyi
    Huang, Changqin
    Huang, Qionghao
    Zhao, Meihua
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (01)
  • [35] Big-Data analysis used NGS and study of evolution
    Ikeo, Kazuho
    GENES & GENETIC SYSTEMS, 2015, 90 (06) : 360 - 360
  • [36] IPGOD: An Integrated Visualization Platform Based on Big Data Mining and Cloud Computing
    Chen, Wei-Yu
    Lu, Peggy Joy
    Shiau, Steven
    ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 11 - 16
  • [37] Big-data platform based on open source ecosystem
    Lei J.
    Ye H.
    Wu Z.
    Zhang P.
    Xie L.
    He Y.
    1600, Science Press (54): : 80 - 93
  • [38] Big-data based infrastructure management: toward Assetmetrics
    Kobayashi, K.
    Kaito, K.
    LIFE-CYCLE OF STRUCTURAL SYSTEMS: DESIGN, ASSESSMENT, MAINTENANCE AND MANAGEMENT, 2015, : 70 - 80
  • [39] A measurement-based study of big-data movement
    Addanki, Ranjana
    Maji, Sourav
    Veeraraghavan, Malathi
    Tracy, Chris
    2015 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC), 2015, : 445 - 449
  • [40] Model Training Task Scheduling Algorithm Based on Greedy-Genetic Algorithm for Big-Data Mining
    Wang, Yiqi
    Sun, Yipin
    Zhang, Ziwei
    2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168