Prevention and Control of Pathogens Based on Big-Data Mining and Visualization Analysis

被引:0
|
作者
Chen, Cui-Xia [1 ,2 ]
Sun, Li-Na [3 ]
Hou, Xue-Xin [3 ]
Du, Peng-Cheng [4 ]
Wang, Xiao-Long [5 ]
Du, Xiao-Chen [6 ]
Yu, Yu-Fei [1 ,2 ]
Cai, Rui-Kun [1 ,2 ]
Yu, Lei [1 ,2 ]
Li, Tian-Jun [1 ,2 ]
Luo, Min-Na [1 ,2 ]
Shen, Yue [1 ,2 ]
Lu, Chao [1 ,2 ]
Li, Qian [1 ,2 ]
Zhang, Chuan [1 ,2 ]
Gao, Hua-Fang [1 ,2 ]
Ma, Xu [1 ,2 ]
Lin, Hao [7 ]
Cao, Zong-Fu [1 ,2 ]
机构
[1] Natl Res Inst Family Planning, Beijing, Peoples R China
[2] Natl Ctr Human Genet Resources, Beijing, Peoples R China
[3] Natl Inst Communicable Dis Control & Prevent, Beijing, Peoples R China
[4] Bejing Ditan Hosp, Beijing, Peoples R China
[5] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China
[6] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[7] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu, Peoples R China
关键词
big data mining; visualization; pathogen identification; genome analysis; virulence; drug-resistance; ANTIBIOTIC-RESISTANCE GENES; RESPIRATORY-TRACT; GENOME SEQUENCE; MICROBIAL GENOMES; PAN-GENOME; SP NOV; PLATFORM; CORE; TOOL; ANNOTATION;
D O I
10.3389/fmolb.2020.626595
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Morbidity and mortality caused by infectious diseases rank first among all human illnesses. Many pathogenic mechanisms remain unclear, while misuse of antibiotics has led to the emergence of drug-resistant strains. Infectious diseases spread rapidly and pathogens mutate quickly, posing new threats to human health. However, with the increasing use of high-throughput screening of pathogen genomes, research based on big data mining and visualization analysis has gradually become a hot topic for studies of infectious disease prevention and control. In this paper, the framework was performed on four infectious pathogens (Fusobacterium, Streptococcus, Neisseria, and Streptococcus salivarius) through five functions: 1) genome annotation, 2) phylogeny analysis based on core genome, 3) analysis of structure differences between genomes, 4) prediction of virulence genes/factors with their pathogenic mechanisms, and 5) prediction of resistance genes/factors with their signaling pathways. The experiments were carried out from three angles: phylogeny (macro perspective), structure differences of genomes (micro perspective), and virulence and drug-resistance characteristics (prediction perspective). Therefore, the framework can not only provide evidence to support the rapid identification of new or unknown pathogens and thus plays a role in the prevention and control of infectious diseases, but also help to recommend the most appropriate strains for clinical and scientific research. This paper presented a new genome information visualization analysis process framework based on big data mining technology with the accommodation of the depth and breadth of pathogens in molecular level research.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] NodeMerge: Template Based Efficient Data Reduction For Big-Data Causality Analysis
    Tang, Yutao
    Li, Ding
    Li, Zhichun
    Zhang, Mu
    Jee, Kangkook
    Xiao, Xusheng
    Wu, Zhenyu
    Rhee, Junghwan
    Xu, Fengyuan
    Li, Qun
    PROCEEDINGS OF THE 2018 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'18), 2018, : 1324 - 1337
  • [22] Interpreting big-data analysis of retrospective observational data
    Huizinga, Tom W. J.
    Knevel, Rachel
    LANCET RHEUMATOLOGY, 2020, 2 (11): : E652 - E653
  • [23] Scalable Cloud-based Analysis Framework for Medical Big-data
    Pakdel, Rezvan
    Herbert, John
    PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC), VOL 2, 2016, : 647 - 652
  • [24] Visualization analysis of CBL application in Chinese and international medical education based on big data mining
    Yue, Junyan
    Shang, Yun
    Cui, Hongkai
    Liang, Changhua
    Wu, Qingwu
    Zhao, Junqiang
    Wang, Huifang
    Han, Dongmin
    Zhu, Zhiping
    BMC MEDICAL EDUCATION, 2025, 25 (01)
  • [25] Failure Analysis and Prediction for Big-Data Systems
    Rosa, Andrea
    Chen, Lydia Y.
    Binder, Walter
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2017, 10 (06) : 984 - 998
  • [26] Federated learning based futuristic biomedical big-data analysis and standardization
    Fathima, Afifa Salsabil
    Basha, Syed Muzamil
    Ahmed, Syed Thouheed
    Mathivanan, Sandeep Kumar
    Rajendran, Sukumar
    Mallik, Saurav
    Zhao, Zhongming
    PLOS ONE, 2023, 18 (10):
  • [27] Analysis and Optimization of Big-Data Stream Processing
    Vakilinia, Shahin
    Zhang, Xinyao
    Qiu, Dongyu
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [28] Big-Data Based English-Chinese Corpus Collection and Mining and Machine Translation Framework
    Guo, Hang
    Jiang, Liu
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 418 - 421
  • [29] A Campus Big-Data Platform Architecture for Data Mining and Business Intelligence in Education Institutes
    Zhang, Ningcheng
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS, ENVIRONMENT, BIOTECHNOLOGY AND COMPUTER (MMEBC), 2016, 88 : 282 - 288
  • [30] Visualization analysis of big data research based on Citespace
    Weihong Wang
    Chang Lu
    Soft Computing, 2020, 24 : 8173 - 8186