RETRACTED: Text mining and network analysis of molecular interaction in non-small cell lung cancer by using natural language processing (Retracted article. See vol. 42, pg. 1489, 2015)

被引:10
|
作者
Li, Jun [1 ]
Bi, Lintao [1 ]
Sun, Yanxia [1 ]
Lu, Zhenxia [1 ]
Lin, Yumei [1 ]
Bai, Ou [2 ]
Shao, Hui [1 ]
机构
[1] Jilin Univ, China Japan Union Hosp, Dept Hematol & Oncol, Changchun 130031, Jilin, Peoples R China
[2] Jilin Univ, Hosp 1, Tumor Ctr, Changchun 130021, Jilin, Peoples R China
关键词
Non-small cell lung cancer; Molecular networks; Text mining using natural language processing; KEGG pathway analysis; MAMMALIAN PHENOTYPE ONTOLOGY; GENE-EXPRESSION; RAS MUTATIONS; ADENOCARCINOMA; ASSOCIATION; MECHANISM; PREDICT; MODEL; KRAS; PRB;
D O I
10.1007/s11033-014-3705-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Lung cancer including non-small cell lung cancer (NSCLC) and small cell lung cancer is one of the most aggressive tumors with high incidence and low survival rate. The typical NSCLC patients account for 80-85 % of the total lung cancer patients. To systemically explore the molecular mechanisms of NSCLC, we performed a molecular network analysis between human and mouse to identify key genes (pathways) involved in the occurrence of NSCLC. We automatically extracted the human-to-mouse orthologous interactions using the GeneWays system by natural language processing and further constructed molecular (gene and its products) networks by mapping the human-to-mouse interactions to NSCLC-related mammalian phenotypes, followed by module analysis using ClusterONE of Cytoscape and pathway enrichment analysis using the database for annotation, visualization and integrated discovery (DAVID) successively. A total of 70 genes were proven to be related to the mammalian phenotypes of NSCLC, and seven genes (ATAD5, BECN1, CDKN2A, FNTB, E2F1, KRAS and PTE1V) were found to have a bearing on more than one mammalian phenotype (MP) each. Four network clusters centered by four genes thyroglobulin (TG), neurofibromatosis type-1 (NF1), neurofibromatosis type 2 (NF2) and E2F transcription factor 1 (E2F1) were generated. Genes in the four network modules were enriched in eight KEGG pathways (p value <0.05), including pathways in cancer, small cell lung cancer, cell cycle and p53 signaling pathway. Genes p53 and E2F1 may play important roles in NSCLC occurrence, and thus can be considered as therapeutic targets for NSCLC.
引用
收藏
页码:8071 / 8079
页数:9
相关论文
共 50 条