A data-driven, knowledge-based approach to biomarker discovery: application to circulating microRNA markers of colorectal cancer prognosis

Cited by: 44
Authors
Vafaee, Fatemeh [1]
Diakos, Connie [2]
Kirschner, Michaela B. [3]
Reid, Glen [3, 4]
Michael, Michael Z. [5]
Horvath, Lisa G. [4, 6, 7]
Alinejad-Rokny, Hamid [8]
Cheng, Zhangkai Jason [9, 10]
Kuncic, Zdenka [9, 10]
Clarke, Stephen [2]
Affiliations
[1] Univ New South Wales, Sch Biotechnol & Biomol Sci, Sydney, NSW 2033, Australia
[2] Univ Sydney, Royal North Shore Hosp, Kolling Inst Med Res, Reserve Rd, St Leonards, NSW 2065, Australia
[3] Asbestos Dis Res Inst, Hosp Rd, Concord, NSW 2139, Australia
[4] Univ Sydney, Sydney Med Sch, Sydney, NSW 2050, Australia
[5] Flinders Univ S Australia, Flinders Med Ctr, Flinders Ctr Innovat Canc, Adelaide, SA 5042, Australia
[6] Chris O'Brien Lifehouse, Missenden Rd, Camperdown, NSW 2050, Australia
[7] Royal Prince Alfred Hosp, Camperdown, NSW 2050, Australia
[8] Univ New South Wales, Sydney, NSW 2052, Australia
[9] Univ Sydney, Charles Perkins Ctr, Sydney, NSW 2006, Australia
[10] Univ Sydney, Sch Phys, Sydney, NSW 2006, Australia
Keywords
GENE SELECTION; SERUM MIR-21; EXPRESSION; COLON; TARGET; INFLAMMATION; ANNOTATION; DIAGNOSIS; SURVIVAL; PLASMA
DOI
10.1038/s41540-018-0056-1
Chinese Library Classification number
Q [Biological Sciences]
Subject classification code
07; 0710; 09
Abstract
Recent advances in high-throughput technologies have provided an unprecedented opportunity to identify molecular markers of disease processes. This plethora of complex-omics data has simultaneously complicated the problem of extracting meaningful molecular signatures and opened up new opportunities for more sophisticated integrative and holistic approaches. In this era, effective integration of data-driven and knowledge-based approaches for biomarker identification has been recognised as key to improving the identification of high-performance biomarkers, and necessary for translational applications. Here, we have evaluated the role of circulating microRNA as a means of predicting the prognosis of patients with colorectal cancer, which is the second leading cause of cancer-related death worldwide. We have developed a multi-objective optimisation method that effectively integrates a data-driven approach with the knowledge obtained from the microRNA-mediated regulatory network to identify robust plasma microRNA signatures which are reliable in terms of predictive power as well as functional relevance. The proposed multi-objective framework has the capacity to adjust for conflicting biomarker objectives and to incorporate heterogeneous information facilitating systems approaches to biomarker discovery. We have found a prognostic signature of colorectal cancer comprising 11 circulating microRNAs. The identified signature predicts the patients' survival outcome and targets pathways underlying colorectal cancer progression. The altered expression of the identified microRNAs was confirmed in an independent public data set of plasma samples of patients in early stage vs advanced colorectal cancer. Furthermore, the generality of the proposed method was demonstrated across three publicly available miRNA data sets associated with biomarker studies in other diseases.
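The multi-objective idea in the abstract, trading predictive power against functional relevance, can be illustrated with a small Pareto-dominance filter. This is a minimal sketch and not the authors' method: the candidate names (`sig_A` to `sig_E`), the two scores per candidate, and the helpers `dominates` and `pareto_front` are all hypothetical, and the numbers are invented for illustration only.

```python
from typing import Dict, List, Tuple

# Hypothetical (predictive_score, functional_score) pair; higher is better for both.
Scores = Tuple[float, float]


def dominates(a: Scores, b: Scores) -> bool:
    """Return True if a is at least as good as b on every objective
    and strictly better on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(candidates: Dict[str, Scores]) -> List[str]:
    """Return the names of candidates that no other candidate dominates."""
    return sorted(
        name
        for name, scores in candidates.items()
        if not any(
            dominates(other, scores)
            for other_name, other in candidates.items()
            if other_name != name
        )
    )


# Invented example scores for five hypothetical candidate signatures.
candidates = {
    "sig_A": (0.90, 0.40),
    "sig_B": (0.80, 0.70),
    "sig_C": (0.60, 0.90),
    "sig_D": (0.70, 0.60),  # worse than sig_B on both objectives
    "sig_E": (0.50, 0.50),  # worse than sig_B on both objectives
}

front = pareto_front(candidates)
print(front)
```

Candidates left on the front represent different trade-offs between the two objectives; choosing one of them requires a further criterion that this sketch does not model.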
Pages: 12
Journal
npj Systems Biology and Applications, 4