Identifying valuable features from complex omics data is of great significance for disease diagnosis study. This paper proposes a new feature selection algorithm based on sample network (FS-SN) to mine important information from omics data. The sample network is constructed according to the sample neighbor relationship at the molecular (feature) expression level, and the distinguishing ability of the feature is evaluated based on the topology of the sample network. The sample network established on a feature with a strong discriminating ability tends to have many edges between the same group samples and few edges between the different group samples. At the same time, FS-SN removes redundant features according to the gravitational interaction between features. To show the validation of FS-SN, it was compared on ten public datasets with ERGS, mRMR, ReliefF, ATSD-DN, and INDEED which are efficient in omics data analysis. Experimental results show that FS-SN performed better than the compared methods in accuracy, sensitivity and specificity in most cases. Hence, FS-SN making use of the topology of the sample network is effective for analyzing omics data, it can identify key features that reflect the occurrence and development of diseases, and reveal the underlying biological mechanism.
机构:
Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
Harvard Med Sch, Boston, MA 02115 USABrigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
De Marzio, Margherita
Glass, Kimberly
论文数: 0引用数: 0
h-index: 0
机构:
Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
Harvard Med Sch, Boston, MA 02115 USA
Harvard Chan Sch Publ Hlth, Biostat Dept, Boston, MA 02115 USABrigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
Glass, Kimberly
Kuijjer, Marieke L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Oslo, Ctr Mol Med Norway NCMM, Nord EMBL Partnership, Oslo, NorwayBrigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
机构:
Univ Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
INSERM, U1161, F-F75010 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Gwinner, Frederik
论文数: 引用数:
h-index:
机构:
Boulday, Gwenola
Vandiedonck, Claire
论文数: 0引用数: 0
h-index: 0
机构:
Univ Paris Diderot, Sorbonne Paris Cite, UMRS 958, F-75010 Paris, France
INSERM, U958, F-75010 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Vandiedonck, Claire
Arnould, Minh
论文数: 0引用数: 0
h-index: 0
机构:
Univ Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
INSERM, U1161, F-F75010 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Arnould, Minh
论文数: 引用数:
h-index:
机构:
Cardoso, Cecile
Nikolayeva, Iryna
论文数: 0引用数: 0
h-index: 0
机构:
Inst Pasteur, Syst Biol Lab, Inst Pasteur CNRS, C3BI,USR 3756, F-75015 Paris, France
Inst Pasteur, Funct Genet Infect Dis Unit, F-75015 Paris, France
Univ Paris 05, Sorbonne Paris Cite, F-75006 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Nikolayeva, Iryna
Guitart-Pla, Oriol
论文数: 0引用数: 0
h-index: 0
机构:
Inst Pasteur, Syst Biol Lab, Inst Pasteur CNRS, C3BI,USR 3756, F-75015 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Guitart-Pla, Oriol
Denis, Cecile V.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Paris Saclay, Univ Paris Sud, Unite 1176, INSERM, F-94270 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Denis, Cecile V.
Christophe, Olivier D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Paris Saclay, Univ Paris Sud, Unite 1176, INSERM, F-94270 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France
Christophe, Olivier D.
Beghain, Johann
论文数: 0引用数: 0
h-index: 0
机构:
Inst Pasteur, Funct Genet Infect Dis Unit, F-75015 Paris, France
Inst Pasteur, Genet & Genom Insect Vectors, F-75015 Paris, FranceUniv Paris Diderot, Sorbonne Paris Cite, UMRS 1161, F-75010 Paris, France