HCS-hierarchical algorithm for simulation of omics datasets

被引:0
|
作者
Stomma, Piotr [1 ,2 ]
Rudnicki, Witold R. [1 ,2 ]
机构
[1] Univ Bialystok, Fac Comp Sci, PL-15245 Bialystok, Poland
[2] Univ Bialystok, Computat Ctr, PL-15245 Bialystok, Poland
关键词
IDENTIFICATION; NETWORKS; MODULES; MATRIX;
D O I
10.1093/bioinformatics/btae392
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Analysis of the omics data with the help of machine learning (ML) methods is limited by small sample sizes and a large number of variables. One possible approach to deal with such data is using algorithms for feature selection and reducing the dataset to include only those variables that are related to the studied phenomena. Existing simulators of the omics data were mostly developed with the goal of improving the methods for generations of high-quality data, that correspond with the highest possible fidelity to the real level of molecular markers in the biological materials. The current study aims to simulate the data on a higher level of generalization. Such datasets can then be used to perform tests of the feature selection and ML algorithms on systems that have structures mimicking those of real data, but where the ground truth may be implanted by design. They can also be used to generate contrast variables with the desired correlation structure for the feature selection.Results We proposed the algorithm for the reconstruction of the omic dataset that, with high fidelity, preserves the correlation structure of the original data with a reduced number of parameters. It is based on the hierarchical clustering of variables and uses principal components of the clusters. It reproduces well topological descriptors of the correlation structure. The correlation structure of the principal components of the clusters then is used to obtain datasets with correlation structures similar to the original data but not correlated with the original variables.Availability and implementation The code and data is available at: https://github.com/p100mma/hcrs_omics.
引用
收藏
页码:ii98 / ii104
页数:7
相关论文
共 50 条
  • [31] Special issue: Integration of OMICs datasets into Metabolic Pathway Analysis
    Kaleta, Christoph
    de Figueiredo, Luis F.
    Heiland, Ines
    Klamt, Steffen
    Schuster, Stefan
    [J]. BIOSYSTEMS, 2011, 105 (02) : 107 - 108
  • [32] Characterizing the omics landscape based on 10,000+ datasets
    Eva Brombacher
    Oliver Schilling
    Clemens Kreutz
    [J]. Scientific Reports, 15 (1)
  • [33] Kronos: Circadian rhythmicity analysis in microbiome and other 'omics datasets
    Bastiaanssen, T. F. S.
    Leigh, S.
    Tofani, G. S. S.
    Gheorghe, C. E.
    Clarke, G.
    Cryan, J. F.
    [J]. NEUROGASTROENTEROLOGY AND MOTILITY, 2023, 35
  • [34] Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
    Sompairac, Nicolas
    Nazarov, Petr V.
    Czerwinska, Urszula
    Cantini, Laura
    Biton, Anne
    Molkenov, Askhat
    Zhumadilov, Zhaxybay
    Barillot, Emmanuel
    Radvanyi, Francois
    Gorban, Alexander
    Kairov, Ulykbek
    Zinovyev, Andrei
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (18)
  • [35] Analyzing 'omics data using hierarchical models
    Ji, Hongkai
    Liu, X. Shirley
    [J]. NATURE BIOTECHNOLOGY, 2010, 28 (04) : 337 - 340
  • [36] Analyzing 'omics data using hierarchical models
    Hongkai Ji
    X Shirley Liu
    [J]. Nature Biotechnology, 2010, 28 : 337 - 340
  • [37] An integrative imputation method based on multi-omics datasets
    Lin, Dongdong
    Zhang, Jigang
    Li, Jingyao
    Xu, Chao
    Deng, Hong-Wen
    Wang, Yu-Ping
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [38] FAST ELECTROMAGNETIC SIMULATION ALGORITHM BASED ON HIERARCHICAL AND CURVILINEAR FINITE ELEMENTS
    Ping, X. W.
    Zhou, X. Y.
    Yu, W. M.
    Cui, T. J.
    [J]. MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2011, 53 (02) : 324 - 331
  • [39] HCS: hierarchical cluster-based forwarding scheme for mobile social networks
    Kim, Sun-Kyum
    Yoon, Ji-Hyeun
    Lee, Junyeop
    Yang, Sung-Bong
    [J]. WIRELESS NETWORKS, 2015, 21 (05) : 1699 - 1711
  • [40] Hierarchical decomposition of datasets on irregular surface meshes
    Bonneau, GP
    Gerussi, A
    [J]. COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 1998, : 59 - 63