Simple method for cutoff point identification in descriptive high-throughput biological studies

被引:3
|
作者
Suvorov, Alexander [1 ]
机构
[1] Univ Massachusetts, Sch Publ Hlth & Hlth Sci, Dept Environm Hlth Sci, 686 North Pleasant St, Amherst, MA 01003 USA
关键词
Cutoff; Dichotomization; Descriptive genomics; Threshold; -omics;
D O I
10.1186/s12864-022-08427-6
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Rapid development of high-throughput omics technologies generates an increasing interest in algorithms for cutoff point identification. Existing cutoff methods and tools identify cutoff points based on an association of continuous variables with another variable, such as phenotype, disease state, or treatment group. These approaches are not applicable for descriptive studies in which continuous variables are reported without known association with any biologically meaningful variables. Results The most common shape of the ranked distribution of continuous variables in high-throughput descriptive studies corresponds to a biphasic curve, where the first phase includes a big number of variables with values slowly growing with rank and the second phase includes a smaller number of variables rapidly growing with rank. This study describes an easy algorithm to identify the boundary between these phases to be used as a cutoff point. Discussion The major assumption of that approach is that a small number of variables with high values dominate the biological system and determine its major processes and functions. This approach was tested on three different datasets: human genes and their expression values in the human cerebral cortex, mammalian genes and their values of sensitivity to chemical exposures, and human proteins and their expression values in the human heart. In every case, the described cutoff identification method produced shortlists of variables (genes, proteins) highly relevant for dominant functions/pathways of the analyzed biological systems. Conclusions The described method for cutoff identification may be used to prioritize variables in descriptive omics studies for a focused functional analysis, in situations where other methods of dichotomization of data are inaccessible.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] A SIMPLE HIGH-THROUGHPUT ENZYMEIMMUNOASSAY FOR NORETHISTERONE (NORETHINDRONE)
    TURKES, A
    READ, GF
    RIADFAHMY, D
    CONTRACEPTION, 1982, 25 (05) : 505 - 514
  • [32] Simple high-throughput annotation pipeline (SHAP)
    DeMaere, Matthew Z.
    Lauro, Federico M.
    Thomas, Torsten
    Yau, Sheree
    Cavicchioli, Ricardo
    BIOINFORMATICS, 2011, 27 (17) : 2431 - 2432
  • [33] Magneto-lithography, a simple and inexpensive method for high-throughput, surface patterning
    Bardea, Amos
    Yoffe, Alexander
    2016 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING (ICSEE), 2016,
  • [34] Development of a simple and high-throughput method for detecting aflatoxins production in culture media
    Degola, F.
    Dall'Asta, C.
    Restivo, F. M.
    LETTERS IN APPLIED MICROBIOLOGY, 2012, 55 (01) : 82 - 89
  • [35] High-throughput screening for induced point mutations
    Colbert, T
    Till, BJ
    Tompa, R
    Reynolds, S
    Steine, MN
    Yeung, AT
    McCallum, CM
    Comai, L
    Henikoff, S
    PLANT PHYSIOLOGY, 2001, 126 (02) : 480 - 484
  • [36] Quantitative analysis of high-throughput biological data
    Juan, Hsueh-Fen
    Huang, Hsuan-Cheng
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2023, 13 (04)
  • [37] Droplet microfluidics for high-throughput biological assays
    Guo, Mira T.
    Rotem, Assaf
    Heyman, John A.
    Weitz, David A.
    LAB ON A CHIP, 2012, 12 (12) : 2146 - 2155
  • [38] Biological Processes Discovered by High-Throughput Sequencing
    Reon, Brian J.
    Dutta, Anindya
    AMERICAN JOURNAL OF PATHOLOGY, 2016, 186 (04): : 722 - 732
  • [39] Establishing a High-Throughput Locomotion Tracking Method for Multiple Biological Assessments in Tetrahymena
    Suryanto, Michael Edbert
    Vasquez, Ross D.
    Roldan, Marri Jmelou M.
    Chen, Kelvin H-C
    Huang, Jong-Chin
    Hsiao, Chung-Der
    Tsao, Che-Chia
    CELLS, 2022, 11 (15)
  • [40] A Bayesian method for biological pathway discovery from high-throughput experimental data
    Wang, W
    Cooper, GF
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 645 - 646