Analysis of a large structure/biological activity data set using recursive partitioning

被引:154
|
作者
Rusinko, A [1 ]
Farmen, MW [1 ]
Lambert, CG [1 ]
Brown, PL [1 ]
Young, SS [1 ]
机构
[1] Glaxo Wellcome Inc, Res Informat Syst, Res Triangle Pk, NC 27709 USA
关键词
D O I
10.1021/ci9903049
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Combinatorial chemistry and high-throughput screening are revolutionizing the process of lead discovery in the pharmaceutical industry. Large numbers of structures and vast quantities of biological assay data are quickly being accumulated, overwhelming traditional structure/activity relationship (SAR) analysis technologies. Recursive partitioning is a method for statistically determining rules that classify objects into similar categories or, in this case, structures into groups of molecules with similar potencies. SCAM is a computer program implemented to make extremely efficient use of this methodology. Depending on the size of the data set, rules explaining biological data can be determined interactively. An example data set of 1650 monoamine oxidase inhibitors exemplifies the method, yielding substructural rules and leading to general classifications of these inhibitors. The method scales linearly with the number of descriptors, so hundreds of thousands of structures can be analyzed utilizing thousands to millions of molecular descriptors. There are currently no methods to deal with statistical analysis problems of this size. An important aspect of this analysis is the ability to deal with mixtures, i.e., identify SAR rules for classes of compounds in the same data set that might be binding in different ways. Most current quantitative structure/activity relationship methods require that the compounds follow a single mechanism. Advantages and limitations of this methodology are presented.
引用
收藏
页码:1017 / 1026
页数:10
相关论文
共 50 条
  • [31] Identifying Symptom Cluster Profiles with Latent Class Analysis using Secondary Data Analysis of a Large Data Set
    Conley, Samantha
    [J]. NURSING RESEARCH, 2020, 69 (03) : E47 - E47
  • [32] Structure-Activity Relationship Modeling for Predicting Interactions with Pregnane X Receptor by Recursive Partitioning
    Yoshida, Shuya
    Yamashita, Fumiyoshi
    Itoh, Takayuki
    Hashida, Mitsuru
    [J]. DRUG METABOLISM AND PHARMACOKINETICS, 2012, 27 (05) : 506 - 512
  • [33] A recursive partitioning approach for subgroup identification in individual patient data meta-analysis
    Mistry, Dipesh
    Stallard, Nigel
    Underwood, Martin
    [J]. STATISTICS IN MEDICINE, 2018, 37 (09) : 1550 - 1561
  • [34] Data mining for energy analysis of a large data set of flats
    Capozzoli, Alfonso
    Serale, Gianluca
    Piscitelli, Marco Savino
    Grassi, Daniele
    [J]. PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-ENGINEERING SUSTAINABILITY, 2017, 170 (01) : 3 - 18
  • [35] Detecting associations between intact connectomes and clinical covariates using recursive partitioning object-oriented data analysis
    Yang, Dake
    Deych, Elena
    Shands, Berkley
    Campbell, Meghan C.
    Perlmutter, Joel S.
    Petersen, Steve
    Schlaggar, Bradley L.
    Shannon, William
    [J]. STATISTICS IN MEDICINE, 2019, 38 (29) : 5486 - 5496
  • [36] Predicting Survival of Patients With Spine Metastasis Using the Recursive Partitioning Analysis (RPA)
    Walker, A.
    Mongoue-Tchkote, S.
    Kubicky, C. Dai
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2016, 96 (02): : S198 - S198
  • [37] Recursive Partitioning Analysis to Define Prognostic Groups Using a Large Single Institution Experience of Multimodality Radiation Therapy for Bladder Cancer
    Nair, M.
    Scarborough, J.
    Gupta, S.
    Gilligan, T.
    Ornstein, M. C.
    Wee, C.
    Klein, E. A.
    Haber, G. P.
    Campbell, S.
    Almassi, N.
    Gill, B.
    Berglund, R.
    Lee, B.
    Weight, C.
    Gray, M.
    Ciezki, J. P.
    Stephans, K. L.
    Tendulkar, R. D.
    Scott, J. G.
    Mian, O. Y.
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2022, 114 (03): : E204 - E205
  • [38] SOLUTE PARTITIONING AND MOLECULAR-STRUCTURE, BIOLOGICAL-ACTIVITY APPLICATIONS AND IMPLICATIONS
    TAFT, RW
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1993, 206 : 17 - COMP
  • [39] Recursive control using structure analysis of control systems
    Liu, B
    Zhang, ZK
    Zhou, YM
    [J]. ACC: Proceedings of the 2005 American Control Conference, Vols 1-7, 2005, : 159 - 164
  • [40] DUE-B: Data-driven urban energy benchmarking of buildings using recursive partitioning and stochastic frontier analysis
    Yang, Zheng
    Roth, Jonathan
    Jain, Rishee K.
    [J]. ENERGY AND BUILDINGS, 2018, 163 : 58 - 69