Controlling coverage of D-optimal onion designs and selections

被引:15
|
作者
Olsson, IM [1 ]
Gottfries, J
Wold, S
机构
[1] Umea Univ, Chemometr Res Grp, Dept Chem, SE-90187 Umea, Sweden
[2] AstraZeneca R&D, Med Chem, SE-43183 Molndal, Sweden
关键词
statistical molecular design; space-filling design; D-optimal design; D-optimal onion designs; principal properties; PLS;
D O I
10.1002/cem.901
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Statistical molecular design (SMD) is a powerful approach for selection of compound sets in medicinal chemistry and quantitative structure-activity relationships (QSARs) as well as other areas. Two techniques often used in SMD are space-filling and D-optimal designs. Both on occasions lead to unwanted redundancy and replication. To remedy such shortcomings, a generalization of D-optimal selection was recently developed. This new method divides the compound candidate set into a number of subsets ('layers' or 'shells'), and a D-optimal selection is made from each layer. This improves the possibility to select representative molecular structures throughout any property space independently of requested sample size. This is important in complex situations where any given model is unlikely to be valid over the whole investigated domain of experimental conditions. The number of selected molecules can be controlled by varying the number of subsets or by altering the complexity of the model equation in each layer and/or the dependency of previous layers. The new method, called D-optimal onion design (DOOD), will allow the user to choose the model equation complexity independently of sample size while still avoiding unwarranted redundancy. The focus of the present work is algorithmic improvements of DOOD in comparison with classical D-optimal design. As illustrations, extended DOODs have been generated for two applications by in-house programming, including some modifications of the D-optimal algorithm. The performances of the investigated approaches are expected to differ depending on the number of principal properties of the compounds in the design, sample sizes and the investigated model, i.e. the aim of the design. QSAR models have been generated from the selected compound sets, and root mean squared error of prediction (RMSEP) values have been used as measures of performance of the different designs. Copyright (C) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:548 / 557
页数:10
相关论文
共 50 条
  • [41] D-Optimal designs for quadratic regression models
    van Berkum, EEM
    Pauwels, B
    Upperman, PM
    [J]. ADVANCES IN STOCHASTIC SIMULATION METHODS, 2000, : 189 - 195
  • [42] D-optimal minimax fractional factorial designs
    Lin, Dennis K. J.
    Zhou, Julie
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2013, 41 (02): : 325 - 340
  • [43] D-Optimal Experimental Designs for Uniaxial Expression
    Munson-Mcgee, Stuart H.
    [J]. JOURNAL OF FOOD PROCESS ENGINEERING, 2014, 37 (03) : 248 - 256
  • [44] D-optimal designs for weighted polynomial regression
    Chang, FC
    Lin, GC
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1997, 62 (02) : 317 - 331
  • [45] COMPARISON OF ROBUST CRITERIA FOR D-OPTIMAL DESIGNS
    Foo, Lee Kien
    McGree, James
    Eccleston, John
    Duffull, Stephen
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2012, 22 (06) : 1193 - 1205
  • [46] D-OPTIMAL DESIGNS OF EXPERIMENTS WITH NONINTERACTING FACTORS
    SCHWABE, R
    WIERICH, W
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1995, 44 (03) : 371 - 384
  • [47] D-OPTIMAL DESIGNS FOR POISSON REGRESSION MODELS
    Russell, K. G.
    Woods, D. C.
    Lewis, S. M.
    Eccleston, J. A.
    [J]. STATISTICA SINICA, 2009, 19 (02) : 721 - 730
  • [48] D-optimal designs via a cocktail algorithm
    Yaming Yu
    [J]. Statistics and Computing, 2011, 21 : 475 - 481
  • [49] D-OPTIMAL DESIGNS FOR MULTINOMIAL LOGISTIC MODELS
    Bu, Xianwei
    Majumdar, Dibyen
    Yang, Jie
    [J]. ANNALS OF STATISTICS, 2020, 48 (02): : 983 - 1000
  • [50] A modified simplex algorithm of D-optimal designs
    Chen, LS
    Li, ZY
    Zhu, WY
    [J]. Proceedings of the Third International Symposium on Magnetic Industry (ISMI'04) & First International Symposium on Physics and IT Industry (ISITI'04), 2005, : 292 - 294