Multi-objective molecular generation via clustered Pareto-based reinforcement learning

被引:1
|
作者
Wang, Jing [1 ]
Zhu, Fei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
基金
中国国家自然科学基金;
关键词
Molecular generation; Multi-objective; Pareto optimization; Molecular clustering; Diversity; OPTIMIZATION;
D O I
10.1016/j.neunet.2024.106596
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
De novo molecular design is the process of learning knowledge from existing data to propose new chemical structures that satisfy the desired properties. By using de novo design to generate compounds in a directed manner, better solutions can be obtained in large chemical libraries with less comparison cost. But drug design needs to take multiple factors into consideration. For example, in polypharmacology, molecules that activate or inhibit multiple target proteins produce multiple pharmacological activities and are less susceptible to drug resistance. However, most existing molecular generation methods either focus only on affinity for a single target or fail to effectively balance the relationship between multiple targets, resulting in insufficient validity and desirability of the generated molecules. To address the problems, an approach called clustered Pareto-based reinforcement learning (CPRL) is proposed. In CPRL, a pre-trained model is constructed to grasp existing molecular knowledge in a supervised learning manner. In addition, the clustered Pareto optimization algorithm is presented to find the best solution between different objectives. The algorithm first extracts an update set from the sampled molecules through the designed aggregation-based molecular clustering. Then, the final reward is computed by constructing the Pareto frontier ranking of the molecules from the updated set. To explore the vast chemical space, a reinforcement learning agent is designed in CPRL that can be updated under the guidance of the final reward to balance multiple properties. Furthermore, to increase the internal diversity of the molecules, a fixed-parameter exploration model is used for sampling in conjunction with the agent. The experimental results demonstrate that CPRL is capable of balancing multiple properties of the molecule and has higher desirability and validity, reaching 0.9551 and 0.9923, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Pareto-based multi-objective differential evolution
    Xue, F
    Sanderson, AC
    Graves, RJ
    CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 862 - 869
  • [2] Pareto-based multi-objective optimization for classification in data mining
    Kamila, Narendra Kumar
    Jena, Lambodar
    Bhuyan, Hemanta Kumar
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1723 - 1745
  • [3] Pareto-based multi-objective optimization for classification in data mining
    Narendra Kumar Kamila
    Lambodar Jena
    Hemanta Kumar Bhuyan
    Cluster Computing, 2016, 19 : 1723 - 1745
  • [4] A Pareto-based search methodology for multi-objective nurse scheduling
    Burke, Edmund K.
    Li, Jingpeng
    Qu, Rong
    ANNALS OF OPERATIONS RESEARCH, 2012, 196 (01) : 91 - 109
  • [5] Studies on Pareto-based Multi-objective Competitive Coevolutionary Dynamics
    Zeng, Fanchao
    Decraene, James
    Low, Malcolm Yoke Hean
    Cai, Wentong
    Hingston, Philip
    2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 2383 - 2390
  • [6] A new pareto-based algorithm for multi-objective graph partitioning
    Baños, R
    Gil, C
    Montoya, MG
    Ortega, J
    COMPUTER AND INFORMATION SCIENCES - ISCIS 2004, PROCEEDINGS, 2004, 3280 : 779 - 788
  • [7] Peptide identification via constrained multi-objective optimization: Pareto-based genetic algorithms
    Malard, JM
    Heredia-Langner, A
    Cannon, WR
    Mooney, R
    Baxter, DJ
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2005, 17 (14): : 1687 - 1704
  • [8] A Pareto-based search methodology for multi-objective nurse scheduling
    Edmund K. Burke
    Jingpeng Li
    Rong Qu
    Annals of Operations Research, 2012, 196 : 91 - 109
  • [9] A Survey on Pareto-Based EAs to Solve Multi-objective Optimization Problems
    Dutta, Saykat
    Das, Kedar Nath
    SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 807 - 820
  • [10] DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
    Liu, Xuhan
    Ye, Kai
    van Vlijmen, Herman W. T.
    Emmerich, Michael T. M.
    IJzerman, Adriaan P.
    van Westen, Gerard J. P.
    JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)