Multi-objective molecular generation via clustered Pareto-based reinforcement learning

被引：1

作者：

Wang, Jing ^{[1
]}

Zhu, Fei ^{[1
]}

机构：

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 179卷

基金：

中国国家自然科学基金;

关键词：

Molecular generation; Multi-objective; Pareto optimization; Molecular clustering; Diversity; OPTIMIZATION;

D O I：

10.1016/j.neunet.2024.106596

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

De novo molecular design is the process of learning knowledge from existing data to propose new chemical structures that satisfy the desired properties. By using de novo design to generate compounds in a directed manner, better solutions can be obtained in large chemical libraries with less comparison cost. But drug design needs to take multiple factors into consideration. For example, in polypharmacology, molecules that activate or inhibit multiple target proteins produce multiple pharmacological activities and are less susceptible to drug resistance. However, most existing molecular generation methods either focus only on affinity for a single target or fail to effectively balance the relationship between multiple targets, resulting in insufficient validity and desirability of the generated molecules. To address the problems, an approach called clustered Pareto-based reinforcement learning (CPRL) is proposed. In CPRL, a pre-trained model is constructed to grasp existing molecular knowledge in a supervised learning manner. In addition, the clustered Pareto optimization algorithm is presented to find the best solution between different objectives. The algorithm first extracts an update set from the sampled molecules through the designed aggregation-based molecular clustering. Then, the final reward is computed by constructing the Pareto frontier ranking of the molecules from the updated set. To explore the vast chemical space, a reinforcement learning agent is designed in CPRL that can be updated under the guidance of the final reward to balance multiple properties. Furthermore, to increase the internal diversity of the molecules, a fixed-parameter exploration model is used for sampling in conjunction with the agent. The experimental results demonstrate that CPRL is capable of balancing multiple properties of the molecule and has higher desirability and validity, reaching 0.9551 and 0.9923, respectively.

引用

页数：15

共 50 条

[1] Pareto-based multi-objective differential evolution
Xue, F
Sanderson, AC
Graves, RJ
CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 862 - 869
[2] Pareto-based multi-objective optimization for classification in data mining
Kamila, Narendra Kumar
Jena, Lambodar
Bhuyan, Hemanta Kumar
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1723 - 1745
[3] Pareto-based multi-objective optimization for classification in data mining
Narendra Kumar Kamila
Lambodar Jena
Hemanta Kumar Bhuyan
Cluster Computing, 2016, 19 : 1723 - 1745
[4] A Pareto-based search methodology for multi-objective nurse scheduling
Burke, Edmund K.
Li, Jingpeng
Qu, Rong
ANNALS OF OPERATIONS RESEARCH, 2012, 196 (01) : 91 - 109
[5] Studies on Pareto-based Multi-objective Competitive Coevolutionary Dynamics
Zeng, Fanchao
Decraene, James
Low, Malcolm Yoke Hean
Cai, Wentong
Hingston, Philip
2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 2383 - 2390
[6] A new pareto-based algorithm for multi-objective graph partitioning
Baños, R
Gil, C
Montoya, MG
Ortega, J
COMPUTER AND INFORMATION SCIENCES - ISCIS 2004, PROCEEDINGS, 2004, 3280 : 779 - 788
[7] Peptide identification via constrained multi-objective optimization: Pareto-based genetic algorithms
Malard, JM
Heredia-Langner, A
Cannon, WR
Mooney, R
Baxter, DJ
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2005, 17 (14): : 1687 - 1704
[8] A Pareto-based search methodology for multi-objective nurse scheduling
Edmund K. Burke
Jingpeng Li
Rong Qu
Annals of Operations Research, 2012, 196 : 91 - 109
[9] A Survey on Pareto-Based EAs to Solve Multi-objective Optimization Problems
Dutta, Saykat
Das, Kedar Nath
SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 807 - 820
[10] DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
Liu, Xuhan
Ye, Kai
van Vlijmen, Herman W. T.
Emmerich, Michael T. M.
IJzerman, Adriaan P.
van Westen, Gerard J. P.
JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)

← 1 2 3 4 5 →