中国物理B ›› 2025, Vol. 34 ›› Issue (1): 18701-018701.doi: 10.1088/1674-1056/ad8ecb

• • 上一篇    下一篇

Improving performance of screening MM/PBSA in protein-ligand interactions via machine learning

Yuan-Qiang Chen(陈远强)1, Yao Xu(徐耀)2, Yu-Qiang Ma(马余强)2, and Hong-Ming Ding(丁泓铭)1,†   

  1. 1 Center for Soft Condensed Matter Physics and Interdisciplinary Research, School of Physical Science and Technology, Soochow University, Suzhou 215006, China;
    2 National Laboratory of Solid State Microstructures and Department of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
  • 收稿日期:2024-09-29 修回日期:2024-10-20 接受日期:2024-11-05 发布日期:2025-01-02
  • 通讯作者: Hong-Ming Ding E-mail:dinghm@suda.edu.cn
  • 基金资助:
    Project supported by the National Natural Science Foundation of China (Grant Nos. 12222506, 12347102, 12447164, and 12174184).

Improving performance of screening MM/PBSA in protein-ligand interactions via machine learning

Yuan-Qiang Chen(陈远强)1, Yao Xu(徐耀)2, Yu-Qiang Ma(马余强)2, and Hong-Ming Ding(丁泓铭)1,†   

  1. 1 Center for Soft Condensed Matter Physics and Interdisciplinary Research, School of Physical Science and Technology, Soochow University, Suzhou 215006, China;
    2 National Laboratory of Solid State Microstructures and Department of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
  • Received:2024-09-29 Revised:2024-10-20 Accepted:2024-11-05 Published:2025-01-02
  • Contact: Hong-Ming Ding E-mail:dinghm@suda.edu.cn
  • Supported by:
    Project supported by the National Natural Science Foundation of China (Grant Nos. 12222506, 12347102, 12447164, and 12174184).

摘要: Accurately estimating protein-ligand binding free energy is crucial for drug design and biophysics, yet remains a challenging task. In this study, we applied the screening molecular mechanics/Poisson-Boltzmann surface area (MM/PBSA) method in combination with various machine learning techniques to compute the binding free energies of protein-ligand interactions. Our results demonstrate that machine learning outperforms direct screening MM/PBSA calculations in predicting protein-ligand binding free energies. Notably, the random forest (RF) method exhibited the best predictive performance, with a Pearson correlation coefficient ($r_{\rm p}$) of 0.702 and a mean absolute error (MAE) of 1.379 kcal/mol. Furthermore, we analyzed feature importance rankings in the gradient boosting (GB), adaptive boosting (AdaBoost), and RF methods, and found that feature selection significantly impacted predictive performance. In particular, molecular weight (MW) and van der Waals (VDW) energies played a decisive role in the prediction. Overall, this study highlights the potential of combining machine learning methods with screening MM/PBSA for accurately predicting binding free energies in biosystems.

关键词: molecular mechanics/Poisson-Boltzmann surface area (MM/PBSA), binding free energy, machine learning, protein-ligand interaction

Abstract: Accurately estimating protein-ligand binding free energy is crucial for drug design and biophysics, yet remains a challenging task. In this study, we applied the screening molecular mechanics/Poisson-Boltzmann surface area (MM/PBSA) method in combination with various machine learning techniques to compute the binding free energies of protein-ligand interactions. Our results demonstrate that machine learning outperforms direct screening MM/PBSA calculations in predicting protein-ligand binding free energies. Notably, the random forest (RF) method exhibited the best predictive performance, with a Pearson correlation coefficient ($r_{\rm p}$) of 0.702 and a mean absolute error (MAE) of 1.379 kcal/mol. Furthermore, we analyzed feature importance rankings in the gradient boosting (GB), adaptive boosting (AdaBoost), and RF methods, and found that feature selection significantly impacted predictive performance. In particular, molecular weight (MW) and van der Waals (VDW) energies played a decisive role in the prediction. Overall, this study highlights the potential of combining machine learning methods with screening MM/PBSA for accurately predicting binding free energies in biosystems.

Key words: molecular mechanics/Poisson-Boltzmann surface area (MM/PBSA), binding free energy, machine learning, protein-ligand interaction

中图分类号:  (Molecular dynamics simulation)

  • 87.10.Tf
87.15.-v (Biomolecules: structure and physical properties) 87.15.A- (Theory, modeling, and computer simulation) 87.15.kp (Protein-ligand interactions)