Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations

Dengue fever, prevalent in Southeast Asian countries, currently lacks effective pharmaceutical interventions for virus replication control. This study employs a strategy that combines machine learning (ML)-based quantitative-structure-activity relationship (QSAR), molecular docking, and molecular dy...

Full description

Bibliographic Details
Main Authors: Chongjun, Y., Nasr, A.M.S., Latif, M.A.M., Rahman, M.B.A., Marlisah, E., Tejo, B.A.
Format: Article
Published: Taylor and Francis 2024
Online Access:http://psasir.upm.edu.my/id/eprint/116181/
_version_ 1848866943146131456
author Chongjun, Y.
Nasr, A.M.S.
Latif, M.A.M.
Rahman, M.B.A.
Marlisah, E.
Tejo, B.A.
author_facet Chongjun, Y.
Nasr, A.M.S.
Latif, M.A.M.
Rahman, M.B.A.
Marlisah, E.
Tejo, B.A.
author_sort Chongjun, Y.
building UPM Institutional Repository
collection Online Access
description Dengue fever, prevalent in Southeast Asian countries, currently lacks effective pharmaceutical interventions for virus replication control. This study employs a strategy that combines machine learning (ML)-based quantitative-structure-activity relationship (QSAR), molecular docking, and molecular dynamics simulations to discover potential inhibitors of the NS3 protease of the dengue virus. We used nine molecular fingerprints from PaDEL to extract features from the NS3 protease dataset of dengue virus type 2 in the ChEMBL database. Feature selection was achieved through the low variance threshold, F-Score, and recursive feature elimination (RFE) methods. Our investigation employed three ML models–support vector machine (SVM), random forest (RF), and extreme gradient boosting (XGBoost)–for classifier development. Our SVM model, combined with SVM-RFE, had the best accuracy (0.866) and ROC_AUC (0.964) in the testing set. We identified potent inhibitors on the basis of the optimal classifier probabilities and docking binding affinities. SHAP and LIME analyses highlighted the significant molecular fingerprints (e.g. ExtFP69, ExtFP362, ExtFP576) involved in NS3 protease inhibitory activity. Molecular dynamics simulations indicated that amphotericin B exhibited the highest binding energy of −212 kJ/mol and formed a hydrogen bond with the critical residue Ser196. This approach enhances NS3 protease inhibitor identification and expedites the discovery of dengue therapeutics.
first_indexed 2025-11-15T14:28:38Z
format Article
id upm-116181
institution Universiti Putra Malaysia
institution_category Local University
last_indexed 2025-11-15T14:28:38Z
publishDate 2024
publisher Taylor and Francis
recordtype eprints
repository_type Digital Repository
spelling upm-1161812025-04-21T03:15:33Z http://psasir.upm.edu.my/id/eprint/116181/ Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations Chongjun, Y. Nasr, A.M.S. Latif, M.A.M. Rahman, M.B.A. Marlisah, E. Tejo, B.A. Dengue fever, prevalent in Southeast Asian countries, currently lacks effective pharmaceutical interventions for virus replication control. This study employs a strategy that combines machine learning (ML)-based quantitative-structure-activity relationship (QSAR), molecular docking, and molecular dynamics simulations to discover potential inhibitors of the NS3 protease of the dengue virus. We used nine molecular fingerprints from PaDEL to extract features from the NS3 protease dataset of dengue virus type 2 in the ChEMBL database. Feature selection was achieved through the low variance threshold, F-Score, and recursive feature elimination (RFE) methods. Our investigation employed three ML models–support vector machine (SVM), random forest (RF), and extreme gradient boosting (XGBoost)–for classifier development. Our SVM model, combined with SVM-RFE, had the best accuracy (0.866) and ROC_AUC (0.964) in the testing set. We identified potent inhibitors on the basis of the optimal classifier probabilities and docking binding affinities. SHAP and LIME analyses highlighted the significant molecular fingerprints (e.g. ExtFP69, ExtFP362, ExtFP576) involved in NS3 protease inhibitory activity. Molecular dynamics simulations indicated that amphotericin B exhibited the highest binding energy of −212 kJ/mol and formed a hydrogen bond with the critical residue Ser196. This approach enhances NS3 protease inhibitor identification and expedites the discovery of dengue therapeutics. Taylor and Francis 2024-08-30 Article PeerReviewed Chongjun, Y. and Nasr, A.M.S. and Latif, M.A.M. and Rahman, M.B.A. and Marlisah, E. and Tejo, B.A. (2024) Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations. SAR and QSAR in Environmental Research, 35 (8). pp. 707-728. ISSN 1062-936X; eISSN: 1029-046X https://www.tandfonline.com/doi/full/10.1080/1062936X.2024.2392677 10.1080/1062936x.2024.2392677
spellingShingle Chongjun, Y.
Nasr, A.M.S.
Latif, M.A.M.
Rahman, M.B.A.
Marlisah, E.
Tejo, B.A.
Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title_full Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title_fullStr Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title_full_unstemmed Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title_short Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations
title_sort predicting repurposed drugs targeting the ns3 protease of dengue virus using machine learning-based qsar, molecular docking, and molecular dynamics simulations
url http://psasir.upm.edu.my/id/eprint/116181/
http://psasir.upm.edu.my/id/eprint/116181/
http://psasir.upm.edu.my/id/eprint/116181/