185
Views
6
CrossRef citations to date
0
Altmetric
Research Articles

Machine learning-based drug design for identification of thymidylate kinase inhibitors as a potential anti-Mycobacterium tuberculosis

, , , , , , , & show all
Pages 3874-3886 | Received 07 Mar 2023, Accepted 15 May 2023, Published online: 26 May 2023
 

Abstract

The rise of antibiotic-resistant Mycobacterium tuberculosis (Mtb) has reduced the availability of medications for tuberculosis therapy, resulting in increased morbidity and mortality globally. Tuberculosis spreads from the lungs to other parts of the body, including the brain and spine. Developing a single drug can take several decades, making drug discovery costly and time-consuming. Machine learning algorithms like support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF) and Gaussian naive base (GNB) are fast and effective and are commonly used in drug discovery. These algorithms are ideal for the virtual screening of large compound libraries to classify molecules as active or inactive. For the training of the models, a dataset of 307 was downloaded from BindingDB. Among 307 compounds, 85 compounds were labeled as active, having an IC50 below 58 mM, while 222 compounds were labeled inactive against thymidylate kinase, with 87.2% accuracy. The developed models were subjected to an external ZINC dataset of 136,564 compounds. Furthermore, we performed the 100-ns dynamic simulation and post trajectories analysis of compounds having good interaction and score in molecular docking. As compared to the standard reference compound, the top three hits revealed greater stability and compactness. In conclusion, our predicted hits can inhibit thymidylate kinase overexpression to combat Mycobacterium tuberculosis.

Communicated by Ramaswamy H. Sarma

Acknowledgment

This work was supported by the State Key Laboratories of Chemical Resources Engineering, Beijing University of Chemical Technology, Beijing 100029, China.

Author’s contributions

M.S & M.D.: Methodology, Visualization, Data Curation and Conceptualization, X.D & D.W: Investigation, C.L, H.G & M.C.: review & editing, Data Curation, Methodology write-up. G.Z.: Supervision, Project administration, Writing - review & editing, Software, Funding acquisition.

Disclosure statement

The authors declare that they have no competing interests

Data availability statement

All data generated or analyzed during this study are included in the article

Additional information

Funding

This research was funded by the National Key R&D Program of China No. 2021YFC2102900 and Beijing Natural Science Foundation No. L212001

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 1,074.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.