Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning

dc.contributor.authorYu T.
dc.contributor.authorHuang T.
dc.contributor.authorYu L.
dc.contributor.authorNantasenamat C.
dc.contributor.authorAnuwongcharoen N.
dc.contributor.authorPiacham T.
dc.contributor.authorRen R.
dc.contributor.authorChiang Y.C.
dc.contributor.otherMahidol University
dc.date.accessioned2023-05-19T07:35:41Z
dc.date.available2023-05-19T07:35:41Z
dc.date.issued2023-02-01
dc.description.abstractCytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure–activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure–activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts.
dc.identifier.citationMolecules Vol.28 No.4 (2023)
dc.identifier.doi10.3390/molecules28041679
dc.identifier.eissn14203049
dc.identifier.pmid36838665
dc.identifier.scopus2-s2.0-85149053552
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/81655
dc.rights.holderSCOPUS
dc.subjectBiochemistry, Genetics and Molecular Biology
dc.titleExploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85149053552&origin=inward
oaire.citation.issue4
oaire.citation.titleMolecules
oaire.citation.volume28
oairecerif.author.affiliationThe Chinese University of Hong Kong, Shenzhen
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationFudan University
oairecerif.author.affiliationSnowflake Inc.
oairecerif.author.affiliationShanghai Qi Zhi Institute

Files

Collections