StackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy

dc.contributor.authorSchaduangrat N.
dc.contributor.authorAnuwongcharoen N.
dc.contributor.authorMoni M.A.
dc.contributor.authorLio’ P.
dc.contributor.authorCharoenkwan P.
dc.contributor.authorShoombuatong W.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-29T18:13:10Z
dc.date.available2023-06-29T18:13:10Z
dc.date.issued2022-12-01
dc.description.abstractProgesterone receptors (PRs) are implicated in various cancers since their presence/absence can determine clinical outcomes. The overstimulation of progesterone can facilitate oncogenesis and thus, its modulation through PR inhibition is urgently needed. To address this issue, a novel stacked ensemble learning approach (termed StackPR) is presented for fast, accurate, and large-scale identification of PR antagonists using only SMILES notation without the need for 3D structural information. We employed six popular machine learning (ML) algorithms (i.e., logistic regression, partial least squares, k-nearest neighbor, support vector machine, extremely randomized trees, and random forest) coupled with twelve conventional molecular descriptors to create 72 baseline models. Then, a genetic algorithm in conjunction with the self-assessment-report approach was utilized to determine m out of the 72 baseline models as means of developing the final meta-predictor using the stacking strategy and tenfold cross-validation test. Experimental results on the independent test dataset show that StackPR achieved impressive predictive performance with an accuracy of 0.966 and Matthew’s coefficient correlation of 0.925. In addition, analysis based on the SHapley Additive exPlanation algorithm and molecular docking indicates that aliphatic hydrocarbons and nitrogen-containing substructures were the most important features for having PR antagonist activity. Finally, we implemented an online webserver using StackPR, which is freely accessible at http://pmlabstack.pythonanywhere.com/StackPR. StackPR is anticipated to be a powerful computational tool for the large-scale identification of unknown PR antagonist candidates for follow-up experimental validation.
dc.identifier.citationScientific Reports Vol.12 No.1 (2022)
dc.identifier.doi10.1038/s41598-022-20143-5
dc.identifier.eissn20452322
dc.identifier.pmid36180453
dc.identifier.scopus2-s2.0-85139231644
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/87717
dc.rights.holderSCOPUS
dc.subjectMultidisciplinary
dc.titleStackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85139231644&origin=inward
oaire.citation.issue1
oaire.citation.titleScientific Reports
oaire.citation.volume12
oairecerif.author.affiliationDepartment of Computer Science and Technology
oairecerif.author.affiliationThe University of Queensland
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationChiang Mai University

Files

Collections