SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations

dc.contributor.authorBoonyarit B.
dc.contributor.authorKositchutima M.
dc.contributor.authorPhattalung T.N.
dc.contributor.authorYamprasert N.
dc.contributor.authorThuwajit C.
dc.contributor.authorRungrotmongkol T.
dc.contributor.authorNutanong S.
dc.contributor.correspondenceBoonyarit B.
dc.contributor.otherMahidol University
dc.date.accessioned2025-08-22T18:15:14Z
dc.date.available2025-08-22T18:15:14Z
dc.date.issued2025-01-01
dc.description.abstractMotivation: Drug combination therapy plays a pivotal role in addressing the molecular heterogeneity of cancer, improving treatment efficacy, minimizing resistance, and reducing toxicity. Deep learning approaches have significantly advanced drug combination discovery by addressing the limitations of conventional laboratory experiments, which are time-consuming and costly. While most existing models rely on the molecular structure of drugs and gene expression data, incorporating protein-level expression provides a more accurate representation of cellular behavior and drug responses. In this study, we introduce SynProtX, an enhanced deep learning model that explicitly integrates large-scale proteomics with deep neural networks (DNNs) and the molecular structure of drugs with graph neural networks (GNNs). Results: The SynProtX-GATFP model, which combines molecular graphs and fingerprints through a graph attention network architecture, demonstrated superior predictive performance for the FRIEDMAN study dataset. We further evaluated its cell line–specific performance, which achieved accuracy across diverse tissue and study datasets. By incorporating protein expression data, the model consistently enhanced predictive performance over gene expression–only models, reflecting the functional state of cancer cells. The generalizability of SynProtX was rigorously validated using cold-start prediction, including leave-drug-combination-out, leave-drug-out, and leave-cell-line-out validation strategies, highlighting its robust performance and potential for clinical applicability. Additionally, SynProtX identified key cancer-associated proteins and molecular substructures, offering novel insights into the biological mechanisms underlying drug synergy. These findings highlight the potential of integrating large-scale proteomics and multiomics data to advance anticancer drug design and combination therapy strategies for personalized medicine.
dc.identifier.citationGigascience Vol.14 (2025)
dc.identifier.doi10.1093/gigascience/giaf080
dc.identifier.eissn2047217X
dc.identifier.scopus2-s2.0-105013054151
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/111728
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.subjectMedicine
dc.titleSynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105013054151&origin=inward
oaire.citation.titleGigascience
oaire.citation.volume14
oairecerif.author.affiliationChulalongkorn University
oairecerif.author.affiliationSiriraj Hospital
oairecerif.author.affiliationSirindhorn International Institute of Technology Thammasat University
oairecerif.author.affiliationVidyasirimedhi Institute of Science and Technology
oairecerif.author.affiliationKamnoetvidya Science Academy

Files

Collections