SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations
6
Issued Date
2025-01-01
Resource Type
eISSN
2047217X
Scopus ID
2-s2.0-105013054151
Journal Title
Gigascience
Volume
14
Rights Holder(s)
SCOPUS
Bibliographic Citation
Gigascience Vol.14 (2025)
Suggested Citation
Boonyarit B., Kositchutima M., Phattalung T.N., Yamprasert N., Thuwajit C., Rungrotmongkol T., Nutanong S. SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations. Gigascience Vol.14 (2025). doi:10.1093/gigascience/giaf080 Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/111728
Title
SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations
Corresponding Author(s)
Other Contributor(s)
Abstract
Motivation: Drug combination therapy plays a pivotal role in addressing the molecular heterogeneity of cancer, improving treatment efficacy, minimizing resistance, and reducing toxicity. Deep learning approaches have significantly advanced drug combination discovery by addressing the limitations of conventional laboratory experiments, which are time-consuming and costly. While most existing models rely on the molecular structure of drugs and gene expression data, incorporating protein-level expression provides a more accurate representation of cellular behavior and drug responses. In this study, we introduce SynProtX, an enhanced deep learning model that explicitly integrates large-scale proteomics with deep neural networks (DNNs) and the molecular structure of drugs with graph neural networks (GNNs). Results: The SynProtX-GATFP model, which combines molecular graphs and fingerprints through a graph attention network architecture, demonstrated superior predictive performance for the FRIEDMAN study dataset. We further evaluated its cell line–specific performance, which achieved accuracy across diverse tissue and study datasets. By incorporating protein expression data, the model consistently enhanced predictive performance over gene expression–only models, reflecting the functional state of cancer cells. The generalizability of SynProtX was rigorously validated using cold-start prediction, including leave-drug-combination-out, leave-drug-out, and leave-cell-line-out validation strategies, highlighting its robust performance and potential for clinical applicability. Additionally, SynProtX identified key cancer-associated proteins and molecular substructures, offering novel insights into the biological mechanisms underlying drug synergy. These findings highlight the potential of integrating large-scale proteomics and multiomics data to advance anticancer drug design and combination therapy strategies for personalized medicine.
