DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies

dc.contributor.authorArif M.
dc.contributor.authorKabir M.
dc.contributor.authorAhmed S.
dc.contributor.authorKhan A.
dc.contributor.authorGe F.
dc.contributor.authorKhelifi A.
dc.contributor.authorYu D.J.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T16:51:17Z
dc.date.available2023-06-18T16:51:17Z
dc.date.issued2022-01-01
dc.description.abstractCell-penetrating peptides (CPPs) are special peptides capable of carrying a variety of bioactive molecules, such as genetic materials, short interfering RNAs and nanoparticles, into cells. Recently, research on CPP has gained substantial interest from researchers, and the biological mechanisms of CPPS have been assessed in the context of safe drug delivery agents and therapeutic applications. Correct identification and synthesis of CPPs using traditional biochemical methods is an extremely slow, expensive and laborious task particularly due to the large volume of unannotated peptide sequences accumulating in the World Bank repository. Hence, a powerful bioinformatics predictor that rapidly identifies CPPs with a high recognition rate is urgently needed. To date, numerous computational methods have been developed for CPP prediction. However, the available machine-learning (ML) tools are unable to distinguish both the CPPs and their uptake efficiencies. This study aimed to develop a two-layer deep learning framework named DeepCPPred to identify both CPPs in the first phase and peptide uptake efficiency in the second phase. The DeepCPPred predictor first uses four types of descriptors that cover evolutionary, energy estimation, reduced sequence and amino-acid contact information. Then, the extracted features are optimized through the elastic net algorithm and fed into a cascade deep forest algorithm to build the final CPP model. The proposed method achieved 99.45 percent overall accuracy with the CPP924 benchmark dataset in the first layer and 95.43 percent accuracy in the second layer with the CPPSite3 dataset using a 5-fold cross-validation test. Thus, our proposed bioinformatics tool surpassed all the existing state-of-the-art sequence-based CPP approaches.
dc.identifier.citationIEEE/ACM Transactions on Computational Biology and Bioinformatics Vol.19 No.5 (2022) , 2749-2759
dc.identifier.doi10.1109/TCBB.2021.3102133
dc.identifier.eissn15579964
dc.identifier.issn15455963
dc.identifier.pmid34347603
dc.identifier.scopus2-s2.0-85112628148
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/83953
dc.rights.holderSCOPUS
dc.subjectBiochemistry, Genetics and Molecular Biology
dc.titleDeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85112628148&origin=inward
oaire.citation.endPage2759
oaire.citation.issue5
oaire.citation.startPage2749
oaire.citation.titleIEEE/ACM Transactions on Computational Biology and Bioinformatics
oaire.citation.volume19
oairecerif.author.affiliationAbu Dhabi University
oairecerif.author.affiliationUniversity of Management and Technology Lahore
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationNanjing University of Science and Technology

Files

Collections