DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins

dc.contributor.authorHosen M.F.
dc.contributor.authorMahmud S.M.H.
dc.contributor.authorAhmed K.
dc.contributor.authorChen W.
dc.contributor.authorMoni M.A.
dc.contributor.authorDeng H.W.
dc.contributor.authorShoombuatong W.
dc.contributor.authorHasan M.M.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T17:01:51Z
dc.date.available2023-06-18T17:01:51Z
dc.date.issued2022-06-01
dc.description.abstractAccurate identification of DNA-binding proteins (DBPs) is critical for both understanding protein function and drug design. DBPs also play essential roles in different kinds of biological activities such as DNA replication, repair, transcription, and splicing. As experimental identification of DBPs is time-consuming and sometimes biased toward prediction, constructing an effective DBP model represents an urgent need, and computational methods that can accurately predict potential DBPs based on sequence information are highly desirable. In this paper, a novel predictor called DeepDNAbP has been developed to accurately predict DBPs from sequences using a convolutional neural network (CNN) model. First, we perform three feature extraction methods, namely position-specific scoring matrix (PSSM), pseudo-amino acid composition (PseAAC) and tripeptide composition (TPC), to represent protein sequence patterns. Secondly, SHapley Additive exPlanations (SHAP) are employed to remove the redundant and irrelevant features for predicting DBPs. Finally, the best features are provided to the CNN classifier to construct the DeepDNAbP model for identifying DBPs. The final DeepDNAbP predictor achieves superior prediction performance in K-fold cross-validation tests and outperforms other existing predictors of DNA–protein binding methods. DeepDNAbP is poised to be a powerful computational resource for the prediction of DBPs. The web application and curated datasets in this study are freely available at: http://deepdbp.sblog360.blog/.
dc.identifier.citationComputers in Biology and Medicine Vol.145 (2022)
dc.identifier.doi10.1016/j.compbiomed.2022.105433
dc.identifier.eissn18790534
dc.identifier.issn00104825
dc.identifier.pmid35378437
dc.identifier.scopus2-s2.0-85127362204
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/84275
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleDeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127362204&origin=inward
oaire.citation.titleComputers in Biology and Medicine
oaire.citation.volume145
oairecerif.author.affiliationMawlana Bhashani Science and Technology University
oairecerif.author.affiliationAmerican International University - Bangladesh
oairecerif.author.affiliationThe University of Queensland
oairecerif.author.affiliationTulane University School of Medicine
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationUniversity of Electronic Science and Technology of China

Files

Collections