Publication:
NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning

dc.contributor.authorMd Mehedi Hasanen_US
dc.contributor.authorMd Ashad Alamen_US
dc.contributor.authorWatshara Shoombuatongen_US
dc.contributor.authorHong Wen Dengen_US
dc.contributor.authorBalachandran Manavalanen_US
dc.contributor.authorHiroyuki Kurataen_US
dc.contributor.otherKyushu Institute of Technologyen_US
dc.contributor.otherAjou University School of Medicineen_US
dc.contributor.otherJapan Society for the Promotion of Scienceen_US
dc.contributor.otherTulane University School of Medicineen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2022-08-04T08:04:08Z
dc.date.available2022-08-04T08:04:08Z
dc.date.issued2021-11-05en_US
dc.description.abstractNeuropeptides (NPs) are the most versatile neurotransmitters in the immune systems that regulate various central anxious hormones. An efficient and effective bioinformatics tool for rapid and accurate large-scale identification of NPs is critical in immunoinformatics, which is indispensable for basic research and drug development. Although a few NP prediction tools have been developed, it is mandatory to improve their NPs' prediction performances. In this study, we have developed a machine learning-based meta-predictor called NeuroPred-FRL by employing the feature representation learning approach. First, we generated 66 optimal baseline models by employing 11 different encodings, six different classifiers and a two-step feature selection approach. The predicted probability scores of NPs based on the 66 baseline models were combined to be deemed as the input feature vector. Second, in order to enhance the feature representation ability, we applied the two-step feature selection approach to optimize the 66-D probability feature vector and then inputted the optimal one into a random forest classifier for the final meta-model (NeuroPred-FRL) construction. Benchmarking experiments based on both cross-validation and independent tests indicate that the NeuroPred-FRL achieves a superior prediction performance of NPs compared with the other state-of-the-art predictors. We believe that the proposed NeuroPred-FRL can serve as a powerful tool for large-scale identification of NPs, facilitating the characterization of their functional mechanisms and expediting their applications in clinical therapy. Moreover, we interpreted some model mechanisms of NeuroPred-FRL by leveraging the robust SHapley Additive exPlanation algorithm.en_US
dc.identifier.citationBriefings in bioinformatics. Vol.22, No.6 (2021)en_US
dc.identifier.doi10.1093/bib/bbab167en_US
dc.identifier.issn14774054en_US
dc.identifier.other2-s2.0-85108972351en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/75960
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85108972351&origin=inwarden_US
dc.subjectBiochemistry, Genetics and Molecular Biologyen_US
dc.subjectComputer Scienceen_US
dc.titleNeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learningen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85108972351&origin=inwarden_US

Files

Collections