NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning

Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hong Wen Deng; Balachandran Manavalan; Hiroyuki Kurata

dc.contributor.author	Md Mehedi Hasan	en_US
dc.contributor.author	Md Ashad Alam	en_US
dc.contributor.author	Watshara Shoombuatong	en_US
dc.contributor.author	Hong Wen Deng	en_US
dc.contributor.author	Balachandran Manavalan	en_US
dc.contributor.author	Hiroyuki Kurata	en_US
dc.contributor.other	Kyushu Institute of Technology	en_US
dc.contributor.other	Ajou University School of Medicine	en_US
dc.contributor.other	Japan Society for the Promotion of Science	en_US
dc.contributor.other	Tulane University School of Medicine	en_US
dc.contributor.other	Mahidol University	en_US
dc.date.accessioned	2022-08-04T08:04:08Z
dc.date.available	2022-08-04T08:04:08Z
dc.date.issued	2021-11-05	en_US
dc.description.abstract	Neuropeptides (NPs) are the most versatile neurotransmitters in the immune system that regulate various central anxious hormones. An efficient and effective bioinformatics tool for rapid and accurate large-scale identification of NPs is critical in immunoinformatics, which is indispensable for basic research and drug development. Although a few NP prediction tools have been developed, their prediction performance still needs to be improved. In this study, we have developed a machine learning-based meta-predictor called NeuroPred-FRL by employing the feature representation learning approach. First, we generated 66 optimal baseline models by employing 11 different encodings, six different classifiers and a two-step feature selection approach. The predicted probability scores of NPs from the 66 baseline models were combined to form the input feature vector. Second, in order to enhance the feature representation ability, we applied the two-step feature selection approach to optimize the 66-D probability feature vector and then fed the optimized vector into a random forest classifier to construct the final meta-model (NeuroPred-FRL). Benchmarking experiments based on both cross-validation and independent tests indicate that NeuroPred-FRL achieves superior NP prediction performance compared with other state-of-the-art predictors. We believe that the proposed NeuroPred-FRL can serve as a powerful tool for large-scale identification of NPs, facilitating the characterization of their functional mechanisms and expediting their applications in clinical therapy. Moreover, we interpreted some model mechanisms of NeuroPred-FRL by leveraging the robust SHapley Additive exPlanation algorithm.	en_US
dc.identifier.citation	Briefings in bioinformatics. Vol.22, No.6 (2021)	en_US
dc.identifier.doi	10.1093/bib/bbab167	en_US
dc.identifier.issn	14774054	en_US
dc.identifier.other	2-s2.0-85108972351	en_US
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/20.500.14594/75960
dc.rights	Mahidol University	en_US
dc.rights.holder	SCOPUS	en_US
dc.source.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85108972351&origin=inward	en_US
dc.subject	Biochemistry, Genetics and Molecular Biology	en_US
dc.subject	Computer Science	en_US
dc.title	NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning	en_US
dc.type	Article	en_US
dspace.entity.type	Publication
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85108972351&origin=inward	en_US
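
The first stage described in the abstract (11 encodings crossed with six classifiers, each baseline model contributing one predicted probability) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the abstract gives only the counts, so the encoding and classifier names are placeholders, and the stub scorer stands in for a trained baseline model. It is not the authors' implementation, and it omits the two-step feature selection and the random forest meta-model.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple

# Placeholder names (hypothetical): the abstract states only that there are
# 11 encodings and 6 classifiers, not what they are called.
ENCODINGS = [f"encoding_{i}" for i in range(1, 12)]
CLASSIFIERS = [f"classifier_{j}" for j in range(1, 7)]

BaselineModel = Callable[[str], float]


def make_stub_model(encoding: str, classifier: str) -> BaselineModel:
    """Return a stand-in scorer; a real baseline would be a trained classifier."""
    seed = sum(ord(ch) for ch in encoding + classifier)

    def predict_proba(peptide: str) -> float:
        # Deterministic pseudo-probability in [0, 1]; it has no biological meaning.
        return ((seed + sum(ord(ch) for ch in peptide)) % 101) / 100.0

    return predict_proba


def build_baselines() -> Dict[Tuple[str, str], BaselineModel]:
    """One baseline model per (encoding, classifier) pair: 11 x 6 = 66."""
    return {
        (enc, clf): make_stub_model(enc, clf)
        for enc, clf in product(ENCODINGS, CLASSIFIERS)
    }


def probability_feature_vector(
    peptide: str, baselines: Dict[Tuple[str, str], BaselineModel]
) -> List[float]:
    """Collect each baseline's probability score into one feature vector."""
    return [model(peptide) for model in baselines.values()]


baselines = build_baselines()
vector = probability_feature_vector("YGGFM", baselines)  # example sequence
print(len(baselines), len(vector))
```

In the pipeline the abstract describes, a vector like this would then pass through feature selection before being given to the random forest meta-classifier; with real models, each entry would be the probability that the peptide is an NP according to one baseline.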

Collections

Scopus 2021

