Publication: Prednts: Improved and robust prediction of nitrotyrosine sites by integrating multiple sequence features
Issued Date
2021-03-01
Resource Type
ISSN
14220067
16616596
16616596
Other identifier(s)
2-s2.0-85102048754
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
International Journal of Molecular Sciences. Vol.22, No.5 (2021), 1-11
Suggested Citation
Andi Nur Nilamyani, Firda Nurul Auliah, Mohammad Ali Moni, Watshara Shoombuatong, Md Mehedi Hasan, Hiroyuki Kurata Prednts: Improved and robust prediction of nitrotyrosine sites by integrating multiple sequence features. International Journal of Molecular Sciences. Vol.22, No.5 (2021), 1-11. doi:10.3390/ijms22052704 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/76264
Research Projects
Organizational Units
Authors
Journal Issue
Thesis
Title
Prednts: Improved and robust prediction of nitrotyrosine sites by integrating multiple sequence features
Abstract
Nitrotyrosine, which is generated by numerous reactive nitrogen species, is a type of protein post-translational modification. Identification of site-specific nitration modification on tyro-sine is a prerequisite to understanding the molecular function of nitrated proteins. Thanks to the progress of machine learning, computational prediction can play a vital role before the biological experimentation. Herein, we developed a computational predictor PredNTS by integrating multiple sequence features including K-mer, composition of k-spaced amino acid pairs (CKSAAP), AAindex, and binary encoding schemes. The important features were selected by the recursive feature elimination approach using a random forest classifier. Finally, we linearly combined the successive random forest (RF) probability scores generated by the different, single encoding-employing RF models. The resultant PredNTS predictor achieved an area under a curve (AUC) of 0.910 using five-fold cross validation. It outperformed the existing predictors on a comprehensive and independent dataset. Furthermore, we investigated several machine learning algorithms to demonstrate the superiority of the employed RF algorithm. The PredNTS is a useful computational resource for the prediction of nitrotyrosine sites. The web-application with the curated datasets of the PredNTS is publicly available.