Publication: Pup-fuse: Prediction of protein pupylation sites by integrating multiple sequence representations
Issued Date
2021-02-02
Resource Type
ISSN
14220067
16616596
Other identifier(s)
2-s2.0-85100933219
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
International Journal of Molecular Sciences. Vol.22, No.4 (2021), 1-12
Suggested Citation
Firda Nurul Auliah, Andi Nur Nilamyani, Watshara Shoombuatong, Md Ashad Alam, Md Mehedi Hasan, Hiroyuki Kurata. Pup-fuse: Prediction of protein pupylation sites by integrating multiple sequence representations. International Journal of Molecular Sciences. Vol.22, No.4 (2021), 1-12. doi:10.3390/ijms22042120 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/76284
Abstract
Pupylation is a reversible post-translational modification of proteins that plays a key role in the cellular function of microbial organisms. Several proteomics methods have been developed for the prediction and analysis of pupylated proteins and pupylation sites, but these traditional experimental methods are laborious and time-consuming. Computational algorithms that can predict potential pupylation sites from sequence features are therefore needed. In this research, a new prediction model, PUP-Fuse, was developed for pupylation site prediction by integrating multiple sequence representations. We explored five types of feature encoding approaches and three machine learning (ML) algorithms. In the final model, we integrated the successive ML scores using a linear regression model. PUP-Fuse achieved a Matthews correlation coefficient of 0.768 in a 10-fold cross-validation test and outperformed existing predictors in an independent test. The PUP-Fuse web server, with curated datasets, is freely available.
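The two quantitative ideas in the abstract, combining per-model scores linearly and evaluating with the Matthews correlation coefficient, can be illustrated with a minimal sketch. This is not the published PUP-Fuse implementation: the function names, the toy scores, the 0.5 decision threshold, and the weights are all hypothetical, and in the actual model the weights would be fitted by linear regression rather than set by hand.

```python
import math

def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient for binary labels
    (1 = pupylated site, 0 = non-pupylated site)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0

def fuse_scores(scores, weights, bias=0.0):
    """Linear combination of the scores produced by several classifiers."""
    return bias + sum(w * s for w, s in zip(weights, scores))

# Toy example: three hypothetical classifiers scoring six candidate lysines.
y_true = [1, 1, 1, 0, 0, 0]
per_model_scores = [
    (0.9, 0.8, 0.7),
    (0.6, 0.7, 0.4),
    (0.3, 0.4, 0.6),
    (0.2, 0.1, 0.3),
    (0.4, 0.3, 0.2),
    (0.7, 0.6, 0.5),
]
weights = (0.4, 0.4, 0.2)  # hypothetical; not the published coefficients

fused = [fuse_scores(s, weights) for s in per_model_scores]
y_pred = [1 if f >= 0.5 else 0 for f in fused]  # assumed threshold
mcc = matthews_corrcoef(y_true, y_pred)
print(f"MCC on toy data: {mcc:.3f}")
```

The coefficient ranges from -1 to 1, with 1 indicating perfect agreement between predicted and true labels, which is why it is commonly reported for imbalanced site-prediction datasets.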