Publication: Predicting the Oligomeric States of Fluorescent Proteins
dc.contributor.author | Saw Simeon | en_US |
dc.contributor.author | Watshara Shoombuatong | en_US |
dc.contributor.author | Likit Preeyanon | en_US |
dc.contributor.author | Virapong Prachayasittikul | en_US |
dc.contributor.author | Chanin Nantasenamat | en_US |
dc.contributor.other | Mahidol University. Faculty of Medical Technology. Center of Data Mining and Biomedical Informatics | en_US |
dc.contributor.other | Mahidol University. Faculty of Medical Technology. Department of Clinical Microbiology and Applied Technology | en_US |
dc.date.accessioned | 2015-08-13T05:05:01Z | |
dc.date.accessioned | 2017-06-20T16:43:09Z | |
dc.date.available | 2015-08-13T05:05:01Z | |
dc.date.available | 2017-06-20T16:43:09Z | |
dc.date.issued | 2015 | |
dc.description.abstract | Currently, monomeric fluorescent proteins (FP) are ideal markers for protein tagging. The prediction of oligomeric states is helpful for enhancing live biomedical imaging. Computational prediction of FP oligomeric states can accelerate the effort of protein engineering to create monomeric FPs by saving time and money. To the best of our knowledge, this study represents the first computational model for predicting and analyzing FP oligomerization directly from their amino acid sequences. An exhaustive dataset consisting of 397 unique FP oligomeric states was compiled from the literature. FP were described by 3 classes of protein descriptors including amino acid composition, dipeptide composition and physicochemical properties. The oligomeric states of FP was predicted using decision tree (DT) algorithm and results demonstrated that DT provided robust performance with accuracies in ranges of 79.97-81.72% and 80.76-82.63% for the internal (e.g. 10-fold cross-validation) and external sets, respectively. This approach was also benchmarked with other common machine learning algorithms such as artificial neural network, support vector machine and random forest. A thorough analysis of amino acid sequence features was conducted to provide informative insights into FP oligomerization, which may aid in engineering novel monomeric fluorescent proteins. The following differentiating characteristics of monomeric and oligomeric fluorescent proteins were derived from DT: (i) substitution of any amino acid to Glu led to the reduction of aggregated proteins and (ii) oligomerization of FP appears to be stabilized by several hydrophobic contacts. | en_US |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/2121 | |
dc.language.iso | eng | en_US |
dc.subject | fluorescent protein | en_US |
dc.subject | FP | en_US |
dc.subject | green fluorescent protein | en_US |
dc.subject | GFP | en_US |
dc.subject | oligomeric state | en_US |
dc.subject | data mining | en_US |
dc.subject | Open Access article | en_US |
dc.title | Predicting the Oligomeric States of Fluorescent Proteins | en_US |
dc.type | Article | en_US |
dspace.entity.type | Publication | |
mods.location.url | https://peerj.com/preprints/922.pdf |