Publication: BERT4Bitter: A bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides
Issued Date
2021-09-01
Resource Type
ISSN
14602059
13674803
13674803
Other identifier(s)
2-s2.0-85102066790
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Bioinformatics. Vol.37, No.17 (2021), 2556-2562
Suggested Citation
Phasit Charoenkwan, Chanin Nantasenamat, Md Mehedi Hasan, Balachandran Manavalan, Watshara Shoombuatong BERT4Bitter: A bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides. Bioinformatics. Vol.37, No.17 (2021), 2556-2562. doi:10.1093/bioinformatics/btab133 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/76067
Research Projects
Organizational Units
Authors
Journal Issue
Thesis
Title
BERT4Bitter: A bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides
Abstract
Motivation: The identification of bitter peptides through experimental approaches is an expensive and timeconsuming endeavor. Due to the huge number of newly available peptide sequences in the post-genomic era, the development of automated computational models for the identification of novel bitter peptides is highly desirable. Results: In this work, we present BERT4Bitter, a bidirectional encoder representation from transformers (BERT)- based model for predicting bitter peptides directly from their amino acid sequence without using any structural information. To the best of our knowledge, this is the first time a BERT-based model has been employed to identify bitter peptides. Compared to widely used machine learning models, BERT4Bitter achieved the best performance with an accuracy of 0.861 and 0.922 for cross-validation and independent tests, respectively. Furthermore, extensive empirical benchmarking experiments on the independent dataset demonstrated that BERT4Bitter clearly outperformed the existing method with improvements of 8.0% accuracy and 16.0% Matthews coefficient correlation, highlighting the effectiveness and robustness of BERT4Bitter. We believe that the BERT4Bitter method proposed herein will be a useful tool for rapidly screening and identifying novel bitter peptides for drug development and nutritional research.