Publication:
BERT4Bitter: A bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides

dc.contributor.authorPhasit Charoenkwanen_US
dc.contributor.authorChanin Nantasenamaten_US
dc.contributor.authorMd Mehedi Hasanen_US
dc.contributor.authorBalachandran Manavalanen_US
dc.contributor.authorWatshara Shoombuatongen_US
dc.contributor.otherKyushu Institute of Technologyen_US
dc.contributor.otherAjou University School of Medicineen_US
dc.contributor.otherTulane University School of Medicineen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherChiang Mai Universityen_US
dc.date.accessioned2022-08-04T08:06:35Z
dc.date.available2022-08-04T08:06:35Z
dc.date.issued2021-09-01en_US
dc.description.abstractMotivation: The identification of bitter peptides through experimental approaches is an expensive and timeconsuming endeavor. Due to the huge number of newly available peptide sequences in the post-genomic era, the development of automated computational models for the identification of novel bitter peptides is highly desirable. Results: In this work, we present BERT4Bitter, a bidirectional encoder representation from transformers (BERT)- based model for predicting bitter peptides directly from their amino acid sequence without using any structural information. To the best of our knowledge, this is the first time a BERT-based model has been employed to identify bitter peptides. Compared to widely used machine learning models, BERT4Bitter achieved the best performance with an accuracy of 0.861 and 0.922 for cross-validation and independent tests, respectively. Furthermore, extensive empirical benchmarking experiments on the independent dataset demonstrated that BERT4Bitter clearly outperformed the existing method with improvements of 8.0% accuracy and 16.0% Matthews coefficient correlation, highlighting the effectiveness and robustness of BERT4Bitter. We believe that the BERT4Bitter method proposed herein will be a useful tool for rapidly screening and identifying novel bitter peptides for drug development and nutritional research.en_US
dc.identifier.citationBioinformatics. Vol.37, No.17 (2021), 2556-2562en_US
dc.identifier.doi10.1093/bioinformatics/btab133en_US
dc.identifier.issn14602059en_US
dc.identifier.issn13674803en_US
dc.identifier.other2-s2.0-85102066790en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/76067
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85102066790&origin=inwarden_US
dc.subjectBiochemistry, Genetics and Molecular Biologyen_US
dc.subjectComputer Scienceen_US
dc.subjectMathematicsen_US
dc.titleBERT4Bitter: A bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptidesen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85102066790&origin=inwarden_US

Files

Collections