Publication: Critical evaluation of web-based DNA N6-methyladenine site prediction tools
Issued Date
2021-07-01
Resource Type
ISSN
20412657
20412649
20412649
Other identifier(s)
2-s2.0-85100938416
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Briefings in Functional Genomics. Vol.20, No.4 (2021), 258-272
Suggested Citation
Md Mehedi Hasan, Watshara Shoombuatong, Hiroyuki Kurata, Balachandran Manavalan Critical evaluation of web-based DNA N6-methyladenine site prediction tools. Briefings in Functional Genomics. Vol.20, No.4 (2021), 258-272. doi:10.1093/bfgp/elaa028 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/76134
Research Projects
Organizational Units
Authors
Journal Issue
Thesis
Title
Critical evaluation of web-based DNA N6-methyladenine site prediction tools
Other Contributor(s)
Abstract
Methylation of DNA N6-methyladenosine (6mA) is a type of epigenetic modification that plays pivotal roles in various biological processes. The accurate genome-wide identification of 6mA is a challenging task that leads to understanding the biological functions. For the last 5 years, a number of bioinformatics approaches and tools for 6mA site prediction have been established, and some of them are easily accessible as web application. Nevertheless, the accurate genome-wide identification of 6mA is still one of the challenging works that lead to understanding the biological functions. Especially in practical applications, these tools have implemented diverse encoding schemes, machine learning algorithms and feature selection methods, whereas few systematic performance comparisons of 6mA site predictors have been reported. In this review, 11 publicly available 6mA predictors evaluated with seven different species-specific datasets (Arabidopsis thaliana, Tolypocladium, Diospyros lotus, Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans and Escherichia coli). Of those, few species are close homologs, and the remaining datasets are distant sequences. Our independent, validation tests demonstrated that Meta-i6mA and MM-6mAPred models for A. thaliana, Tolypocladium, S. cerevisiae and D. melanogaster achieved excellent overall performance when compared with their counterparts. However, none of the existing methods were suitable for E. coli, C. elegans and D. lotus. A feasibility of the existing predictors is also discussed for the seven species. Our evaluation provides useful guidelines for the development of 6mA site predictors and helps biologists selecting suitable prediction tools.