Wan Ting PohEryu XiaKwanrutai Chin-inmanuLai Ping WongAnthony Y. ChengPrida MalasitPrapat SuriyapholYik Ying TeoRick Twee Hee OngNational University of SingaporeMahidol UniversityThailand National Center for Genetic Engineering and BiotechnologyGenome Institute of Singapore2018-10-192018-10-192013-12-05BMC Bioinformatics. Vol.14, No.1 (2013)147121052-s2.0-84888988927https://repository.li.mahidol.ac.th/handle/123456789/31143Background: Many potentially life-threatening infectious viruses are highly mutable in nature. Characterizing the fittest variants within a quasispecies from infected patients is expected to allow unprecedented opportunities to investigate the relationship between quasispecies diversity and disease epidemiology. The advent of next-generation sequencing technologies has allowed the study of virus diversity with high-throughput sequencing, although these methods come with higher rates of errors which can artificially increase diversity.Results: Here we introduce a novel computational approach that incorporates base quality scores from next-generation sequencers for reconstructing viral genome sequences that simultaneously infers the number of variants within a quasispecies that are present. Comparisons on simulated and clinical data on dengue virus suggest that the novel approach provides a more accurate inference of the underlying number of variants within the quasispecies, which is vital for clinical efforts in mapping the within-host viral diversity. Sequence alignments generated by our approach are also found to exhibit lower rates of error.Conclusions: The ability to infer the viral quasispecies colony that is present within a human host provides the potential for a more accurate classification of the viral phenotype. Understanding the genomics of viruses will be relevant not just to studying how to control or even eradicate these viral infectious diseases, but also in learning about the innate protection in the human host against the viruses. © 2013 Poh et al.; licensee BioMed Central Ltd.Mahidol UniversityBiochemistry, Genetics and Molecular BiologyComputer ScienceMathematicsViral quasispecies inference from 454 pyrosequencingArticleSCOPUS10.1186/1471-2105-14-355