Publication:
INJclust: Iterative neighbor-joining tree clustering framework for inferring population structure

dc.contributor.authorTulaya Limpitien_US
dc.contributor.authorChainarong Amornbunchornvejen_US
dc.contributor.authorApichart Intarapanichen_US
dc.contributor.authorAnunchai Assawamakinen_US
dc.contributor.authorSissades Tongsimaen_US
dc.contributor.otherKing Mongkut's Institute of Technology Ladkrabangen_US
dc.contributor.otherThailand National Electronics and Computer Technology Centeren_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherThailand National Center for Genetic Engineering and Biotechnologyen_US
dc.date.accessioned2018-11-09T01:50:45Z
dc.date.available2018-11-09T01:50:45Z
dc.date.issued2014-09-01en_US
dc.description.abstract© 2014 IEEE. Understanding genetic differences among populations is one of the most important issues in population genetics. Genetic variations, e.g., single nucleotide polymorphisms, are used to characterize commonality and difference of individuals from various populations. This paper presents an efficient graph-based clustering framework which operates iteratively on the Neighbor-Joining (NJ) tree called the iNJclust algorithm. The framework uses well-known genetic measurements, namely the allele-sharing distance, the neighbor-joining tree, and the fixation index. The behavior of the fixation index is utilized in the algorithm's stopping criterion. The algorithm provides an estimated number of populations, individual assignments, and relationships between populations as outputs. The clustering result is reported in the form of a binary tree, whose terminal nodes represent the final inferred populations and the tree structure preserves the genetic relationships among them. The clustering performance and the robustness of the proposed algorithm are tested extensively using simulated and real data sets from bovine, sheep, and human populations. The result indicates that the number of populations within each data set is reasonably estimated, the individual assignment is robust, and the structure of the inferred population tree corresponds to the intrinsic relationships among populations within the data.en_US
dc.identifier.citationIEEE/ACM Transactions on Computational Biology and Bioinformatics. Vol.11, No.5 (2014), 903-914en_US
dc.identifier.doi10.1109/TCBB.2014.2322372en_US
dc.identifier.issn15455963en_US
dc.identifier.other2-s2.0-84908003075en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/33228
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84908003075&origin=inwarden_US
dc.subjectBiochemistry, Genetics and Molecular Biologyen_US
dc.subjectMathematicsen_US
dc.titleINJclust: Iterative neighbor-joining tree clustering framework for inferring population structureen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84908003075&origin=inwarden_US

Files

Collections