Publication:
A simple grid implementation with Berkeley Open Infrastructure for Network Computing using BLAST as a model

dc.contributor.authorWatthanai Pinthongen_US
dc.contributor.authorPanya Muangruenen_US
dc.contributor.authorPrapat Suriyapholen_US
dc.contributor.authorDumrong Mairiangen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherThailand National Center for Genetic Engineering and Biotechnologyen_US
dc.date.accessioned2018-12-11T02:03:57Z
dc.date.accessioned2019-03-14T08:03:12Z
dc.date.available2018-12-11T02:03:57Z
dc.date.available2019-03-14T08:03:12Z
dc.date.issued2016-01-01en_US
dc.description.abstractDevelopment of high-throughput technologies, such as Next-generation sequencing, allows thousands of experiments to be performed simultaneously while reducing resource requirement. Consequently, a massive amount of experiment data is now rapidly generated. Nevertheless, the data are not readily usable or meaningful until they are further analysed and interpreted. Due to the size of the data, a high performance computer (HPC) is required for the analysis and interpretation. However, the HPC is expensive and difficult to access. Other means were developed to allow researchers to acquire the power of HPC without a need to purchase and maintain one such as cloud computing services and grid computing system. In this study, we implemented grid computing in a computer training center environment using Berkeley Open Infrastructure for Network Computing (BOINC) as a job distributor and data manager combining all desktop computers to virtualize the HPC. Fifty desktop computers were used for setting up a grid system during the off-hours. In order to test the performance of the grid system, we adapted the Basic Local Alignment Search Tools (BLAST) to the BOINC system. Sequencing results from Illumina platform were aligned to the human genome database by BLAST on the grid system. The result and processing time were compared to those from a single desktop computer and HPC. The estimated durations of BLAST analysis for 4 million sequence reads on a desktop PC, HPC and the grid system were 568, 24 and 5 days, respectively. Thus, the grid implementation of BLAST by BOINC is an efficient alternative to the HPC for sequence alignment. The grid implementation by BOINC also helped tap unused computing resources during the off-hours and could be easily modified for other available bioinformatics software.en_US
dc.identifier.citationPeerJ. Vol.2016, No.7 (2016)en_US
dc.identifier.doi10.7717/peerj.2248en_US
dc.identifier.issn21678359en_US
dc.identifier.other2-s2.0-84981495266en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/42170
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84981495266&origin=inwarden_US
dc.subjectAgricultural and Biological Sciencesen_US
dc.subjectBiochemistry, Genetics and Molecular Biologyen_US
dc.titleA simple grid implementation with Berkeley Open Infrastructure for Network Computing using BLAST as a modelen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84981495266&origin=inwarden_US

Files

Collections