PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis
Issued Date
2023-12-01
Resource Type
eISSN
17560500
Scopus ID
2-s2.0-85174153044
Journal Title
BMC Research Notes
Volume
16
Issue
1
Rights Holder(s)
SCOPUS
Bibliographic Citation
BMC Research Notes Vol.16 No.1 (2023)
Suggested Citation
Krajaejun T., Patumcharoenpol P., Rujirawat T., Kittichotirat W., Tangphatsornruang S., Lohnoo T., Yingyong W. PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis. BMC Research Notes Vol.16 No.1 (2023). doi:10.1186/s13104-023-06532-7 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/90794
Title
PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis
Other Contributor(s)
Abstract
Objectives: Pythium insidiosum is the causative agent of pythiosis, a difficult-to-treat condition, in humans and animals worldwide. Biological information about this filamentous microorganism is sparse. Genomes of several P. insidiosum strains were sequenced using the Illumina short-read NGS platform, producing incomplete genome sequence data. PacBio long-read platform was employed to obtain a better-quality genome of Pythium insidiosum. The obtained genome data could promote basic research on the pathogen’s biology and pathogenicity. Data description: gDNA sample was extracted from the P. insidiosum strain Pi-S for whole-genome sequencing by PacBio long-read NGS platform. Raw reads were assembled using CANU (v2.1), polished using ARROW (SMRT link version 5.0.1), aligned with the original raw PacBio reads using pbmm2 (v1.2.1), consensus sequence checked using ARROW, and gene predicted using Funannotate pipeline (v1.7.4). The genome completion was assessed using BUSCO (v4.0.2). As a result, 840 contigs (maximum length: 1.3 Mb; N 50: 229.9 Kb; L 50: 70) were obtained. Sequence assembly showed a genome size of 66.7 Mb (178x coverage; 57.2% G-C content) that contained 20,375 ORFs. A BUSCO-based assessment revealed 85.5% genome completion. All assembled contig sequences have been deposited in the NCBI database under the accession numbers BBXB02000001 - BBXB02000840.