Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization

dc.contributor.authorPapp C.
dc.contributor.authorMukundan V.T.
dc.contributor.authorJenjaroenpun P.
dc.contributor.authorWinnerdy F.R.
dc.contributor.authorOw G.S.
dc.contributor.authorPhan A.T.
dc.contributor.authorKuznetsov V.A.
dc.contributor.otherMahidol University
dc.date.accessioned2023-05-31T17:02:25Z
dc.date.available2023-05-31T17:02:25Z
dc.date.issued2023-05-22
dc.description.abstractDNA sequence composition determines the topology and stability of G-quadruplexes (G4s). Bulged G-quadruplex structures (G4-Bs) are a subset of G4s characterized by 3D conformations with bulges. Current search algorithms fail to capture stable G4-B, making their genome-wide study infeasible. Here, we introduced a large family of computationally defined and experimentally verified potential G4-B forming sequences (pG4-BS). We found 478 263 pG4-BS regions that do not overlap 'canonical' G4-forming sequences in the human genome and are preferentially localized in transcription regulatory regions including R-loops and open chromatin. Over 90% of protein-coding genes contain pG4-BS in their promoter or gene body. We observed generally higher pG4-BS content in R-loops and their flanks, longer genes that are associated with brain tissue, immune and developmental processes. Also, the presence of pG4-BS on both template and non-template strands in promoters is associated with oncogenesis, cardiovascular disease and stemness. Our G4-BS models predicted G4-forming ability in vitro with 91.5% accuracy. Analysis of G4-seq and CUT&Tag data strongly supports the existence of G4-BS conformations genome-wide. We reconstructed a novel G4-B 3D structure located in the E2F8 promoter. This study defines a large family of G4-like sequences, offering new insights into the essential biological functions and potential future therapeutic uses of G4-B.
dc.identifier.citationNucleic acids research Vol.51 No.9 (2023) , 4148-4177
dc.identifier.doi10.1093/nar/gkad252
dc.identifier.eissn13624962
dc.identifier.pmid37094040
dc.identifier.scopus2-s2.0-85159779532
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/82880
dc.rights.holderSCOPUS
dc.subjectBiochemistry, Genetics and Molecular Biology
dc.titleStable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85159779532&origin=inward
oaire.citation.endPage4177
oaire.citation.issue9
oaire.citation.startPage4148
oaire.citation.titleNucleic acids research
oaire.citation.volume51
oairecerif.author.affiliationSiriraj Hospital
oairecerif.author.affiliationNTU Institute of Structural Biology
oairecerif.author.affiliationSchool of Physical and Mathematical Sciences
oairecerif.author.affiliationA-Star, Bioinformatics Institute
oairecerif.author.affiliationSUNY Upstate Medical University

Files

Collections