Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
dc.contributor.author | Papp C. | |
dc.contributor.author | Mukundan V.T. | |
dc.contributor.author | Jenjaroenpun P. | |
dc.contributor.author | Winnerdy F.R. | |
dc.contributor.author | Ow G.S. | |
dc.contributor.author | Phan A.T. | |
dc.contributor.author | Kuznetsov V.A. | |
dc.contributor.other | Mahidol University | |
dc.date.accessioned | 2023-05-31T17:02:25Z | |
dc.date.available | 2023-05-31T17:02:25Z | |
dc.date.issued | 2023-05-22 | |
dc.description.abstract | DNA sequence composition determines the topology and stability of G-quadruplexes (G4s). Bulged G-quadruplex structures (G4-Bs) are a subset of G4s characterized by 3D conformations with bulges. Current search algorithms fail to capture stable G4-B, making their genome-wide study infeasible. Here, we introduced a large family of computationally defined and experimentally verified potential G4-B forming sequences (pG4-BS). We found 478 263 pG4-BS regions that do not overlap 'canonical' G4-forming sequences in the human genome and are preferentially localized in transcription regulatory regions including R-loops and open chromatin. Over 90% of protein-coding genes contain pG4-BS in their promoter or gene body. We observed generally higher pG4-BS content in R-loops and their flanks, and in longer genes that are associated with brain tissue, immune and developmental processes. Also, the presence of pG4-BS on both template and non-template strands in promoters is associated with oncogenesis, cardiovascular disease and stemness. Our G4-BS models predicted G4-forming ability in vitro with 91.5% accuracy. Analysis of G4-seq and CUT&Tag data strongly supports the existence of G4-BS conformations genome-wide. We reconstructed a novel G4-B 3D structure located in the E2F8 promoter. This study defines a large family of G4-like sequences, offering new insights into the essential biological functions and potential future therapeutic uses of G4-B. | |
dc.identifier.citation | Nucleic acids research Vol.51 No.9 (2023), 4148-4177 | |
dc.identifier.doi | 10.1093/nar/gkad252 | |
dc.identifier.eissn | 13624962 | |
dc.identifier.pmid | 37094040 | |
dc.identifier.scopus | 2-s2.0-85159779532 | |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/82880 | |
dc.rights.holder | SCOPUS | |
dc.subject | Biochemistry, Genetics and Molecular Biology | |
dc.title | Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization | |
dc.type | Article | |
mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85159779532&origin=inward | |
oaire.citation.endPage | 4177 | |
oaire.citation.issue | 9 | |
oaire.citation.startPage | 4148 | |
oaire.citation.title | Nucleic acids research | |
oaire.citation.volume | 51 | |
oairecerif.author.affiliation | Siriraj Hospital | |
oairecerif.author.affiliation | NTU Institute of Structural Biology | |
oairecerif.author.affiliation | School of Physical and Mathematical Sciences | |
oairecerif.author.affiliation | A-Star, Bioinformatics Institute | |
oairecerif.author.affiliation | SUNY Upstate Medical University |