Molecular Biology, Pathobiology, and Genetics |
Departments of 1 Immunology and 2 Biochemistry, University of Washington School of Medicine, Seattle, Washington
Requests for reprints: Nancy Maizels, Department of Immunology, University of Washington School of Medicine, 1959 Northeast Pacific Street, Box 357650, Seattle, WA 98195. Phone 206-221-6876; Fax: 206-221-6781; E-mail: maizels{at}u.washington.edu.
| Abstract |
|---|
| Introduction |
|---|
Proto-oncogene activation by aberrant hypermutation and translocation has been causally related to B-cell lymphoma development and also correlates with aggressive tumor growth and poor disease prognosis. Somatic hypermutation of 5' noncoding regions in BCL6 and c-MYC can result in deregulation of proto-oncogene transcription, which contributes to transformation (9, 15, 19). Aberrant somatic hypermutation of c-MYC, PAX5, PIM1, and RhoH, as well as BCL6 translocation, are associated with progression from follicular lymphoma to the more aggressive diffuse large B-cell lymphoma (DLBCL), and PAX5/IgH translocations have been found in a subset of aggressive non-Hodgkin's lymphomas (5, 20). Thus, it is of considerable interest to understand the mechanisms that promote both aberrant hypermutation and translocation of proto-oncogenes.
Aberrant hypermutation and translocation events that alter proto-oncogene sequence and structure in activated B cells seem to depend on mechanisms that promote changes in genomic sequence and structure essential to the immune response (21-23). In antigen-activated B cells, somatic hypermutation produces single-base changes in the rearranged and expressed variable regions and, when coupled with selection, generates antibodies with increased affinity and specificity. Class switch recombination deletes a large region of chromosomal DNA, replacing one constant region with another and thereby optimizing antigen clearance. Antigen-activated B cells occupy special microenvironments of secondary lymphoid tissue, called germinal centers. Germinal center B cells express a mutagenic factor, activation-induced deaminase (AID), which deaminates C to U in DNA to initiate both class switch recombination and somatic hypermutation. The sequence motif WRC (W = A or T; R = G or A) is a hotspot for hypermutation of immunoglobulin variable regions and a preferential target for deamination in vitro (24). A similar pattern of mutation is produced upon aberrant somatic mutation of proto-oncogenes and immunoglobulin variable regions: mutations are preferentially targeted to the WRC motif and single-nucleotide substitutions predominate, accompanied by occasional deletions and insertions (14, 25). AID is expressed in normal germinal center B cells, in B-cell tumors including germinal center B-cell non-Hodgkin's lymphomas, and in subsets of nongerminal center B-cell non-Hodgkin's lymphomas (26). AID is required for c-Myc translocation in the mouse model for Burkitt's lymphoma and pristane-induced plasmacytoma (27); translocation, in turn, promotes tumorigenesis (28). Translocations between c-Myc and IgH are induced rapidly on induction of AID expression in primary murine B cells and depend on AID deaminase activity (29).
The ability of AID to initiate genomic instability has stimulated considerable interest in understanding how this factor is targeted to specific genes. Transcription of the target gene is a prerequisite for deamination by AID, reflecting preferential deamination of single-stranded rather than double-stranded DNA substrates (30). Nonetheless, transcription is not sufficient for deamination, and many genes that are transcribed in activated B cells are not targets for AID. Strikingly, in DLBCL, aberrant hypermutation is restricted to a subset of proto-oncogenes, including BCL6, c-MYC, PAX5, PIM1, and RhoH; systematic analysis revealed no evidence of aberrant hypermutation in about a dozen other representative genes expressed at comparable levels in germinal center B cells, including a-MYB, CD10/Calla, NBS1, and L-Plastin (14). This suggests that identification of the features that distinguish unstable proto-oncogenes from other transcribed genes could provide insights into mechanisms that target AID to specific genes in activated B cells.
Two features of genomic sequence and structure could in principle contribute to AID attack on a transcribed gene: (a) abundance of the WRC sequence motif that is the preferential target for AID and (b) G-richness. Transcription of G-rich regions, like the S regions, results in formation of unusual DNA structures. These contain a stable RNA/DNA hybrid on the template strand and single-stranded regions interspersed with G4 DNA on the G-rich strand (31). G4 DNA is a four-stranded DNA structure in which interactions between strands are stabilized by G-quartets, planar arrays of four guanines (32, 33). The structures formed in transcribed S regions can readily be observed by electron microscopy as characteristic G-loops, which are several hundred base pairs in length (31). Systematic analysis has shown that G-loops readily form upon in vitro or intracellular transcription and form within a variety of G-rich repeats, including immunoglobulin S regions, the mammalian telomeric repeat TTAGGG, multimerized synthetic G-rich sequences, and a G-rich region of the human c-MYC proto-oncogene (31, 34). G-loop formation occurs only if the nontemplate strand is G-rich. G-rich regions are the sites of immunoglobulin class switch recombination, and this is the physiologic orientation for transcription of the S regions essential to induce switch recombination. G4 DNA within G-loops is recognized by factors that promote genomic stability, including MutSα (35) and the RecQ family helicase BLM (36). AID, which promotes genomic instability, binds to ssDNA and has been directly imaged to be bound within single-stranded regions of G-loops formed at c-MYC or the S regions (34).
We have now surveyed proto-oncogenes shown to be unstable in B-cell lymphomas to establish whether they are distinguished by abundance of WRC motifs or G-rich sequence composition. Using software that we developed to characterize both these features of genomic sequence, we show that proto-oncogenes that are targets of aberrant hypermutation are not characterized by an unusually high density of WRC motifs but they are G-rich. Using electron microscopic imaging, we verify experimentally that transcription-induced G-loops form in two G-rich and unstable proto-oncogenes, BCL6 and RhoH, but not in a control gene, a-MYB, which is not a target of translocation or aberrant hypermutation. By further genomic analysis, we show that genes that are targets of aberrant hypermutation in normal B cells, CD95/Fas, B29, and MB1, are similarly G-rich but do not contain an unusual density of WRC motifs. Conversely, we show that G-richness is not characteristic of regions near 105 independent breakpoints within 49 breakpoint clusters in 15 different genes that undergo translocations in AID-negative B- and T-cell malignancies, including AF6, AF9, AML1, CBFB, E2A, ETO, MLL, MYH11, NUP98, PBX1, PML, RAP1GDS1, RARA, TOP1, and TEL1. These results establish that genes targeted for instability in B-cell malignancies and normal B cells contain G-rich regions, which can form G-loops upon transcription. Thus, G-rich sequence composition is one feature of genomic structure that can contribute to genomic instability in B cells and B-cell malignancies.
| Materials and Methods |
|---|
Quantitation of density of G-runs using G-Finder. To assess the density of G-runs and potential of a gene for G-loop formation, we designed the computer program G-Finder (Supplementary Fig. S1B). This program reads 100 characters from a FASTA formatted sequence file and records the number of G-runs in this sequence window. A G-run is defined as a stretch of three or more consecutive guanines. Each run containing three or more consecutive guanines is counted once. The window to be analyzed is shifted along the sequence in single-nucleotide increments until the end of the sequence file is reached. Output files from G-Finder were imported into Excel and plotted as bar graphs displaying the number of G-runs per 100-bp window. The average number of G-runs per window analyzed per gene was determined by summing the output in Excel and dividing by the number of windows. The average for a group of genes was determined as the sum of the averages of the group, divided by the number of genes.
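The windowing procedure described above can be sketched in a few lines of Python. This is a minimal illustration, not the G-Finder program itself: the function names (`read_fasta`, `g_runs_per_window`, `average_g_runs`, `group_average`) are hypothetical, and the sketch assumes that only full 100-nt windows are scored, that a run truncated to fewer than three guanines by a window edge is not counted, and that only the strand given in the file is analyzed.

```python
import re

# A G-run as defined in the text: three or more consecutive guanines.
# The greedy match consumes the whole run, so each run is counted once.
G_RUN = re.compile(r"G{3,}")


def read_fasta(text):
    """Return the sequence of a single-record FASTA string in uppercase."""
    lines = [ln.strip() for ln in text.splitlines()]
    return "".join(ln for ln in lines if ln and not ln.startswith(">")).upper()


def g_runs_per_window(seq, window=100):
    """Count G-runs in every full window, sliding one nucleotide at a time."""
    seq = seq.upper()
    return [len(G_RUN.findall(seq[i:i + window]))
            for i in range(len(seq) - window + 1)]


def average_g_runs(seq, window=100):
    """Per-gene average: sum of window counts divided by number of windows."""
    counts = g_runs_per_window(seq, window)
    return sum(counts) / len(counts) if counts else 0.0


def group_average(per_gene_averages):
    """Group average: sum of per-gene averages divided by number of genes."""
    return sum(per_gene_averages) / len(per_gene_averages)


# Toy example with a 10-nt window (the analysis in the text uses 100 nt).
toy = read_fasta(">toy\nAAGGG\nAAGGGGAA\n")
print(g_runs_per_window(toy, window=10))  # [2, 2, 2, 1]
```

Scoring each window independently with a regular expression keeps the sketch close to the verbal description; for a 3-kb region the cost of rescanning overlapping windows is negligible.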
Genomic sequences. Human genomic sequences analyzed were immunoglobulin Sµ region, nucleotides (nt) 1 to 3,000 (GenBank accession no. X54713); c-MYC, nt 87,842 to 90,842 (GenBank accession no. AC103819.3); BCL6, nt 37,338 to 40,338 (GenBank accession no. AC072022.19); RhoH, nt 68,817 to 71,817 (GenBank accession no. AC095057.3); PIM1, nt 26,687 to 29,687 [European Molecular Biology Laboratory (EMBL) accession no. AL353579.17]; PAX5, nt 122,041 to 125,041 (EMBL accession no. AL161781.12); a-MYB, nt 23,786 to 26,786 (GenBank accession no. AC083928.11); CD10/Calla, nt 67,225 to 64,225 (GenBank accession no. AC117384.5); L-Plastin, nt 69,980 to 72,980 (EMBL accession no. AL137141); NBS1, nt 143,751 to 146,751 (GenBank accession no. AF069291); CD95/Fas, nt 144,196 to 147,696 (EMBL accession no. AL157394.15); B29, nt 42,083 to 39,084 (GenBank accession no. AC127029.12); MB1, nt 38,328 to 41,328 (GenBank accession no. AC010616.5); TBP, nt 76,089 to 79,090 (EMBL accession no. AL031259.1); and ribosomal protein S14, nt 9,559 to 12,558 (GenBank accession no. AC011388.7).
Statistical analysis. A Mann-Whitney test (an unpaired, two-tailed nonparametric test that does not assume a Gaussian sample distribution) was done on the average G-runs per 100 bp per gene, comparing (a) unstable and stable proto-oncogenes in AID-positive lymphomas and (b) unstable proto-oncogenes in AID-positive lymphomas and germinal center B cells with unstable proto-oncogenes in AID-negative B-cell and T-cell malignancies.
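For readers who wish to reproduce this kind of comparison, the following is a minimal sketch of a two-sided Mann-Whitney test computed exactly by enumerating group assignments. It is not the software used in this study; the function names are hypothetical and the input values in the example are invented for illustration, not taken from Table 1. Exact enumeration is practical only for small groups such as the per-gene averages compared here.

```python
from itertools import combinations


def midranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def mann_whitney_exact(a, b):
    """Return (U statistic for group a, exact two-sided P value)."""
    pooled = list(a) + list(b)
    ranks = midranks(pooled)
    n_a, n = len(a), len(pooled)
    expected = n_a * (n + 1) / 2          # rank sum of a under the null
    observed = sum(ranks[:n_a])
    u_a = observed - n_a * (n_a + 1) / 2
    extreme = total = 0
    # Enumerate every way of choosing which pooled values belong to group a.
    for idx in combinations(range(n), n_a):
        total += 1
        rank_sum = sum(ranks[i] for i in idx)
        if abs(rank_sum - expected) >= abs(observed - expected) - 1e-9:
            extreme += 1
    return u_a, extreme / total


# Hypothetical per-gene averages (G-runs per 100 bp), for illustration only.
u, p = mann_whitney_exact([3.1, 2.8, 2.5], [1.0, 1.2, 0.9])
print(u, p)  # 9.0 0.1
```

With three values per group, complete separation gives the smallest attainable two-sided P value of 0.1, which illustrates why the number of genes per group limits the significance that can be reached.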
Plasmids. pBCL6 contains a 2.6-kb fragment spanning exon 1 and intron 1 of the human BCL6 gene (nt 37,288-39,865; GenBank accession no. AC072022.19), PCR amplified from human genomic DNA (Promega, Madison, WI) using synthetic oligonucleotide primers 5'-GGAGCAGGCCATACCATCGT and 5'-CTCTCTCCTGCCCCACTTTT. pRhoH contains a 4.3-kb region spanning exon 1 to intron 1 of the human RhoH gene (nt 68,546-72,845; GenBank accession no. AC095057.3), PCR amplified from human genomic DNA using primers 5'-TGGTAATTTTACTTCCATGAGG and 5'-CACTGTGACTTCAGTTTTACG. pa-MYB contains a 1.2-kb region spanning exon 1 to the 5' region of exon 2 of human a-MYB (nt 23,786-24,997; GenBank accession no. AC083928.11), PCR amplified from human genomic DNA using primers 5'-AGTGAGGATGAGGATGATGACC and 5'-GGTAAGCCTCAGATGATAAGCT. PCR products were cloned into the pCRII vector for TOPO cloning according to the manufacturer's protocol (Invitrogen, Carlsbad, CA).
Transcription and electron microscopy. Transcription was carried out for 15 min at 37°C in reactions containing 60 µg/mL supercoiled plasmid DNA, 1 mmol/L each nucleotide triphosphate, 40 mmol/L KCl, and 50 units/mL of either T7 RNA polymerase (for pa-MYB reactions) or SP6 RNA polymerase (for pBCL6 and pRhoH/TTF reactions). T7 and SP6 RNA polymerases (NEB) were added in manufacturer's buffer. Free RNA was digested by incubation with 20 µg/mL RNase A for 15 min at 37°C. DNAs were linearized at unique restriction sites by digestion with restriction enzymes (NEB) in manufacturer's buffer: pBCL6 with HindIII; pRhoH/TTF with EcoRV; and pa-MYB with BglII. Samples were spread for transmission electron microscopy as previously described (31, 34) and imaged using a JEOL 1010 transmission electron microscope at 60 kV. Images were captured with a Gatan ultrascan camera (Gatan, Pleasanton, CA) and acquired using Gatan Digital Micrograph software. Size and location of loops relative to the unique restriction site for each plasmid were measured using ImageJ (NIH).
| Results |
|---|
8 G-runs per 100 bp (evident as peak heights 8; Fig. 2C). RhoH was the least G-rich of the unstable proto-oncogenes surveyed but it did contain several extensive regions with 4 G-runs per 100 bp. In comparison, three genes not targeted for aberrant hypermutation, a-MYB, CD10, and L-Plastin, contained at most three extended regions of G-runs, and the maximum density of G-runs in those peaks was only 4 runs per 100 bp (Fig. 2D). NBS1 did contain a G-rich region but this was located very close to the promoter and may correspond to a CpG island.
G-loops form in G-rich proto-oncogenes. Both c-MYC and the immunoglobulin S regions form G-loops upon transcription, and these loops are bound by AID (34). We asked if G-richness identified by G-Finder (Fig. 2) correlated with G-loop formation by analyzing structures formed on transcription of three proto-oncogenes: BCL6, one of the most G-rich of the unstable proto-oncogenes; RhoH, the least G-rich of this subset; and a-MYB, which is neither G-rich nor unstable. Regions of these genes were cloned into plasmid vectors just downstream of a promoter for in vitro transcription to create the corresponding plasmids pBCL6, pRhoH, and pa-MYB. Plasmid DNA templates were transcribed in vitro; free RNA was digested with RNase A; and DNA was then linearized at a unique restriction site and imaged by transmission electron microscopy. G-loops were evident in 10% to 20% of the BCL6 and RhoH DNA templates (in 100 and 330 molecules, respectively), as illustrated by representative images (Fig. 3A and B). No loops were evident within transcribed a-MYB templates (0 loops in >100 molecules examined), as shown by a representative image (Fig. 3C). The sizes of G-loops ranged from 110 to 1,280 bp in BCL6 and from 120 to 770 bp in RhoH. Smaller loops may have been present but would not have been detected by electron microscopy, as the minimum loop size visible is 100 bp.
G-loops map to regions associated with genomic instability. The positions of the G-loops in transcribed plasmids pBCL6 and pRhoH were mapped with respect to the restriction cleavage sites in the plasmid templates (Fig. 3D). G-loops mapped to regions of BCL6 and RhoH targeted for aberrant hypermutation and translocation (5, 8, 9, 14, 38). Transcription-induced G-loops in c-MYC similarly map to the zone associated with instability (34).
G-richness characterizes genes that mutate in normal germinal center B cells. The results above show that G-richness correlates with genomic instability in DLBCL. Aberrant hypermutation can alter the sequences of non-immunoglobulin genes not only in tumors but also in normal germinal center B cells, where one target is BCL6, which is G-rich (Fig. 1). Mutations have also been documented in CD95/Fas, B29, and MB1 (39, 40), and somatic mutation of CD95/Fas and B29 also occurs in malignancies including multiple myeloma, Hodgkin's lymphoma, non-Hodgkin's lymphoma, and chronic lymphocytic leukemia (39, 41-43). To establish how density of WRC motifs and G-richness might contribute to aberrant hypermutation of CD95/Fas, B29, and MB1, we analyzed these genes with WRC-Finder and G-Finder (Supplementary Fig. S2A). In comparison, we examined two genes shown not to be mutated in normal germinal center B cells, TBP and S14 (44), which encode the TATA binding protein and ribosomal protein S14, respectively (Supplementary Fig. S2B). In each case, analysis focused on a 3-kb region bounded by the 5' border of exon 1, which includes the hypermutating zone. The average density of WRC motifs was 15 ± 1.2 in genes targeted for aberrant hypermutation and 17 ± 2.0 in the control genes (Table 1). In contrast, the average density of G-runs was 2.5 ± 1.0 in the former group and 1.4 ± 0.9 in the controls (Table 1), which corresponds to a 1.6-fold difference in G-richness. Thus, G-richness correlated with instability in AID-positive B cells.
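As an illustration of how WRC motif density can be tallied, the sketch below counts WRC (W = A or T; R = G or A) on the given strand and, optionally, its reverse complement GYW on the same strand, which corresponds to WRC on the opposite strand. It is not the WRC-Finder program: the function names, the choice to count both strands, and the expression of density as motifs per 100 bp of sequence are all assumptions made for this example.

```python
import re

# WRC on the given strand: W = A or T, R = G or A, followed by C.
WRC = re.compile(r"[AT][AG]C")
# GYW on the given strand is WRC read on the opposite strand (Y = C or T).
GYW = re.compile(r"G[CT][AT])".replace(")", ""))


def count_wrc(seq, both_strands=True):
    """Count WRC motifs; matches on one strand cannot overlap one another."""
    seq = seq.upper()
    total = len(WRC.findall(seq))
    if both_strands:
        total += len(GYW.findall(seq))
    return total


def wrc_density(seq, both_strands=True):
    """Motifs per 100 bp of sequence (an assumed unit, for illustration)."""
    return 100.0 * count_wrc(seq, both_strands) / len(seq)


# Toy sequence: AAC, TGC, AGC on the given strand; GCA, GCT mark the other.
print(count_wrc("AACTGCAGCT"))                      # 5
print(count_wrc("AACTGCAGCT", both_strands=False))  # 3
```

Because a WRC match must end in C while W and R exclude C, two matches on the same strand can never overlap, so a simple non-overlapping regular-expression scan counts every occurrence.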
Local differences were noted within specific genes. In CD95/Fas, hypermutation is not uniformly distributed but concentrates within a short (400 bp) region near the 5' end of the gene. CD95/Fas was not uniformly G-rich but contained a very G-rich region at the 5' end, which coincided with the region prone to hypermutation. WRC motifs were uniformly distributed along the gene (Supplementary Fig. S2A), similar to the distribution of WRC motifs in VH1-2 and VH2-5 (Fig. 1A). Both B29 and MB1 were G-rich and both contained regions in which WRC motifs were quite abundant. Neither TBP nor S14 contained an unusual density of G-runs, although S14 did contain a 5' G-rich region; this may correspond to a CpG island, as in NBS1 (Fig. 2D).
G-richness does not correlate with translocation breakpoints in AID-negative leukemias. Translocation of specific genes characterizes not only AID-positive B-cell lymphomas but also a variety of hematopoietic malignancies. To establish whether G-richness correlates with translocation in AID-negative tumors, we analyzed recurrent translocation breakpoints in 15 genes from 7 different AID-negative B- and T-cell malignancies, including acute myeloid leukemia, childhood and adult acute lymphoblastic leukemia, therapy-related myelodysplastic syndrome, T-cell acute lymphocytic leukemia, childhood acute lymphoblastic leukemia, and acute promyelocytic leukemia. AID is not expressed in the originating cell types, nor has AID expression been documented in the resulting malignancies. The genes analyzed were MLL, AF9, AF6, AML1, CBFB, E2A, ETO, MYH11, NUP98, PBX1, PML, RAP1GDS1, RARA, TEL1, and TOP1. Recurrent translocations have been mapped to clustered breakpoints within transcribed regions in all these genes. (See Supplementary Fig. S3 for references and breakpoint accession numbers.) Sequences within 1.5 kb upstream and downstream of breakpoints were analyzed by G-Finder, plotting G-runs per 100 bp separately for each breakpoint region analyzed. If more than one breakpoint fell within a 1.5-kb region, they were analyzed together as a cluster. A total of 105 breakpoints, within 49 separate clusters, were analyzed in these 15 genes (Supplementary Fig. S3). This analysis showed that the majority of breakpoints fell within regions in which G-runs peaked at or below 4 per 100 bp.
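The grouping of breakpoints into clusters and the choice of flanking sequence can be sketched as follows. This is one possible reading of the rule stated above, not the procedure actually used: it assumes that a breakpoint joins the current cluster when it lies within 1.5 kb of that cluster's first breakpoint, that the analyzed region extends 1.5 kb beyond the outermost breakpoints of a cluster, and that positions are 0-based offsets into the gene sequence. The function names and the example coordinates are hypothetical.

```python
def cluster_breakpoints(positions, span=1500):
    """Group sorted breakpoints so that each cluster spans at most `span` bp."""
    clusters = []
    for pos in sorted(positions):
        if clusters and pos - clusters[-1][0] <= span:
            clusters[-1].append(pos)   # within span of the cluster's first site
        else:
            clusters.append([pos])     # start a new cluster
    return clusters


def analysis_window(cluster, flank=1500, seq_len=None):
    """Return (start, end) of the region to score, clipped to the sequence."""
    start = max(0, cluster[0] - flank)
    end = cluster[-1] + flank
    if seq_len is not None:
        end = min(end, seq_len)
    return start, end


# Invented coordinates, for illustration only.
clusters = cluster_breakpoints([5000, 5200, 6400, 9000, 6600])
print(clusters)                       # [[5000, 5200, 6400], [6600], [9000]]
print(analysis_window(clusters[0]))   # (3500, 7900)
```

Each returned window could then be passed to a G-run counter to obtain the G-runs per 100 bp profile for that breakpoint region.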
To compare G-richness of regions analyzed, we tabulated densities of G-runs as a per gene average (Supplementary Table S1). We graphed the average G-run density per unstable gene region analyzed for two groups of genes: those that are unstable in AID-positive DLBCL or in germinal center B cells and those that are unstable in AID-negative B- and T-cell malignancies (Fig. 4). The average density of G-runs near breakpoints in AID-negative malignancies was 1.00 ± 0.4 (Table 1). This is 2.1-fold lower than the average for genes that are targets of instability in AID-positive DLBCL alone (P = 0.0009) and 2.26-fold lower than the average for genes that are targets of instability in all AID-positive B cells examined, including DLBCL and normal germinal center B cells (P < 0.0001; Fig. 4). Thus, G-richness correlates with instability in AID-positive but not AID-negative B and T cells.
| Discussion |
|---|
Genes that were unstable in AID-positive B cells were not distinguished by an elevated density of WRC sequence motifs, the consensus target of AID (Table 1). In certain instances, locally high densities of the WRC motif may increase instability within limited regions of specific genes. This could occur within a region of intron 1 of c-MYC, which is G-rich and also contains a very high local density of WRC motifs (which corresponds to nt 366-1,989 in Figs. 1B and 2C), and which is targeted for translocation (45).
Quantitation of densities of G-runs at regions of instability identified a significantly higher average density in genes that were targets of instability in AID-positive than in AID-negative cells. As shown in Fig. 4, the average density of G-runs near regions of gene instability was 2.26-fold higher in AID-positive than in AID-negative B and T cells (P < 0.0001). These results argue that instability is AID dependent in germinal center B cells and tumors arising from this cell type.
We propose that G-rich sequence composition contributes to genomic instability of specific oncogenes in B-cell lymphomas, just as it contributes to recombination of the G-rich mammalian switch regions, by enhancing AID-initiated DNA deamination. As outlined in Fig. 5, transcription of a G-rich region would result in G-loop formation, increasing accessibility of single-stranded regions on the nontemplate DNA strand to the B-cell-specific DNA deaminase AID. G-loops are targets for AID (34), and AID preferentially deaminates single-stranded regions of DNA (30). AID is known to be critical to translocations leading to lymphomagenesis in murine models (14, 23, 46). A link between AID and aberrant hypermutation has recently been documented in human B cells infected with EBV, which results in AID overexpression and increased mutation of both BCL6 and P53 (47). The mechanism diagrammed in Fig. 5 could promote aberrant mutation as well as translocation.
(35). Analysis of tumors deficient in these factors may reveal other examples of instability at G-rich regions.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank the members of the Maizels laboratory, especially Johanna Eddy, for valuable discussions.
| Footnotes |
|---|
Present address for M.L. Duquette: Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037.
Present address for M.D. Huber: Department of Cellular Biology, The Scripps Research Institute, La Jolla, CA 92037.
Received 7/3/06. Revised 1/16/07. Accepted 1/18/07.
| References |
|---|
, CD79a). Proc Natl Acad Sci U S A 2003;100:4126-31.