| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Epidemiology |
1 Hematopathology Section, Department of Anatomic Pathology, and 2 Department of Hematology, Hospital Clinic, Institut d'Investigacions Biomèdiques August Pi i Sunyer, University of Barcelona; 3 Unit of Infections and Cancer, Cancer Epidemiology Research Programme, Catalan Institut of Oncology, Institut d'Investigació Biomèdica de Bellvitge, CIBER en Epidemiología y Salud Pública, L'Hospitalet; 4 Biostatistics and Bioinformatics Unit, and Translational Research Laboratory, Catalan Institut of Oncology, L'Hospitalet, Barcelona, Spain; 5 Department of Genetics, Physical Anthropology, and Animal Physiology, University of the Basque Country, Leioa, Bizkaia, Spain; and 6 Hospital Txagorritxu, Vitoria, Alava, Spain
Requests for reprints: Elias Campo, Hematopathology Section, Hospital Clínic, Villarroel 170, 08036-Barcelona, Spain. Phone: 34-93-227-5450; Fax: 34-93-227-5717; E-mail: ecampo{at}clinic.ub.es.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Family and epidemiologic studies have shown the existence of an inherited susceptibility to CLL and other B-cell lymphoid neoplasms (3–7). However, genome-wide screening of families with CLL has not identified clear candidate genes (8–11). Although part of the familial incidence of CLL could be due to high-penetrance mutations in unidentified genes, it seems plausible that common genetic variants with modest effects on disease susceptibility may be related to this risk.
Multiple association studies are been carried out to identify common causal variants to CLL, comparing the frequency of some single-nucleotide polymorphisms (SNP) in candidate genes between patients and control subjects (12). Recent developments in high-throughput genotyping technology allow the analysis of thousands of SNPs (13). Usually, these studies include nonsynonymous SNPs, but recently, other approaches have been done investigating functional SNPs, SNPs in the coding and noncoding regions, or/and tag SNPs that may maximize the recognition of the common variation across a given gene (14, 15).
To evaluate the genetic susceptibility for CLL, we performed a case-control study genotyping 768 SNPs in 692 CLL patients and 738 healthy controls from the Spanish population. The study includes nonsynonymous SNPs selected from genes involved in cancer biology, SNPs in potentially functional regions (coding and regulatory gene regions), and tag SNPs from 48 genes involved in apoptosis, immune response, DNA repair, and cell adhesion pathways. Our results suggest that common genetic variants in apoptosis and immunoregulation related genes are associated with risk of CLL.
| Materials and Methods |
|---|
|
|
|---|
SNP selection. Candidate SNPs were identified following three strategies, mainly focused to include SNPs with putative functional effect in protein structure and gene expression. The initial step was to search for nonsynonymous SNPs with possible functional effect in genes that could be relevant in CLL pathogenesis. These nonsynonymous SNPs were searched in an initial list of 618 genes involved in hematologic system development and function, immune response, cell death, DNA replication, DNA recombination, DNA repair, cell cycle, cell signaling and interaction, cell motility, and cell growth. These genes were obtained from a systematic Medline search to identify the reports published on lymphoid neoplasm and CLL using these categories as key words in the queries.
In a second step, we selected SNPs located within putative functional regions [5' upstream, 5'-untranslated region (UTR), coding region, splicing sites, first intron, 3'-UTR, and downstream]. These SNPs were searched in 48 genes that we prioritized from the genes obtained in step 1. These genes were included based on their involvement in immune response, cell death, and DNA repair pathways that were considered of potential relevance in CLL pathogenesis according to the literature review (Supplementary Table S1). The SNPs in these 48 candidate genes were searched in the 10 kb 5'upstream from the start codon, first intron, and 1 kb downstream from the stop codon of each candidate gene. The genetic polymorphisms included in this group were SNPs disrupting potential transcription factor binding sites predicted by the web software Pupas View7 (17) and SNPs located in conserved sequences in several species determined using the UCSC Genome Browser Database.8 In addition, several tag SNPs were selected to cover those SNPs located in these putative regulatory gene regions that did not show evidence of functionality. In this study, tag SNPs are defined as variants with r2
0.98 with other SNPs in HapMap individuals with European ancestry (CEU; Tagger, Haploview 3.2; ref. 18). Finally, we selected all genes that contained SNPs previously reported in the literature to be associated with CLL and other lymphoid neoplasms.
We selected only SNPs with a minor allele frequency >5% (
0.05) in Caucasic population of the HapMap-CEU Europe (19) and PERLEGEN-EUR-Panel North America reported in public available ENSEMBL genome browser9 (v33-36) working with dbSNP build 124 and National Center for Biotechnology Information (NCBI) genome build 35.1. All the SNPs selected were then filtered by Illumina technology criteria (score
0.6 or GoldenGate validated status, Illumina, Inc.).
A total of 768 SNPs from 172 genes were genotyped (Supplementary Table S2), 42.5% were located in 5' upstream and first intron regions, 40.5% in coding regions (all nonsynonymous SNP), 4.8% in intronic regions (except first intron), and 12.2% in 3' downstream gene regions.
Genotyping. Genomic DNA was isolated from peripheral blood lymphocytes using conventional DNA extraction procedures. Blood samples and written informed consent were obtained in accordance with the Institutional Ethical Committees and the Hospital Clinic Human Investigation Review Committee for Genetics Procedures. Genomic DNA was quantified using PicoGreen and diluted to a final concentration of 50 ng/µL. Genotyping was carried out at the Spanish National Genotyping Centre (CeGen) using a high-throughput platform of Illumina Bead Array System (Illumina). The results obtained for blinded duplicate blood DNA samples were concordant for all SNPs.
Statistical analysis. Genotypic frequencies in control subjects for each SNP were tested for departure from Hardy-Weinberg equilibrium using a Fisher's exact test (20).
Gene-disease associations. To test the hypothesis of association between genetic polymorphisms and CLL, multivariate methods based on logistic regression analyses were used. To quantify the degree of the association, odds ratios (OR) and 95% confidence intervals (95% CI) were calculated for each group under the codominant, dominant, recessive, and log-additive inheritance models. All analyses were adjusted for age, sex, and center of recruitment.
Because many SNPs have been explored and several inheritance models have been used, some may show significant results by chance. We assessed the robustness of the findings by calculating an estimate of the false discovery rate for significant results. False discovery rate is the expected proportion of false positives (erroneous rejections) among the significant tests (rejections). For this purpose, Q values were computed for each SNPs. Furthermore, to select a set of significant SNPs with an estimated false discovery rate <5%, SNPs with Q < 0.05 were selected (21).
Population stratification. We applied the method of Genomic Control to estimate quantitatively the amount of population stratification (22). The Genomic Control model allows for extra variance by assuming that the statistic test is inflated by a factor
that can be estimated for each of the inheritance model. The Genomic Control model approach is based on the assumption that the variance inflation factor
is approximately constant across the genome for all loci that are not associated with the disease.
Haplotype analysis. For 82 genes in which more than one SNPs was genotyped, haplotype block structure was determined with Haploview 3.32 program using the confidence intervals algorithm, which defines pairs to be in "strong linkage disequilibrium" if the one-sided upper 95% confidence bound on D' is >0.98 (i.e., consistent with no historical recombination) and the lower bound is >0.7 (18, 23). For each block, haplotypes were analyzed using the haplo.stats package for R statistical software (24), which implements the expectation maximization algorithm to estimate the haplotype frequencies. For each individual, the compatible haplotypes and their posterior probabilities were computed and coded with dummy indicator variables. The posterior probabilities were used as weights in the logistic regression models to account for uncertainty in the identification of phase-unknown haplotypes.
Genotype-gene expression association analysis. Association between genetic variants and germ-line mRNA expression differences was assessed using experimental data of 60 HapMap CEU unrelated individuals, U.S. residents with northern and western European ancestry (25). HapMap genotypes were downloaded from the release 22 (NCBI build 36) and transcript measurements from the Gene Expression Omnibus record GSE6536, which used the Illumina Sentrix Human-6 Expression BeadChip. Thus, measurements belong to immortalized lymphocytes of healthy CEU individuals. Data were compiled and processed using the R programming language, and associations were assessed with the SNPStats web software (26) by fitting linear equations and P values obtained based on the F test. Results were corroborated using two independently generated data sets in immortalized lymphocytes (27).
| Results |
|---|
|
|
|---|
|
Population stratification. To determine whether the observed association results could be due to a stratification of the population, we applied the genomic control method. Estimation of population stratification inflation factor (
) under codominant model was 1.11 (95% CI, 0.81–1.40). Supplementary Fig. S1 shows the Q-Q plot based for the adjusted (considering the
inflation parameter) and unadjusted
2 test for all inheritance models. Unadjusted and adjusted
2 plots show very similar distribution, with inflation parameter
being greater in the additive model. Despite this, after adjustment for population stratification, the 27 SNPs with significant association with CLL risk continue (P
0.01) to be significant (data not shown).
SNPs and risk of CLL. The comparison between CLL patients and controls revealed that 95 SNPs had a significant statistical association with CLL risk (P < 0.05 for any of the inheritance models; Supplementary Table S2). The 27 SNPs with a P
0.01 are shown in Table 2
. After the stringent false discovery rate adjustment to take into account the multiple comparisons, six SNPs remain significantly associated with CLL risk (Q < 0.05). These SNPs were CCNH (rs2266690, V270A), APAF1 (rs17028658, 3'region), IL16 (rs4505265, first intron), CASP8 (rs1045485, D302H), NOS2A (rs2779251, promoter), and CCR7 (rs3136687, intron 1; Table 3
). The presence of the minor allele of the CCNH, APAF1, CASP8, NOS2A, and CCR7 variants resulted in a protective effect for the risk of CLL (ORs <1; Table 3). Although the IL16 variant (rs4505265) was strongly associated with CLL risk (P = 2.2 10–6, Q = 0.0005), the OR could not be estimated because the minor allele was not observed in cases and was only present in the heterozygous genotype in controls. The homozygote genotypes for the risk allele of the APAF1 and CCR7 variants were not detected in the CLL group.
|
|
|
|
The rs4505265 and rs17337098 variants are located within the first intron of IL16 and included in the same linkage disequilibrium block. The heterozygosis of these SNP are too low to be examined in the HapMap data set (25), but the analysis of larger sample series of gene expression in lymphocytes and paired genetic data shows that variation in these cis elements correlates with germ-line mRNA expression differences of IL16 [cis logarithm of the odds of linkage (LOD) scores of 20 and 2.5 and heritability estimates for gene expression P < 10–10]. Based on these analyses, the risk allele may be associated with lower expression levels of IL16 (27, 28).
The rs17028658 variant is located a few kilobases distal from the 3'-UTR region of APAF1. Examination of HapMap data set identified significant expression differences between genotypes. The minor allele C seems to be associated with relative lower levels of APAF1; each copy of this allele decreases APAF1 expression in lymphocytes with 0.35 log 2 ratio units (95% CI, –0.65 to 0.05; log-additive model P = 0.023). The association with this allele is protective for the risk of CLL (OR, 0.42; 95% CI, 0.29–0.61). No differences in the expression levels were observed for the CCR7 variant, and the NOS2A variant could not be evaluated because it is not expressed in normal lymphocytes.
In addition, we performed a similar analysis for the 93 SNPs with positive association with CLL risk (P < 0.05) that did not pass multiple testing corrections. SNP variants in BAX, BCL2, ERCC2, GSTM3, IL1A, IL6, and LTA correlated significantly with mRNA expression differences (data not shown). Some of these variants were included in haplotypes that also showed an increased risk for CLL. These data suggest that these additional candidates should be prioritized for replication.
| Discussion |
|---|
|
|
|---|
0.01), 6 of them with a strong statistical significant relationship after correction for multiple testing (P
0.0004 and Q
0.05). In addition, 22 haplotypes in 17 of these genes were also significantly associated with an increased risk for CLL. These genes were mainly related to apoptosis and immunoregulatory pathways. Several studies have evaluated the risk of CLL using a restricted number of genetic variants in a limited number of candidate genes, placed in the coding or in the putative promoter sequences (29–39). To date, only one study has published results of high-throughput genotyping to the study of CLL risk, analyzing a large number of nonsynonymous SNPs in 992 CLL patients and 2,707 controls (13). In our study, we not only investigated a series of nonsynonymous SNPs that change the coded amino acid but also expanded the selection strategy to tag SNPs to optimize the genotyping analysis and to SNPs located in regulatory regions, including variants in the promoter, first intron, and 3'-UTR gene regions, which may modify gene expression. This approach enriches the study with putative functional SNPs located in regulatory regions. However, it has also the limitation of including a high number of candidate genes with a restricted SNP coverage. Finally, the selection of several SNPs across the same gene allowed us to explore the effect of haplotypes in CLL risk.
Our study showed a strong association with CLL risk after adjustment for multiple testing in six gene variants: CCNH (rs2266690, V270A), APAF1 (rs17028658, 3'region), IL16 (rs4505265, first intron), CASP8 (rs1045485, D302H), NOS2A (rs2779251, promoter), and CCR7 (rs3136687, intron 1).
The CCNH rs22666690 variant (V270A) was associated with a reduced risk for CLL (OR, 0.63; 95% CI, 0.53–0.74). Cyclin H forms a cyclin-dependent kinase (CDK)–activating kinase complex with CDK7 kinase and ring finger protein MAT1 (40).This complex participates in the regulation of the cell cycle by activating CDK/cyclin complexes and also facilitates the transcriptional activity of RNA polymerase II (40). The functional role of the alanine-to-valine substitution in cyclin H protein is not known, and further studies are required to establish whether this variant itself or another variant in strong linkage disequilibrium may play a role in CLL pathogenesis.
Regulation of programmed cell death (apoptosis) has a relevant role in CLL pathogenesis and response to therapy (2). Caspase-8 and apoptotic protease activating factor 1 (Apaf1) are important initiators of the apoptosis in lymphoid cells. Ours findings suggest that the C variant of rs1045485 in CASP8, which results in an aspartic acid-to-histidine substitution, is associated with a reduced CLL risk (OR, 0.56; 95% CI, 0.44–0.72). The functional consequences of this change are not known, but it is interesting that this polymorphism has been recently found to be strongly associated with reduced breast cancer risk (41), suggesting a potential common effect of this variant in human cancer susceptibility. Although we have investigated other SNPs in CASP8 gene, rs1045485 was not in linkage disequilibrium with other SNPs, indicating that this variant itself could have a pathogenetic role in CLL. The minor allele C of the APAF1 rs17028658, located a few kilobases distal from the 3'-UTR region, was associated with a protective effect of CLL (OR, 0.42; 95% CI, 0.29–0.61). The functional expression analysis suggests that this allele is associated with significantly lower levels of APAF1 expression. However, this SNP was in linkage disequilibrium with rs10860361, and therefore it is not clear whether the risk effect may be influenced by other SNPs in the same block of linkage disequilibrium.
Ours results suggest that the allele A of the rs2779251 polymorphism in the promoter region of NOS2A is protective for CLL risk (OR, 0.70; 95% CI, 0.57–0.84). NOS2A encodes for the inducible nitric oxide synthase, which produces nitric oxide. Nitric oxide is a reactive free radical that acts as a biological mediator in many physiologic processes, including immune response and antitumoral activity (42). Although inducible nitric oxide synthase is not expressed in normal lymphocytes, it is up-regulated in CLL cells and the nitric oxide produced by the activity of the enzyme confers resistance to apoptosis in these cells (43). This SNP is not in linkage disequilibrium with other polymorphisms, suggesting that this variant itself could have a causal effect.
The IL16 rs4505265 variant was strongly related to CLL risk. This SNP is in linkage disequilibrium with the rs17337098 variant also located in the first intron, and a haplotype including both SNPs was also associated with CLL risk. The in silico analysis suggests that these genetic variants and others in the same linkage disequilibrium block influence the germ-line IL16 expression differences in lymphocytes. Although the frequency of this variant in the population examined is relatively low, it is interesting that a nonsynonymous SNP (rs4072111; P434S) of this gene that is not in linkage disequilibrium with the rs4505265 variant has been recently associated with a favorable prognosis in CLL patients, suggesting that variants of this gene may play a role in the pathogenesis of the disease (44).
The last variant in association with CLL risk detected after multiple testing adjustment was the G allele of rs3136687 polymorphism in the CCR7 gene (OR, 0.40; 95% CI, 0.24–0.67). This variant is located in the first intron and it is in linkage disequilibrium with rs11574665, and the GA haplotype was also associated with CLL risk. CCR7 is a chemokine receptor and the interaction between chemokines and chemokine receptors plays a critical role in B lymphocyte migration and survival (45).
In addition to the previous six variants, we identified 27 SNPs that showed a significant statistical association with CLL risk (P < 0.01 for someone of the inheritance models; Table 2). Among them, the rs1800067 variant (R415Q) in the ERCC4 gene predicts for a possibly damaging effect on the protein structure (46). ERCC4, also known as XPF, is involved in the nucleotide excision repair pathway and plays an important role in recombination repair, mismatch repair, and possibly immunoglobulin class switching. We also found that the rs4645878 variant (G125A) of the BAX promoter showed association with decreased CLL risk (OR, 0.73; 95% CI, 0.58–0.92; P = 0.0087), and it is included in a haplotype with significant association with low CLL risk (OR, 0.76; 95% CI, 0.60–0.95; P = 0.0188). The presence of this gene variant has been correlated with lower BAX transcript and protein levels in CLL cells (33, 36), and in our study the rs11667351 variant, which is in linkage disequilibrium with rs4645878, was also associated with lower BAX mRNA levels in lymphocytes. The role of BAX in the CLL pathogenesis is not clear because CLL cells seem to have higher BAX levels than normal B lymphocytes (35). On the other hand, our observation of a protective effect for the G125A change on CLL risk is paradoxical with its apparent relationship with more advanced stage, treatment resistance, and short overall survival of CLL patients observed in some studies (36–38), suggesting a potential different role for this variant in the development and progression of the disease. Among these 27 SNPs (Table 2), it is interesting that the rs2228014 variant in the CXCR4 gene, rs4647698 in CASP3, and rs2009658 in LTATNF are located at 2q21, 4q35.1, and 6p21.3, respectively, in genomic positions close to the chromosome regions that have been recently identified as susceptibility loci for CLL (8, 11).
Finally, 22 haplotypes in APAF1, BAK1, BAX, BCL2, CASP1, CASP3, CASP10/CASP8, CCL2, CCR7, CXCL12, IL16, IL6, IL1A, LTA/TNF, and TNFRSF13B genes were significantly associated with CLL risk. Allelic variants in some of these genes may play a role in CLL pathogenesis because they were associated with different transcript levels in our analysis and previous studies have also observed the association of individual SNPs in these genes with CLL risk (29–32, 34).
Rudd and colleagues (13), in a large study of nonsynonymous SNP in CLL patients, detected association of three gene variants implicated in the DNA damage response axis and CLL risk. The design of our study and that of Rudd and colleagues have some differences in the gene and SNP selection. Thus, in addition to a number of nonsynonymous SNP, we included also polymorphisms in regulatory gene regions, tag SNPs, and SNPs with a minor allele frequency >5%. Therefore, the two studies are complementary in the pathways and strategies investigated. The findings in our study emphasizing the association between allelic variants in apoptotic and immunomodulatory genes may reflect the complexity of the genetic variants involved in the susceptibility for CLL. However, the CASP8 and CCNH nonsynonymous SNPs included in both studies were associated with CLL risk only in our study. This association was very strong (P < 4.10–6) even after correction for multiple testing. These differences between the two studies done suggest that CLL genetic susceptibility may be related to population differences. In fact, the minor allele frequencies of the CASP8 and CCNH variants in our control group (17% and 35%, respectively) were significantly higher than in the control group of Rudd's study based on a British population (13% and 21%, respectively; P < 10–5). This finding may be similar to the geographic variation observed in the immunoglobulin gene usages in CLL patients of Mediterranean and Northern European areas, highlighting the potential population-based differences in the molecular pathogenesis of the disease (47–49).
In summary, the strong association between the allelic variants of genes related to apoptosis and immunoregulation and CLL risk observed in our study suggests that they may influence the development of the disease. The confirmation of these results requires its replication in larger studies including pooled analyses of larger data sets.
| Disclosure of Potential Conflicts of Interest |
|---|
|
|
|---|
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank Elisabet Vilella for the use of Spanish controls from the DNA collection (Institut de Recerca en Ciències de la Salut) of the Sant Joan University Hospital, Reus, Spain, and the Banco de Tumores del Institut d'Investigacions Biomèdiques August Pi i Sunyer and the Spanish National Genotyping Centre (CeGen) for assistance with SNP selection.
| Footnotes |
|---|
Received 6/17/08. Revised 9/ 8/08. Accepted 9/13/08.
| References |
|---|
|
|
|---|
promoter polymorphism and the risk of chronic lymphocytic leukemia and myeloma in the Chinese population. Leuk Lymphoma 2006;47:2189–93.[CrossRef][Medline]
promoter gene polymorphism and susceptibility to B-cell chronic lymphocytic leukaemia. Int J Immunogenet 2006;33:21–4.[CrossRef][Medline]
and lymphotoxin-
genes in chronic lymphocytic leukaemia. Br J Haematol 1997;97:107–12.[CrossRef][Medline]This article has been cited by other articles:
![]() |
D. Crowther-Swanepoel, M. Qureshi, M. J. S. Dyer, E. Matutes, C. Dearden, D. Catovsky, and R. S. Houlston Genetic variation in CXCR4 and risk of chronic lymphocytic leukemia Blood, November 26, 2009; 114(23): 4843 - 4846. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Crowther-Swanepoel and R. S. Houlston The molecular basis of familial chronic lymphocytic leukemia Haematologica, May 1, 2009; 94(5): 606 - 609. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |