| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Regular Articles |
1 Sequenom, Inc., San Diego, California; 2 I. Frauenklinik, Klinikum Innenstadt, University of Munich, Munich, Germany; 3 Department of Obstetrics & Gynecology, Technical University of Munich, Munich, Germany; 4 Genomics Research Centre, School of Health Science, Griffith University Gold Coast, Gold Coast, Queensland, Australia; 5 Urology Clinic Munich-Planegg, Munich, Germany; 6 Institute of Human Genetics, University Hospital of Tuebingen, Tuebingen, Germany; and 7 Genefinder Technologies Ltd., Munich, Germany
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Family based linkage studies have proven very successful for identifying genes with highly penetrant alleles that segregate in a simple Mendelian fashion within families (3) . However, successes in which linkage methods are used have largely been restricted to rare diseases and to selected "familial" subsets of common disorders, accounting for only a small proportion of disease in the population. At least two significant problems are associated with using traditional linkage approaches to map complex traits. First, linkage mapping has low resolution, and the genetic loci implicated in such studies can include hundreds of genes. Second, complex diseases are determined by the interaction of multiple genes throughout the genome (4) . Within different families, the contributions of the different genes will likely vary, thus complicating the ability to detect and replicate linkage.
For common disease susceptibility genes, direct association approaches that compare the distribution of genetic marker frequencies between groups of unrelated individuals are expected to have greater power than traditional linkage studies (5) . Recently, there has been increasing interest in the use of whole-genome association methods to identify genes that are involved in complex trait variation (6, 7, 8) . To date, however, few large-scale studies have been reported (9, 10, 11) , and the search for common polymorphisms that influence breast cancer risk has yet to produce useful markers. In an effort to identify genes and variants that influence breast cancer susceptibility, we conducted a large-scale case-control study using >25,000 single nucleotide polymorphisms (SNPs) located within approximately 16,000 genes. We report the identification of intercellular adhesion molecule variants on chromosome 19p13.2 that confer increased breast and prostate cancer risk and might predict disease outcome.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The German replication sample consisted of 188 cases and 150 controls recruited at the Department of Obstetrics and Gynecology, Technical University of Munich. The majority of breast cancer cases were recruited at preoperative visits, and female controls were recruited from healthy individuals or patients with nonmalignant diagnoses. Median age of diagnosis for cases was 59 years (range = 2287 years), and median age of controls was 50 years (range = 1991 years). All but two participants reported both parents were of German descent. The two exceptions each reported one parent of non-German, Eastern European origin.
The Australian replication sample comprised 180 breast cancer cases recruited by the Pathology Department of Gold Coast Hospital or by the Genomics Research Center, Southport. Median age of diagnosis was 50 years (range = 2474 years). Controls consisted of 180 healthy volunteers recruited through the Genomics Research Center. Only controls with no family history of cancer or precancerous conditions were included. Controls were individually age matched to cases (±5 years). Median age for controls was 60 years (range = 2894 years).
The prostate cancer sample consisted of 368 German patients with a median age of diagnosis of 65 years (range = 4390 years) recruited at the Urology Clinic Munich-Planegg, and 368 controls without symptomatic prostate disease with a median age of 68 years (range = 2592 years) collected at the University Hospital of Tuebingen. All subjects involved reported both parents to be of German ancestry.
All subjects involved in our studies signed a written informed consent and the institutional ethics committees of participating institutions approved the experimental protocols. For discovery samples, we extracted DNA from 5 mL of blood from each subject using a desalting method and quantitated fluorometrically (Fluoroskan Ascent CF, Labsystems, Helsinki, Finland) using Pico green. DNA pools were generated by combining equimolar amounts of each sample as described elsewhere (12 , 13) .
SNP Markers and Genotyping.
A set of 25,494 SNP markers was selected from a collection of 125,799 experimentally validated polymorphic variations (14)
. This set was limited to SNPs located within gene coding regions, minor allele frequencies >0.02 (95% have frequencies >0.1), and a target inter-marker spacing of 40 kb. SNP annotation is based on National Center for Biotechnology Information dbSNP database, refSNP, build 118.8
Genomic annotation is based on National Center for Biotechnology Information genome build 34. Gene annotation is based on LocusLink genes for which National Center for Biotechnology Information was providing positions on the Mapview FTP site.9
For pooled DNA assays, 25 ng of case and control DNA pools were used for amplification at each site. We conducted all PCR and MassEXTEND (Sequenom, Inc., San Diego, CA) reactions using standard conditions (13) . Relative allele frequency estimates were derived from area under the peak calculations of mass spectrometry measurements from four analyte aliquots as described elsewhere (13) . For individual genotyping, the same procedure was applied except only 2.5 ng of DNA was used, and only one mass spectrometry measurement was taken.
Statistical Analysis.
We carried out tests of association between disease status and each SNP using pooled DNA in a similar fashion as explained elsewhere (15)
. Sources of measurement variation included pool formation, PCR/mass extension, and chip measurement. When three or more replicate measurements of a SNP were available within a model level, the corresponding variance component was estimated from the data. Otherwise, the following historical laboratory averages were used: pool formation = 5.0 x 105, PCR/mass extension = 1.7 x 104, and chip measurement = 1.0 x 104. We carried out tests of association using individual genotypes that used a
2 test of heterogeneity based on allele and genotype frequencies. The DerSimonian-Laird random effects meta-analysis method (16)
was used for the analysis of replication samples to test for the consistency of association while permitting allele frequencies to differ among samples. All tests of allele frequencies involving only replication samples are one-sided, confirming the effect observed in the discovery sample. We derived P values using the log odds of each contrast and their standard errors. Multiple approaches were explored in an effort to identify haplotypes demonstrating a stronger association with disease status than single sites. These included analyses of 15 SNP haplotypes and subsets thereof using the coalescent theory-based PHASE v2.0 (17)
and the score method that relies on the expectation-maximization algorithm (18)
. No attempt was made to correct P values for multiple testing. Rather, P values are provided to compare the relative strength of association from multiple dependent (e.g., SNPs within samples) and independent (e.g., SNPs between samples) sources of information. P values <0.05 are referred to as statistically significant.
| RESULTS |
|---|
|
|
|---|
5%) with the most statistically significant associations were remeasured in triplicate on each DNA pool. In the third step, the 74 most significant SNPs (
5%) from step two were individually genotyped in each sample. Fifty-two SNPs were confirmed to have statistically significant differences between cases and controls (P < 0.05).
Genome-wide case-control studies that use tens of thousands of SNPs and liberal statistical selection criteria are expected to yield a high proportion of false-positive associations. To distinguish the true genetic effects from the false positives, the 52 SNPs were genotyped in two independent collections of breast cancer cases and controls from Germany and Australia. The most consistent and statistically significant association was observed with a SNP located on chromosome 19p13.2. The marker SNP rs1056538, a nonsynonymous variation (V301I) in exon 5 of intercellular adhesion molecule 5 (ICAM5), had a P value of 0.001 [odds ratio (OR) = 1.5] in the discovery sample and a P value of 0.03 (OR = 1.4) and a P of 0.07 (OR = 1.3) in the German and Australian replication samples, respectively (Table 1)
. The analysis of all three samples resulted in a combined significance of P < 0.001 (OR = 1.4) and a significance of P = 0.01 (OR = 1.3) within the replication samples only.
|
|
|
The discovery collection of German breast cancer patients included information related to family history, age of onset, and severity of disease. The genotyped SNPs were tested for association with these variables. The SNP rs1056538 and rs2228615 were most strongly associated with a positive family history of breast cancer (P = 0.0065, Fishers exact test). Fifteen percent of those homozygous for the susceptible allele (C) had a positive family history, compared with 9% of the heterozygotes and none of those homozygous for the protective allele. Comparing allele frequencies between cases with a positive family history and controls results in a substantially larger estimated effect size than in all cases (OR = 3.4, P = 0.001). There was no association between rs1056538 and age of diagnosis. Also noteworthy, the SNP most strongly associated with breast cancer status in the replication samples, rs281439 located 542 bp 5' of ICAM5 (Table 1)
, was also associated with indicators of cancer severity (Table 2)
. Patients carrying the allele associated with breast cancer susceptibility (G) had a significantly shorter time span between diagnosis and recruitment, suggesting shorter survival time, and higher rates of metastases to other organs. These results suggest that one or more variants in this region are risk factors for breast and prostate cancer and may influence disease progression and prognosis.
|
| DISCUSSION |
|---|
|
|
|---|
Our findings are in agreement with previous reports on the involvement of ICAM1 in tumor progression, and the K469E variant is a potential candidate to influence this process. However, the data also suggests an influence of other variants in this region on breast and prostate cancer susceptibility. Although the genetic evidence favors ICAM5, higher relative expression levels of ICAM1 make it a more favorable candidate for predisposition and potentially tumor progression. However, current data cannot exclude any of the three ICAM genes as biologically responsible for the observed association.
The route by which these genetic associations were arrived at and the potential for spurious association must certainly be considered. Recent published work has brought much needed light to the need for proper validation to verify genetic findings for complex traits (37, 38, 39) . In the current study, the initial association found between the ICAM5 marker and breast cancer status was one result from over 25,000 hypothesis tests. A conservative Bonferroni adjustment to yield an experiment-wide type I error rate of 0.05 would demand a test-wise P value on the order of 106. Given the modest sample size, only common variations with relatively large effects (OR > 2) would reach such significance levels. Instead, we chose to be more mindful of the role of type II error rates and apply a more liberal set of criteria in the initial phases of the study and verify true genetic effects by independent replication. The analysis of 52 selected markers in the German and Australian replication samples resulted in multiple associations of continuing interest, with the ICAM5 providing the most consistent and statistically significant association (P = 0.01, OR = 1.3). This would not be considered significant on an experiment-wide level after Bonferroni adjustment, which would require a test-wise P value on the order of 0.001. Indeed, if the true size of the genetic effect is 1.4, then the total replication sample size of 368 cases and 330 controls has <50% power to reject the false null hypothesis at this level.
It should be noted that we observed two additional lines of evidence supporting the validity of the association of this region to breast cancer susceptibility. First, the risk allele for coding SNP rs1056538 is significantly more common in cases with a family history of breast cancer. Second, the independent identification of the same variation with remarkably similar effects observed in a larger prostate cancer study gives us added reason to believe that one or more variations in these ICAM genes are influencing breast and prostate cancer susceptibility in populations of European origin. These are characteristics that have also been observed for the germ line variations in CHEK2 (40 , 41) .
The successful identification of a small gene region associated with breast and prostate cancer susceptibility through large-scale association supports the promise of more comprehensive genome-wide studies. The approach presented here using DNA pools and genome-wide gene-based SNPs is very useful to quickly discover susceptible genes for various cancer types and other complex disorders. However, given the complexity of the genetic architecture underlying complex trait variation, genetic analyses, alone, are not likely to unambiguously identify the genes or genetic variations responsible. Further validation in independent samples must be carried out to support genetic evidence, and functional experiments must be conducted to identify which variations in which genetic and environmental contexts are associated with disease susceptibility and prognosis. The observed association of genetic variants to disease severity and progression has promising implications for patient management. Because ICAMs are cell surface molecules and therefore obvious targets for antibodies and small molecules, these findings also offer new opportunities for therapeutic intervention.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Requests for reprints: Andreas Braun, Sequenom Inc., 3595 John Hopkins Court, San Diego, CA 92121. Phone: 858-202-9018, Fax: 858-202-9020; E-mail: abraun{at}sequenom.com
8 http://www.ncbi.nlm.nih.gov/SNP/ ![]()
9 ftp://ftp.ncbi.nih.gov/refseq/H_sapiens/H_sapiens/maps/mapview/BUILD.34/ ![]()
Received 5/20/04. Revised 9/ 1/04. Accepted 10/ 3/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. E. Sehl, L. R. Langer, J. C. Papp, L. Kwan, J. L. Seldon, G. Arellano, J. Reiss, E. F. Reed, S. Dandekar, Y. Korin, et al. Associations between Single Nucleotide Polymorphisms in Double-Stranded DNA Repair Pathway Genes and Familial Breast Cancer Clin. Cancer Res., March 15, 2009; 15(6): 2192 - 2203. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kim, N. Kim, B. Dong, D. Boren, S. A. Lee, J. Das Gupta, C. Gaughan, E. A. Klein, C. Lee, R. H. Silverman, et al. Integration Site Preference of Xenotropic Murine Leukemia Virus-Related Virus, a New Human Retrovirus Associated with Prostate Cancer J. Virol., October 15, 2008; 82(20): 9964 - 9977. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. P. Gorlov, P. Meyer, T. Liloglou, J. Myles, M. B. Boettger, A. Cassidy, L. Girard, J. D. Minna, R. Fischer, S. Duffy, et al. Seizure 6-Like (SEZ6L) Gene and Risk for Lung Cancer Cancer Res., September 1, 2007; 67(17): 8406 - 8411. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-M. Lee, M. Shen, R. S. Chapman, M. Yeager, R. Welch, X. He, T. Zheng, H. D. Hosgood, D. Yang, S. I. Berndt, et al. Polymorphisms in immunoregulatory genes, smoky coal exposure and lung cancer risk in Xuan Wei, China Carcinogenesis, July 1, 2007; 28(7): 1437 - 1441. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Milne, G. Ribas, A. Gonzalez-Neira, R. Fagerholm, A. Salas, E. Gonzalez, J. Dopazo, H. Nevanlinna, M. Robledo, and J. Benitez ERCC4 Associated with Breast Cancer Risk: A Two-Stage Case-Control Study Using High-throughput Genotyping Cancer Res., October 1, 2006; 66(19): 9420 - 9427. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Torosyan, A. Dobi, S. Naga, K. Mezhevaya, M. Glasman, C. Norris, G. Jiang, G. Mueller, H. Pollard, and M. Srivastava Distinct Effects of Annexin A7 and p53 on Arachidonate Lipoxygenation in Prostate Cancer Cells Involve 5-Lipoxygenase Transcription Cancer Res., October 1, 2006; 66(19): 9609 - 9616. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Spinola, P. Meyer, S. Kammerer, F. S. Falvella, M. B. Boettger, C. R. Hoyal, C. Pignatiello, R. Fischer, R. B. Roth, U. Pastorino, et al. Association of the PDCD5 Locus With Lung Cancer Risk and Prognosis in Smokers J. Clin. Oncol., April 10, 2006; 24(11): 1672 - 1678. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G. Cox, S. E. Hankinson, and D. J. Hunter Polymorphisms in the ICAM Gene Locus are not Associated with Breast Cancer Risk Cancer Epidemiol. Biomarkers Prev., January 1, 2006; 15(1): 178 - 179. [Full Text] [PDF] |
||||
![]() |
D. V. Zaykin and L. A. Zhivotovsky Ranks of Genuine Associations in Whole-Genome Scans Genetics, October 1, 2005; 171(2): 813 - 823. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Ehm, M. R. Nelson, and N. K. Spurr Guidelines for conducting and reporting whole genome/large-scale association studies Hum. Mol. Genet., September 1, 2005; 14(17): 2485 - 2488. [Full Text] [PDF] |
||||
![]() |
L. A. Hefler, C. Grimm, T. Lantzsch, D. Lampe, S. Leodolter, H. Koelbl, G. Heinze, A. Reinthaller, D. Tong-Cacsire, C. Tempfer, et al. Interleukin-1 and Interleukin-6 Gene Polymorphisms and the Risk of Breast Cancer in Caucasian Women Clin. Cancer Res., August 15, 2005; 11(16): 5718 - 5721. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Rosette, R. B. Roth, P. Oeth, A. Braun, S. Kammerer, J. Ekblom, and M. F. Denissenko Role of ICAM1 in invasion of human breast cancer cells Carcinogenesis, May 1, 2005; 26(5): 943 - 950. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Atienza, R. B. Roth, C. Rosette, K. J. Smylie, S. Kammerer, J. Rehbock, J. Ekblom, and M. F. Denissenko Suppression of RAD21 gene expression decreases cell growth and enhances cytotoxicity of etoposide and bleomycin in human breast cancer cells Mol. Cancer Ther., March 1, 2005; 4(3): 361 - 368. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kammerer, R. B. Roth, C. R. Hoyal, R. Reneland, G. Marnellos, M. Kiechle, U. Schwarz-Boeger, L. R. Griffiths, F. Ebner, J. Rehbock, et al. Association of the NuMA region on chromosome 11q13 with breast cancer susceptibility PNAS, February 8, 2005; 102(6): 2004 - 2009. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |