| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Epidemiology and Prevention |
1 Broad Institute of MIT and Harvard, Cambridge, Massachusetts; Departments of 2 Medicine, 3 Molecular Biology, 4 Hematology/Oncology, and 5 Molecular Biology and Diabetes Unit, Massachusetts General Hospital, Boston, Massachusetts; 6 Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California; and 7 Etiology Program, Cancer Research Center of Hawaii, University of Hawaii, Honolulu, Hawaii
Requests for reprints: Christopher A. Haiman, University of Southern California/Norris Comprehensive Cancer Center, Room 4441, 1441 Eastlake Avenue, Los Angeles, CA 90089-9175. Phone: 323-865-0429; Fax: 323-865-0127; E-mail: Haiman{at}usc.edu.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Previous association studies of BRCA1 variants and breast cancer risk have focused primarily on variation in the coding region (9, 10). It is also a possibility, however, that common germ line variation in the noncoding (i.e., regulatory) regions of genes may influence predisposition to cancer (11). The BRCA1 gene lies in a region of extensive linkage disequilibrium (>200 kb), which makes it possible to test a subset of variants in the region that represents the bulk of all common genetic variation at the locus (12).8 Dunning et al. (9) typed four missense SNPs in a study of breast and ovarian cancer in White women, but found no significant associations between any of these variants or the three common haplotypes they formed and cancer risk. More recently, using sequencing data (across the majority of the locus) generated from National Institute of Environmental Health Sciences Environmental Genome Project,9 Cox et al. (13) identified five common haplotypes (
5%) that could be predicted by four tagging SNPs. Testing of these SNPs among White women in the Nurses' Heath Study showed a weakly significant association of one of the haplotypes with breast cancer risk (13).
Evaluation of variation in both coding and noncoding regions across a locus has only recently become possible because of large-scale efforts to identify and catalogue single nucleotide polymorphisms in the genome (14, 15). In the present study, we typed a large number of common variants, and by testing single sites and the ancestral haplotypes they form in a large and multiethnic breast cancer study, aimed to more comprehensively survey common genetic variation at the BRCA1 locus in relation to breast cancer risk.
| Materials and Methods |
|---|
|
|
|---|
Incident cancers in the Multiethnic Cohort Study are identified by cohort linkage to population-based cancer Surveillance, Epidemiology, and End Results registries covering Hawaii and Los Angeles County, and to the California State cancer registry covering all of California. Information on stage of disease at the time of diagnosis is also collected from the cancer registries; women were classified as having advanced breast cancer when there was evidence of dissemination beyond the breast at diagnosis (stage II or higher).
Beginning in 1994, blood samples were collected from incident breast cancer cases. At this time, blood collection was also initiated in a random sample of Multiethnic Cohort Study participants to serve as a control pool for genetic analyses in the cohort. The participation rates for providing a blood sample were 74% and 66% for cases and controls, respectively. Eligible cases in this nested breast cancer case-control study consisted of women with incident breast cancer (including second primaries) diagnosed after enrollment in the Multiethnic Cohort Study through May 31, 2003. Controls were women without breast cancer before entry into the cohort and without a diagnosis up to May 31, 2003. The breast cancer case-control study consists of 1,715 invasive breast cancer cases and 2,502 controls and was approved by the Institutional Review Boards at the University of Southern California and at the University of Hawaii.
Characterizing linkage disequilibrium and haplotypes. To characterize variation, we implemented a haplotype-based approach to examine common variation throughout the BRCA1 gene as described in detail previously (17). We initially surveyed common genetic variation across 108.1 kb of the BRCA1 gene using markers from the public SNP map. Our goal was to capture the common haplotype patterns across the BRCA1 locus. To do this, we genotyped 28 SNPs across the locus selected from the National Center of Biotechnology Information SNP database.10 This SNP map included six missense SNPs that are relatively common (
5% in at least one ethnic group in the Multiethnic Cohort Study) and have been previously reported in the BIC database11 and/or the literature (Q356R, P871L, E1038G, S1140G, K1183R, and S1613G; ref. 9). SNPs were genotyped in a multiethnic panel of 349 women in the Multiethnic Cohort Study without a history of cancer (n = 69-70 per ethnic group). This sample size guaranteed that any haplotype with a frequency of >5% will be represented at least once among the 140 chromosomes with probability >99%.
The |D'| and r2 statistics were used to assess pairwise linkage disequilibrium between SNPs as described (18, 19). Linkage disequilibrium block structure was examined using the 90% confidence bounds of D' to define sites of historical recombination between SNPs with minor allele frequencies >10% (20).
Haplotype construction and tagging single nucleotide polymorphism selection. Haplotype frequency estimates were constructed from the genotype data in the multiethnic panel (one ethnicity at a time) within linkage disequilibrium blocks using the expectation-maximization algorithm of Excoffier and Slatkin (21). The squared correlation (Rh2) between the true haplotypes (h) and their estimates were then calculated as described (22). Tagging SNPs for the case-control study were then chosen by finding the minimum set of SNPs which would have Rh2 > 0.7 for all haplotypes with an estimated frequency of >5%. We included all missense SNPs as tagging SNPs before minimizing the number of tagging SNPs required to predict the common haplotypes. The calculation of Rh2 is described in detail by Stram et al. (22) and a computer program (tagSNPs) for the calculation is available at D. Stram's website.12 Pairwise r2 measures were calculated13 to assess how well the chosen tagging SNPs were correlated with the variants that were not typed in the breast cancer cases and controls.
Comparison of tagging single nucleotide polymorphism and haplotype frequencies between cases and controls. We evaluated associations with the tagging SNPs and missense SNPs, and the common haplotypes within each block. Haplotype frequencies among breast cancer cases and controls were estimated using the tagging SNPs selected to distinguish the common haplotypes (>5% frequency) for each ethnic group in the multiethnic panel as described (17). We first did global likelihood ratio tests to test whether the frequency distributions of the common haplotypes within each block differed between cases and controls. We used the methods described by Zaykin et al. (23) to perform global tests of no association between haplotypes and cancer risk and to estimate haplotype-specific odds ratios (OR). Briefly, the expectation-maximization algorithm was run to estimate haplotype frequencies for the tagging SNPs for a combined data set (cases + controls) and individual estimates of haplotype count (expected number of copies of each haplotype carried by each individual) from the expectation-maximization were output to an external file and merged with case-control status. These estimates were then used as explanatory variables in logistic regression analysis, in place of true haplotype count, first to perform a global test of no association between haplotypes and risk and next to estimate haplotype-specific ORs for each common haplotype. ORs and 95% confidence intervals (95% CI) were estimated for each haplotype and missense SNP using unconditional logistic regression. Associations with SNPs and haplotypes were examined in ethnic-stratified analyses, and summary ORs were presented adjusted for ethnicity and age. Adjustment for the established breast cancer risk factors (24), first-degree family history of breast cancer, body mass index, parity, age at first birth, age at menarche, menopausal status, type of menopause, age at menopause, use of hormone replacement therapy, and alcohol consumption did not affect our results. We used the Statistical Analysis System for all analyses (version 8.2, SAS Institute, Inc., Cary, NC).
Genotyping. DNA for the multiethnic panel of breast cancer cases and controls was extracted from WBC fractions using the Qiagen Blood Kit (Qiagen, Chatsworth, CA). Genotyping for linkage disequilibrium and haplotype discovery was done by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry using the Sequenom platform at the Broad Institute. Genotyping of the tagging SNPs and missense SNPs was done by the 5' nuclease TaqMan allelic discrimination assay using the ABI 7900 (Applied Biosystems, Foster City, CA) in the USC Genotyping Laboratory. The E1038G missense SNP was not genotyped in the case-control study because of a nearby SNP (6 bp away) that interferes with probe binding and results in genotype misclassification. SNP 18 (K1183R; rs16942), however, is a good proxy for this common missense SNP (r2
0.94 in all ethnic groups). Replicate blinded quality control samples were included to assess reproducibility of the genotyping procedure; concordance was >99% in the multiethnic panel and in the case-control study.
| Results |
|---|
|
|
|---|
3.9 kb; Table 1)from 21.5 kb upstream through 7.0 kb downstream of the BRCA1 gene (total distance, 108.1 kb)to characterize the pattern of linkage disequilibrium in this region. Consistent with other reports (12), we observed a region of strong linkage disequilibrium (in all ethnic groups) across the BRCA1 locus (Fig. 1). Our goal was to genotype at least six high frequency markers (
10%) in the region of strong linkage disequilibrium to ensure that we thoroughly sampled the common diversity in the region (20).
|
|
5% in at least one ethnic population (Table 1). Common BRCA1 haplotypes accounted for at least 79% of all chromosomes in each ethnic group. We observed differences in haplotype number and distribution between the African Americans and the other four populations. The African American population was the most diverse population, possessing eight common haplotypes, followed by Whites with seven, Latinos with six, and Japanese and Native Hawaiians each with four. Interestingly, five of the haplotypes among African Americans were only rarely observed in the other populations (
1% in all populations; haplotype 11, 2% among Latinos), and yet, they accounted for the majority (56%) of the total haplotype diversity in the African American sample. We selected 14 markers as tagging SNPs (forcing in the five missense SNPs) that strongly predicted the common haplotypes (>5% frequency) in each ethnic group in the multiethnic panel (Table 1). The average Rh2 in predicting the common haplotypes across all ethnic groups was 0.87 and we observed only minor differences in haplotype frequencies predicted solely by the tagging set of SNPs versus haplotype frequencies as defined by all of the SNPs (Table 1). To assess how well the 14 tagging SNPs captured the unmeasured SNPs (the remaining 14 SNPs), we computed the pairwise r2 values (correlations) of the tag SNPs to each unmeasured variant. All r2 values were above 0.8 (average r2 across populations = 0.98) except for one variant among African Americans (rs799907, r2 = 0.53). Thus, we believe that the chosen tagging SNPs (and the haplotypes they define) provide good prediction of all SNPs assayed in the multiethnic panel and that common variation was thoroughly characterized at this locus.
Associations of missense and tagging single nucleotide polymorphisms in BRCA1 and breast cancer risk. All SNPs conformed to Hardy-Weinberg equilibrium among controls within each ethnic group (data not shown). We observed no evidence of significant associations between any of the missense SNPs and breast cancer risk (Table 2). In addition, no significant associations were noted with any of the tagging SNPs (data not shown).
|
|
| Discussion |
|---|
|
|
|---|
20% of familial breast cancer, but <5% of all breast cancers (28). Thus, the genetic elements contributing to the vast majority of breast cancer cases remain largely unknown. The role of common missense SNPs, as well as variation in noncoding regions (that may influence risk through expression levels and alternative splicing), have yet to be thoroughly explored (at single loci as well as throughout the genome) as markers of breast cancer susceptibility. Ongoing efforts to systematically characterize genetic polymorphisms, such as the International HapMap Project (15), provide the foundation for conducting comprehensive association studies of common variation. In the present study, we thoroughly tested common variation in BRCA1 in a large multiethnic population and found no evidence for common germ line variation in BRCA1 playing a significant role in sporadic breast cancer. Another recent study (13) reported a modestly positive association between a BRCA1 haplotype and breast cancer among White women in the Nurses' Health Study (OR, 1.18, 95% CI, 1.02-1.37). Our results do not support these findings; however, among the populations that carried this haplotype with a frequency of at least 5% (haplotype 2: Whites, 16%; Latinos, 9%), we only had 70% power to detect a relative risk of 1.3 for a dominant allele (assuming an Rh2 of 0.9). Overall, our study was well powered to detect effects for alleles that were shared across ethnic populations; we had 94% statistical power to detect a relative risk of 1.35 for a dominant allele with a frequency of 10%, and 70% power to detect a relative risk of 2.2 for a recessive allele with a frequency of 10%.
The vast majority of common variation is shared between populations; however, allele frequencies are known to vary across populations (29) and studies conducted in a multiethnic population may lend insight into better understanding ethnic differences in breast cancer risk (24). In this study, we observed most of the common BRCA1 haplotypes to be shared among Native Hawaiians, Japanese, Latinos, and Whites; four of the eight common haplotypes were found in at least three ethnic populations and six of the eight were found in at least two groups. African Americans, however, were observed to have five haplotypes that were uncommon in the other groups (most < 1%). The SNPs that defined these African Americanspecific haplotypes were rare or did not exist in the other populations (SNPs: 1, 2, 4, 8, and 11), which suggests that these alleles either did not make it through the Out of Africa bottleneck that occurred
50,000 to 100,000 years ago (30) or did make it through the bottleneck and then drifted to low frequencies.
Although we have thoroughly addressed the role of common variation in BRCA1 and sporadic breast cancer risk, it remains a possibility that this locus may still prove to be involved in breast cancer risk. Specifically, rare (<5%) variants (that our sample size is not adequately powered to test) may contribute to disease; to address this hypothesis, however, large-scale resequencing efforts (to discover the rare variants) and testing of these variants in even larger cohorts, such as the National Cancer Institute Consortium of Cohorts,14 will be required. Another possibility is that what we label as sporadic breast cancer is actually a collection of genetically distinct subclasses of breast cancer. In this scenario, it would be unlikely that the same set of underlying susceptibility alleles occur in all breast cancer cases (i.e., the genetic architecture of disease is not genetically homogeneous). If these subgroups are not recognized and separately analyzed, then the power to detect them will be diminished. At the histologic and molecular levels, breast tumors have different characteristics; subsets of breast tumors as defined by immunohistochemistry [e.g., estrogen-receptor (+/) and HER2/neu(+/)] often display different biological behaviors, such as time to disease progression and response to therapy, which may reflect different genetic origins. Studies have shown that breast tumors of women with hereditary breast cancer with mutations in BRCA1 and a subset of women with sporadic disease (
25%) share similar traits, including a basal cell histology, higher-grade tumors, cytokeratins 5/6, and estrogen receptor negativity, suggesting they may have a similar etiology (3133). The ability to stratify breast cancer cases by expression profiling, immunohistochemistry, methylation patterns, and/or clinical variables may facilitate the identification of more genetically homogeneous subsets of cancer cases, and therefore may help to identify causal variants underlying specific breast cancer phenotypes.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank the participants of the Multiethnic Cohort Study for their participation and commitment. We also thank Loreall Pooler and David Wong for their laboratory assistance, and Dr. Kristine Monroe and Hank Huang for their support with the study design.
| Footnotes |
|---|
9 http://egp.gs.washington.edu/. ![]()
10 http://www.ncbi.nlm.nih.gov/SNP/. ![]()
11 http://research.nhgri.nih.gov/bic/. ![]()
12 http://www-rcf.usc.edu/~stram. ![]()
13 http://www.broad.mit.edu/mpg/tagger/. ![]()
14 http://epi.grants.cancer.gov/BPC3/. ![]()
Received 1/14/05. Revised 5/17/05. Accepted 6/ 2/05.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K Y-K Chan, W Liu, J-R Long, S-P Yip, S-Y Chan, X-O Shu, D T-T Chua, A N-Y Cheung, J C-Y Ching, H Cai, et al. Functional polymorphisms in the BRCA1 promoter influence transcription and are associated with decreased risk for breast cancer in Chinese women J. Med. Genet., January 1, 2009; 46(1): 32 - 39. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Chen, J. Liang, Z. Wang, X. Zhou, L. Chen, M. Li, D. Xie, Z. Hu, H. Shen, and H. Wang Association of Common PALB2 Polymorphisms with Breast Cancer Risk: A Case-Control Study Clin. Cancer Res., September 15, 2008; 14(18): 5931 - 5937. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Haiman, C. Hsu, P. I.W. de Bakker, M. Frasco, X. Sheng, D. Van Den Berg, J. T. Casagrande, L. N. Kolonel, L. Le Marchand, S. E. Hankinson, et al. Comprehensive association testing of common genetic variation in DNA repair pathway genes in relationship with breast cancer risk in multiple populations Hum. Mol. Genet., March 15, 2008; 17(6): 825 - 834. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Douglas, A. M. Levin, K. A. Zuhlke, A. M. Ray, G. R. Johnson, E. M. Lange, D. P. Wood, and K. A. Cooney Common Variation in the BRCA1 Gene and Prostate Cancer Risk Cancer Epidemiol. Biomarkers Prev., July 1, 2007; 16(7): 1510 - 1516. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. McKay, R. Kaaks, M. Johansson, C. Biessy, F. Wiklund, K. Balter, H.-O. Adami, C. Boillot, L. Gioia-Patricola, F. Canzian, et al. Haplotype-Based Analysis of Common Variation in the Growth Hormone Receptor Gene and Prostate Cancer Risk Cancer Epidemiol. Biomarkers Prev., January 1, 2007; 16(1): 169 - 173. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Haiman, D. O. Stram, I. Cheng, E. E. Giorgi, L. Pooler, K. Penney, L. Le Marchand, B. E. Henderson, and M. L. Freedman Common Genetic Variation at PTEN and Risk of Sporadic Breast and Prostate Cancer. Cancer Epidemiol. Biomarkers Prev., May 1, 2006; 15(5): 1021 - 1025. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |