| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Epidemiology |
1 Epidemiology Program, Cancer Research Center of Hawaii, University of Hawaii; 2 Kaiser Permanente Hawaii, Moanalua Medical Center; 3 Pacific Health Research Institute; 4 Division of Medical Technology, John A. Burns School of Medicine, University of Hawaii at Manoa; 5 Kaiser Permanente Center for Health Research, Honolulu, Hawaii; and 6 Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
Requests for reprints: Barbara S. Saltzman, Epidemiology Program, Cancer Research Center of Hawaii, University of Hawaii, Suite 407, 1236 Lauhala Street, Honolulu, HI 96813. Phone: 808-586-2954; E-mail: Barbara{at}crch.hawaii.edu.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Certain inherited variants in the promoter region of the TGFB1 gene (G–800A and C–509T) and in exon 1 (L10P and R25P) and exon 5 (T2631) have been associated with higher circulating concentrations in a small number of studies (5–7) and, thus, could possibly affect cancer risk (8, 9). The T263I single-nucleotide polymorphism (SNP) is located in the pro-protein portion of TGFβ-1 that is cleaved from the active part of the protein. It has been hypothesized that activation of the latent TGFβ-1 protein by cleavage of pro-TGFβ-1 is an important event in the signaling cascade leading to growth control (10, 11). Similarly, the G–800A SNP is located in a consensus cyclic AMP response element binding protein (CREB) half site and may cause reduced affinity for CREB transcription factors (5) whose binding is important for transcription control.
No comprehensive study of the genetic variation in the TGFB1 gene and its relationship to circulating levels or colorectal cancer risk has been reported. The only data available are from three recent reports, one on L10P with colorectal polyps (12), one on C–509T and colorectal cancer (13), and the third on the four previously mentioned SNPs and advanced colorectal adenoma (14). In the first study (12), no association was found with L10P and adenoma but a lower risk of hyperplastic polyps was suggested for P allele carriers who were current or past smokers (Ptrend = 0.05). In the second study, no association was found between C–509T and colorectal cancer. In the third study (14), –509TT and 10PP genotypes were associated with advanced colorectal adenoma {odds ratio (OR), 1.89 [95% confidence interval (95% CI), 1.16–3.09] and 1.37 (95% CI, 1.02–1.86), respectively}. The –509TT association was greater for the subsets of participants with multiple adenomas and those with rectal adenomas.
The goal of this study was to characterize the genetic variation in TGFB1 and use the data from two case-control studies to clarify the associations of SNPs and haplotypes at this locus with circulating levels of TGFβ-1 and risk of colorectal adenoma and adenocarcinoma.
| Materials and Methods |
|---|
|
|
|---|
Adenocarcinoma Subjects
The second study has previously been described in detail (16). Cases diagnosed with a first adenocarcinoma of the colon or rectum before age 85 years were identified through the Hawaii Tumor Registry, a member of the U.S. National Cancer Institute Surveillance, Epidemiology and End Results program, between July 1994 and December 1999. Cases were eligible if they were Japanese, Caucasian, or Native Hawaiian residents of the island of Oahu. Controls were selected from an ongoing health survey conducted by the Hawaii State Department of Health. This survey, modeled after the National Health Information Survey, takes an annual random 2% sample of households within the state. An additional source of controls
65 years of age was Medicare participants on Oahu. One control was individually matched to each case by sex, ethnicity, and age (±2 years). Personal interviews were obtained from 768 matched case-control pairs. The participation rate was 58.2% for cases and 53.2% for controls. The main reason for nonparticipation in both cases and controls was refusal (22.5% and 34.8%, respectively). Other reasons included death, severe diseases, and inability to locate (15). Twelve percent of the cases were in the proximal colon, 45% in the distal colon, and 43% in the rectum. DNA samples were available from 535 (70%) of interviewed cases and 656 (85%) of interviewed controls. DNA samples were unavailable for the remainder of participants due to refusal or sample depletion by previous analyses. Cases and controls who donated blood were similar to those who did not with regard to age, sex, ethnicity, as well as other factors associated with colorectal cancer risk in this study. All participants with available samples were used in analyses regardless of matched set completion status.
Data Collection
In both studies, exposure information was collected via virtually identical interview-administered questionnaires designed to obtain demographic and lifestyle information, including lifetime histories of physical activity, tobacco smoking, and alcohol drinking; medical history; family cancer history; and, for females, reproductive and hormone use history. The interview also included a validated food frequency questionnaire with 268 food items (17, 18), and a detailed history of vitamin and mineral supplement use was also taken. This research was approved by the University of Hawaii Committee on Human Studies, Straub Clinic and Hospital Review Board, Kaiser Permanente Hawaii Institutional Review Board, and the Johns Hopkins Bloomberg School of Public Health Committee on Human Research.
Blood Collection
Blood samples in the adenoma study were collected in the morning after a 10-h fast and stored on ice until processed. Processing was completed within 2 h of collection and specimens were stored at –80°C. Serum samples of cases and controls were analyzed together in the same analytic batch giving priority to matched sets and in situations where serum was not available for all members of a matched set (due to either refusal to give blood or depleted serum), participants of the same ethnicity, sex, and of similar age.
Serum TGFβ-1 Assays
Serum TGFβ-1 was determined in the adenoma study using a quantitative sandwich enzyme immunoassay technique with a Quantikine human TGFβ-1 kit (R&D Systems) following the manufacturer's instructions. The detection limit of the assay is 7 pg/mL and has <1% cross-reactivity with isoforms TGFβ-2 and TGFβ-3. We analyzed 36 paired quality control samples (1 paired sample from 10 individuals and 2 sets of paired samples from 13 individuals), yielding an intrabatch coefficient of variation of 4.6% for the assay and an interbatch coefficient of variation of 16.5%.
SNP Selection
The TGFB1 gene, located on chromosome 19, was surveyed for variation from 20 kb upstream to 10 kb downstream from the coding region. SNPs were identified from public databases, including the National Center for Biotechnology Information dbSNP database7 build 124 and the HapMap Project8 public release 14 (19), and were selected approximately every 1 to 2 kb to provide dense coverage of the gene. Preference for selection as potential tagSNPs was given first to SNPs that have previously been reported as potentially functional (resulting in either differences in expression in vitro or differences in circulating concentrations of TGFβ-1 by allele), nonsynonymous SNPs, or SNPs previously associated with cancer risk. Other considerations for selection included heterozygosity with a minor allele frequency of at least 5% and being in evolutionary conserved regions.
The selected SNPs were then genotyped on a multiethnic panel of Japanese (n = 70), Caucasian (n = 70), Native Hawaiian (n = 69), African American (n = 70), and Latino (n = 70) samples from the Multiethnic Cohort study (20, 21), plus an additional 31 (
10%) duplicate samples inserted throughout the panel. These quality control pairs showed 100% agreement for all SNPs. A sample size of 70 per ethnic group was selected so that any haplotype with frequency of
5% would be represented among the 140 chromosomes with a probability of >99%. These data were used to select haplotype-tagging SNPs (see below). The selected haplotype-tagging SNPs were then genotyped in the cases and controls of both studies.
Tests of Hardy-Weinberg equilibrium were carried out among controls from each study separately for all SNPs, after stratification by ethnicity. This test is a
2 test of (observed – expected)2 / expected with 1 degree of freedom. In the case of low allele frequencies, an exact test was used based on an exact calculation of the probability of observing H heterozygotes conditional on the number of copies of the minor SNP allele (22). Pairwise linkage disequilibrium was calculated using the r2 statistic between all pairs of SNPs.
Linkage Disequilibrium Block Determination and Tagging SNP Selection
The program Haploview (23) was used to determine blocks of linkage disequilibrium (LD) determined in each ethnic group using the criteria of Gabriel et al. (24). Separately by ethnicity, common haplotypes were predicted from the multiethnic panel genotype data within LD blocks using the methods described by Stram et al. (25) and implemented in their statistical package TagSNPsv2. We used the statistical criteria Rh2
0.8 and selected additional tagSNPs in regions not represented in the haplotype blocks by the methods of de Bakker et al. (26) with the program Tagger (available in Haploview), which prioritizes tagSNPs to serve as a proxy for others at a given r2, either singly or by a group of two to three SNPs. We selected two additional tagSNPs in that way for a total of 16 tagSNPs.
DNA Extraction and Genotyping Assays
DNA was purified from blood lymphocytes using QIAamp 96 DNA Blood Kits (Qiagen). Genotyping was done by the 5' nuclease TaqMan allelic discrimination assay [Applied Biosystems (ABI)]. If a genotyping assay failed manufacture or in-house quality control at ABI, another SNP was selected in its place to maintain the dense coverage of the gene. In two cases, one where a missense SNP failed design at ABI and another where no other SNPs existed nearby, MGB Eclipse assays were ordered from Epoch Biosciences and processed in a manner similar to the TaqMan assays. Sixty-five duplicate QC pairs were genotyped on the adenoma study plates with 100% agreement for all but one SNP (rs11466345), which had 99% agreement. Ninety-three duplicate QC pairs were genotyped on the adenocarcinoma study plates with 100% agreement for all but two SNPs (rs11466345 and 1800471), which had 97% and 99% agreement, respectively. The call rate was >99% for all assays. All ethnic-specific genotype frequencies in the controls met the criteria for Hardy-Weinberg equilibrium (P > 0.05) except for three SNPs in Caucasians [rs10416269 in the adenoma (P = 0.04) and adenocarcinoma (P = 0.01) studies, rs1982073 in the adenoma study (P = 0.001), and rs1549934 in the adenocarcinoma study (P = 0.04)]. Among these comparisons with Hardy-Weinberg expectations, 2.6 were expected to be significant by chance alone.
Statistical Analysis
Haplotype estimation. Haplotypes were inferred for cases and controls separately by ethnicity using the statistical software PHASE version 2.1 (27–29), which uses a Bayesian approach to evaluate the unknown haplotype frequencies in light of the known genotype data using Gibbs sampling, a type of Markov Chain-Monte Carlo algorithm. The output of this program includes a "best" reconstruction haplotype for each individual. For analyses, we assumed a haplotypes model comparing one or two copies of a specific haplotypes versus all other haplotypes.
Genetic variation as a predictor of serum concentration. We carried out a cross-sectional analysis of the adenoma study participants to examine genetic variation in TGFB1 as a predictor of serum concentration. All analyses were first stratified by case-control status to rule out the possibility that adenoma or recent polypectomy was associated with altered levels of serum TGFβ-1. Case-control status was not associated with serum level. Additionally, all blood was collected well after the removal of the cases' adenomas, and there is evidence (30) that serum levels are affected by colorectal cancer but return to baseline once tumors are removed. Therefore, all subjects were combined for the analyses presented in this article. For each SNP, the variant allele was considered to be the one with the lower frequency, based on the reference sequence in dbSNP. The distributions of continuous variables were checked for normality and transformations were carried out as indicated. T tests and ANOVA were used to test for differences in mean serum TGFβ-1 concentration by various categories. Least-squares means of serum TGFβ-1 by genotype and haplotype were then calculated to adjust for additional factors. The P value for trend for increasing number of variant alleles was obtained via an F test with 1 degree of freedom. Additionally, all least-squared means were adjusted for age (as a continuous variable), sex, race, and screening clinic. We further adjusted models for pack-years of smoking and lifetime alcohol drinking, two variables associated with serum TGFβ-1 in our study. However, they did not alter the means so the models presented are not adjusted for these variables. We also considered the effect of length of storage of blood samples by adjusting models for time between date of blood draw and serum TGFβ-1 assay and found that time of storage did not alter the results.
Adenoma and adenocarcinoma risks. Case-control analyses were carried out separately but in the same manner for both studies. We considered dominant, recessive, and codominant modes of inheritance by treating each genotype successively as an indicator variable of type "any variant" versus "no variant allele," as an indicator of two variants versus less than two variants, or as a dosage variable of 0, 1, or 2 for increasing number of variants, respectively. Using the method of Stephens et al. (27), we first carried out a permutation test as a global test for difference in haplotype frequency in each LD block. Disease risk was estimated with the ORs and 95% CIs from unconditional logistic regression adjusting for the study matching criteria. ORs were estimated for each haplotype using all other haplotypes as the reference category. Adjustment was made for known risk factors including pack-years of smoking, energy intake, drink-years of alcohol, and intakes of red meat, sweets, total daily fluids, dietary fiber, and total fruit for the adenoma study, and for factors regular aspirin use, pack-years of smoking, lifetime recreational physical activity, body mass index (BMI), and intake of energy, calcium, non-starch polysaccharides from vegetables, and folate for the adenocarcinoma study. These further adjustments did not materially alter the ORs for the SNPs or haplotypes studied and, therefore, results are presented adjusted only for the study matching criteria.
| Results |
|---|
|
|
|---|
|
|
|
For each study, we first carried out a global test for case-control difference in haplotype frequencies for each LD block, overall and stratified by ethnicity. For block 1, there were no statistically significant differences in haplotype frequencies in any ethnic group in either study. For block 2, there were statistically significant differences in haplotype frequencies between adenocarcinoma cases and controls among Japanese (P = 0.05) and Native Hawaiians (P = 0.03), but not in Caucasians (P = 0.66). This test was also statistically significant with all subjects combined (P = 0.01). For adenoma cases and controls, the difference in block 2 haplotype frequencies was of borderline significance (P = 0.09).
The risk estimates for the SNPs are presented by study for all subjects combined in Table 2 and for each ethnic/racial group in Supplementary Table S4. The G allele of rs11466345 (Table 2) was inversely associated with adenocarcinoma risk in a codominant model for all races combined [AG versus AA genotype: OR, 0.9 (95% CI, 0.7–1.2); GG versus AA: OR, 0.4 (95% CI, 0.2–0.7); Ptrend = 0.01]. The corresponding ORs for adenoma were of similar magnitude but did not reach statistical significance (Ptrend = 0.09). For adenocarcinoma, we also observed a Ptrend of borderline significance for SNP rs11466349 for all races (P = 0.05), although the ORs were not statistically significant [CG versus CC: OR, 1.4 (95% CI, 0.9–2.0); GG versus CC: OR, 2.7 (95% CI, 0.5–15.1)]. The risk estimates for these two SNPs were similar across the three ethnic/racial groups but did not reach statistical significance (Supplementary Table S4).
|
|
| Discussion |
|---|
|
|
|---|
|
The SNP associated with risk in our study (rs11466345) is intronic. However, the haplotype block in which it lies also covers the last two exons and the 3' untranslated region of the gene. Thus, it is possible that this SNP is in linkage disequilibrium with a causal SNP that may either result in a change in amino acid or affect a possible splice variant or the stability of the mRNA. In addition, we found allele A of SNP rs6957, also on the same haplotype, to be associated with higher serum TGFβ-1 levels (Ptrend = 0.04), the direction proposed to be protective against colorectal neoplasia. None of the SNPs we measured either in the multiethnic panel or the adenoma and adenocarcinoma cases and controls were in exons 6 and 7 because of the absence of SNPs with frequency >0.05 in the public SNP databases.
The strengths of our study include a comprehensive approach toward characterizing the TGFB1 gene by leveraging the linkage disequilibrium that exists at this locus, as well as carefully adjusting for potential confounders. The consistency of the results observed across ethnic/racial groups suggests that population stratification is an unlikely explanation for the observed association with risk. Finally, we studied two end points representing early and late stages of colorectal neoplasia. Although, for breast cancer, it has been proposed that TGFβ-1 may be protective at early stages and detrimental effect at later stages (35, 39–42), our results do not suggest such a dual role in colorectal cancer.
One limitation of our study is that circulating TGFβ-1 was only measured at one time point and it is unclear how well a single measure may reflect long-term levels. Thus, intraindividual variation may have reduced our ability to detect a difference in serum level by case-control status. It is possible that one serum measure may not be enough to show the association with disease but is enough to show relationship with genotype. Survival bias may have been an issue in our case-control analysis of colorectal cancer. However, this is unlikely because the median time from identification of case to interview was reasonably short (4.5 months) and we did not find any differences in results by stage for adenocarcinoma. For the genetic analysis, we did not sequence the gene and instead used public SNP databases; thus, some variation could have been missed due to incompleteness of these databases. Controls in the adenoma study only received a flexible sigmoidoscopy. Thus, any adenoma located in the proximal colon may have been missed. Assuming that proximal and distal adenomas have the same relationships with the SNPs under study, as suggested by the adenocarcinoma data, the failure to identify proximal adenomas would result in nondifferential misclassification and attenuation of the risk estimates, but would not create a spurious association. Additionally, the overall statistical power for the adenoma study was lower than that of the adenocarcinoma study. There was also a lack of statistical power to detect associations in stratified (site or stage) and interaction analyses (BMI, alcohol, etc.) in both studies. Whereas a strength of our study is the inclusion of several ethnicities, our reduced sample sizes resulted in low statistical power for detecting ethnic differences. Finally, although we carried out a relatively large number of tests in this study, the likelihood that our finding for block 2 was due to chance is not supported by the global test, which was statistically significant (P = 0.01).
In summary, our data suggest that the A allele of SNP rs6957 may be associated with increased serum TGFβ-1 levels, and that allele G of SNP rs11466345 and its underlying haplotype (which also harbors allele A of rs6957) are inversely associated with colorectal cancer and, possibly, adenoma. If these findings are reproduced, future studies should examine SNPs in the last 2 exons, last intron, and the 3' end of the gene for disease association with an even greater resolution than in the current study.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank the Hawaii Tumor Registry, Castle Medical Center, Kaiser-Permanente Medical Center, Kuakini Medical Center, Queen's Medical Center, the Pacific Health Research Institute, Straub Clinic and Hospital, St. Francis Medical Center, and Wahiawa General Hospital for their collaboration. We thank our interviewing and phlebotomy staff for data collection; Annette Lum-Jones, Ann Seifried, and Michelle Ault for genotyping; Geoffrey K. Maiyoh for TGFβ-1 serum analysis; and Maj Earle, Lucy Shen, and Anne Tome for data support; and Anthony J. Alberg and Elizabeth A. Platz for their insight and helpful comments. We also thank the two anonymous reviewers for their constructive comments.
| Footnotes |
|---|
Received 6/ 8/07. Revised 11/ 8/07. Accepted 12/13/07.
| References |
|---|
|
|
|---|
T polymorphism, alone of in combination with the T869
C polymorphism, of the transforming growth factor-β1 gene with bone mineral density and genetic susceptibility to osteoporosis in Japanese women. J Mol Med 2001;79:149–56.[CrossRef][Medline]
C polymorphism of the transforming growth factor-β1 gene with genetic susceptibility to myocardial infarction in Japanese. Circulation 2000;101:2783–7.
C polymorphism in the transforming growth factor β1 gene and breast cancer among elderly white women: The Study of Osteoporotic Fractures. JAMA 2001;285:2859–63.This article has been cited by other articles:
![]() |
J. Healy, M.-H. Roy-Gagnon, and D. Sinnett No evidence for association between TGFB1 promoter SNPs and the risk of childhood pre-B acute lymphoblastic leukemia among French Canadians Haematologica, July 1, 2009; 94(7): 1034 - 1035. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |