| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Regular Articles |
1 Laboratory of Cell Regulation and Carcinogenesis, 2 Molecular Signaling and Oncogenesis Section, Department of Cancer and Cell Biology, 3 Laboratory of Systems Biology, and 4 Laboratory of Population Genetics, Cancer Research Center, National Cancer Institute, Bethesda, Maryland; and 5 Genome Institute of Singapore, Singapore
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
In addition to Src, PyMT also acts through the phosphatidylinositol 3'-kinase (PI3K; refs. 7, 8, 9 ) which subsequently activates Akt (also named protein kinase B; ref.10 ) thus inhibiting proapoptotic proteins such as BAD, Forkhead transcription factor, and caspase 9 (11 , 12) . Lastly, PyMT is also involved in the activation of the Ras signaling pathway in mammary tumorigenesis (13, 14, 15, 16) .
Using the MMTV-PyMT mouse mammary carcinoma model, we previously identified mouse strains that harbored dominant genetic modifiers of metastasis by breeding MMTV-PyMT mice to 27 different inbred strains (17) . Mammary tumors derived from the different transgene-positive F1 progeny displayed altered latency, metastasis, and tumor growth rates compared with the FVB/NJ homozygous parent (17) . Our observations raised the intriguing possibility that the host genetic background has a significant impact on the biological behavior of subsequent tumors induced by a single oncogene. In this study, we sought to identify molecular signatures that might be involved with different tumor phenotypes given an identical genetic initiator of tumorigenesis. We compared gene expression profiles of MMTV-PyMT mammary tumors that developed in five different mouse genetic backgrounds: MMTV-PyMT in FVB/NJ and four F1 strain combinations (I/LnJ, LP/J, MOLF/Ei, and NZB/B1NJ).
Although the overall expression profiles from PyMT tumors emerging in various strain backgrounds are similar, a supervised approach identified approximately 200 genes that are differentially expressed between tumors from different genetic backgrounds. Using these subsets of genes that differentiate the genotypes, distinct gene expression patterns could separate the tumors from the different genetic backgrounds into two major clusters that correlated with a general virulence index of the tumors. The behavior of a core set of these virulence-associated genes match those identified previously as belonging to the metastatic signature set of genes for human cancer (1) . These results suggest that host genetic factors can independently alter the transcriptional profiles of mammary cancers and may explain some of the differences in the virulence of the resultant tumors.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
8,700 elements) were purchased from Incyte Genomics Inc. (Wilmington, DE) and spotted on poly-lysine-coated glass slides by the microarray core at the National Cancer Institute Advanced Technology Center. Microarray hybridizations were performed as described previously (19) . The arrays were air-dried and scanned using the Axon GenePix4000A scanner (Axon, Union City, CA) and images were processed using GenePix-Pro3.0 software (Axon). Both image and signal intensity data were stored in the microarray database supported by the Center for Information Technology at the National Cancer Institute, NIH, Bethesda.6 This data are available through the National Cancer Institute Directors Challenge website.7
Oligonucleotide Microarray Hybridization and Data Analysis
To extend the gene coverage of this analysis, we reanalyzed the tumors using a different array platform. The same preparation of total RNA used for cDNA arrays was used for oligonucleotide array studies on the Murine Genome U74Av2 chip containing 12,500 features (Affymetrix, Inc., Santa Clara, CA). Equal amounts of RNA isolated from each genomic background were pooled. Ten micrograms of total RNA were reverse transcribed using a T7 (dThd)24 primer to synthesize the cDNA, followed by the incorporation of biotinylated ribonucleotides by in vitro transcription using T7 RNA polymerase. The biotinylated-labeled RNA was fragmented and hybridized to the Murine Genome U74Av2 chip (Affymetrix) according to the manufacturers recommendations. Each mixed RNA sample was repeated twice by two independent labeling and hybridizations. Signal intensities for different chips were scaled to a trimmed mean signal of 150. Gene expression analysis was performed using the Microarray Suite version 5.0 software (Affymetrix) where pooled 10 to 11 week-old FVB/Nj mammary gland RNA served as the reference.
Gene Expression Analysis
cDNA Arrays.
The mean spot intensities corrected for local background were used in calculating the expression ratio (Cy5 tumor: Cy3 reference and the converse for dye swap) for each clone (excluding the spots that were flagged as bad or not found during the image analysis). The ratios were then normalized to a median ratio value of 1 for each array, log transformed, and averaged for replicated dye-swap experiments. The resulting data matrix used for analysis consisted of the expression levels of 8,464 clones relative to the normal mammary gland. Two-tailed F tests and other statistical calculations were performed on log-transformed ratios. An overall error level of the averaged expression ratios was determined by computing the SE of each averaged expression and the 99th percentile of these SEs. This estimated value corresponds to a ratio of 1.43. This threshold level was set as a minimum requirement for the significance of expression change in addition to the P value of a statistical test.
Gene Expression in Tumor Tissue.
The expressions were considered to be significant when the Student t test P value is <0.01 and 75% percent of replicates have a 2-fold change in the same direction. The number of genes up- and down-regulated in each strain were calculated, and the intersect of all five strains was determined.
Significance Analysis of Microarrays.
The program SAM8
was used to identify genes that are likely to exhibit a significant change in expression among the tumor groups. The relative difference in expression of gene i is defined by a score d(i) = r(i)/[s(i) + s0], where r(i) is Fishers linear discriminant, s(i) is a measure of SD, and s0 is a small positive constant (20)
. A set of 125 significant genes having d(i) scores above the threshold (
)level of 0.1 from expected values was determined as described by Tusher et al. (20)
. At this threshold level, the false discovery rate was estimated to be 0.8%.
Multidimensional Scaling and Hierarchical Clustering.
Multivariate analysis of the correlations among gene distributions of samples was carried out by multidimensional scaling. The multidimensional information was reduced to three dimensions using 1-
as the distance metric where
is the Pearson correlation coefficient for pictorial representation. The analysis was carried out using all genes as well as a subset of genes having P < 0.01 by one-way ANOVA calculation. The calculations were performed using Partekpro 5.0 software (Partek Inc., St. Charles, MO). Hierarchical clustering of both genes and samples was performed using the Eisen Cluster program (21)
with 1-
as the distance metric and the complete linkage option of the software for agglomerative clustering.
| RESULTS |
|---|
|
|
|---|
|
|
Gene Expression Data Integration
Concordance between the Array Platforms.
The overlap of genes represented on both the oligonucleotide and cDNA arrays is only about 27% of the 16,000 total features represented on both arrays. The correlation between the overlapping genes in the data sets from the cDNA and oligonucleotide arrays was excellent with r = 0.9 indicating that the data obtained by two independent methods are very consistent. Because 73% of all genes represented in both array formats were non-overlapping, the combination of these data points would not adversely bias the results of clustering. The key challenge was the harmonization of expression data of genes that were represented in both oligonucleotide and cDNA arrays. To accomplish this, we took the overlapping genes and modeled their association. From the overlapping genes, 3,219 average gene expressions above 1.5 threshold in both oligonucleotide and cDNA arrays of all strains were pooled to determine the degree of correlation. The regression line could not be determined accurately in the presence of significant noise because the error in the estimation of the slope is large. Two slopes, m1 and m2, were determined by fitting these average expressions into the equations y = m1 x and x = m2 y (where x and y represent oligonucleotide and cDNA expressions) and by checking for the equality of m1 and 1/m2. Elimination of 33% noisy data estimated from residuals of trial fit resulted in good agreement of m1 with 1/m2. The best fit was thus depicted by the eq. A
.
![]() | (A) |
The genes common to all five strains of MMTV-PyMT mammary tumor (164 genes identified from the cDNA arrays and 188 genes identified using the oligonucleotide arrays with 30 overlaps) are most likely representative of molecular features associated with PyMT mammary tumors. Included in this list are genes associated previously with human breast cancer, specifically, STAT3, CD24 antigen, lipocalin2, and procollagen I
and III
. In addition, molecules associated with cellular growth, such as cyclin-dependent protein kinase CDC28 (CKS1), hairy (HEY-1), lactotransferrin, and transferrin were up-regulated in tumors. In contrast, cell adhesion molecules or membrane proteins such as junction cell adhesion molecule 1, integral membrane protein 2A, lipocalin2, and procollagen I
and III
, caveolin, CD34, CD36 were all down-regulated. Taken together, this suggests that the cancer cells express a cassette of genes supporting cell growth and reducing cell adhesion. Transcription factors and regulators of cell metabolism were highly induced in the PyMT-induced tumor models. The transcription factors included hairy/enhancer-of-split related (hey-1), Cbp/p300-interacting Transactivator (Cited2), activating transcription factor 3 (Atf3), transcription factor AP-2ß (Tcfap2b), CCAAT/enhancer-binding protein (C/EBP) ß (Cebpb), ets variant gene1 (Etv1), and forkhead box C1 (Foxc1). RAB3D, a member of the Ras oncogene family (Rab3d), was also induced.
Although there did not appear to be specific signatures of signaling pathway activation, several important molecules involved in the PI3K and Ras signaling pathways were differentially regulated. The p85 regulatory subunit of PI3K was up-regulated in mammary tumors from all five strains biologically consistent with the involvement of PI3K in PyMT action. The expression of several Ras signaling-related molecules was also altered tumors from the different background strains as shown in Table 3
.
|
of 0.1 with an estimated false detection rate of 0.8%. Analysis with the F test at significant P < 0.01 indicated 80 genes with a 2-fold differential expression and 275 genes at a threshold of 1.5-fold (Supplementary Table 2). Using this set of 80 differentially expressed genes (2-fold, P < 0.01) selected by this stringent criteria, we applied agglomerative clustering (Fig. 2A)
|
|
Correlation of a Virulence Phenotype to Gene Expression.
Our attempts to define specific gene lists that could distinguish the different individual phenotypes of metastases, growth rate, and latency were not successful, suggesting that the specific phenotype ascertainment is either imperfect or highly interconnected in a complex manner that precludes simple classification. For example, using the expression patterns from all five of the genotypes did not permit the unambiguous identification of a gene set predictive of any metric for metastatic potential. This strongly suggests that other factors, including tumor growth rate and tumor mass, which vary between the FVB/NJ and LP/J F1, MOLF/Ei F1, and I/LnJ F1 genotypes, influence the efficiency of tumor dissemination. Alternatively, covariates in addition to metastasis are likely to degrade the power of the analysis so that five genotypes may not have sufficient statistical power to identify a predictive gene set especially if these genes individually have minor effects on metastases.
Recently however, a small set of 17 genes was reported to predict metastatic potential for a variety of solid tumors by comparing the expression patterns of primary and metastatic samples from human samples (1)
. We therefore asked whether in our dataset, the murine orthologs of these putative human metastatic genes could also play a role in the metastatic phenotype in this model system. To test this, we examined expression data from two strains with the greatest difference in their metastatic potential (as ranked by how many metastases per square micrometer of lung tissue): high-metastasis (FVB/NJ) and low-metastasis (NZB/B1NJ F1). Orthologs of all 17 genes reported from the human analysis were identified on the Affymetrix mouse chips. Intriguingly, except small nuclear ribonucleoprotein polypeptide F, 16 of these genes showed the same expression direction as the human set (i.e., murine tumors with the higher metastatic potential showed the same differential expression of 16 of 17 genes that distinguished human primary from metastases), 10 of which had expression differences of at least 1.4-fold. These results (Table 4
; Fig. 4
) suggest that a limited number of genes associated with metastases in human cancers appear also to be associated with metastatic potential of primary murine tumors and that the behavior of these genes may be modulated by host genetic background and not because of initiating oncogenes (23)
.
|
|
| DISCUSSION |
|---|
|
|
|---|
A total of 1,068 and 1,241 "cancer genes" were found by cDNA and oligonucleotide (Affymetrix) array analyses, respectively. These genes are consistently up- or down-regulated in the mammary tumors from any one of the five strains. To define a minimal set of genes that could distinguish a mammary tumor from the normal mammary gland, we selected those elements that were significantly dysregulated in tumors derived from all five genomic backgrounds. This resulted in a common gene set of 164 genes (Supplementary Table 1A) using the cDNA microarrays and 188 genes (Supplementary Table 1C) from the oligonucleotide arrays.
Included in this list are genes associated previously with human breast cancer, such as, STAT3, CD24 antigen, lipocalin2, and procollagen I
and III
. Genes associated with cellular growth were up-regulated in tumors [CDC28, hairy (Hey-1), and lactotransferrin]. In contrast, cell adhesion molecules or membrane proteins were generally down-regulated (integral membrane protein 2A, procollagen I
and III
, caveolin, actin
1, CD34, CD36). Several transcription factors and regulators of cell metabolism were highly induced in the PyMT-induced tumor models. Taken together, this suggests that the cancer cells express a cassette of genes supporting cell growth, cellular metabolism, and reducing cell adhesion.
Of particular interest is the fact that the p85 regulatory subunit of PI3K (P85
) was up-regulated in all five PyMT-induced tumor models. p85 has been shown previously to be a key signaling mediator of the Src tyrosine kinase family and the Shc-adaptor protein involved in PyMT transformation (24)
. Specifically, PyMT associates with one of the two SH2 domains of the p85 subunit of PI3K (25)
. The importance of the PI3K in mammary tumorigenesis has been established. Transgenic mice expressing a mutant PyMT unable to interact with PI3K develop widespread mammary hyperplasia with a high rate of apoptosis (26)
in contrast to the rapid induction of metastatic mammary tumors observed in mice expressing wild-type PyMT. Mutant PyMT-transgenic mice do eventually develop tumors but with a greatly extended latency. Activated PI3K can also result in the activation of Akt, which can stimulate a number of antiapopototic signaling molecules (10)
and can inhibit proapoptotic proteins such as Bad, Forkhead transcription factor, and caspase 9 (11
, 12)
.
One hallmark of oncogene transformation is the deregulation of cell cycle control mechanisms. Two important cell cycle regulatory molecules, CDC kinase subunit (Cks1) and Cdc25C phosphatase, were up-regulated in PyMT tumors in all five mouse backgrounds.
Cks proteins interact tightly with cyclin-CDK (cyclin-dependent kinases) complexes. Cks proteins are essential for CDK function and cell division in yeast (27) . Cks1 has been shown recently to be an essential cofactor in the ubiquitination of the CDK inhibitor p27. p27 acts as a powerful negative regulator to the G1-S transition by binding to cyclin E-Cdk2 and to cyclin A-Cdk2. Therefore, Cks1 appears to play an important role in regulating G1-S transition through the cell cycle, which may be important for cancer progression (28 , 29) . Aggressive human tumors are associated with low p27 levels, and the absence of p27 expression is a powerful prognostic marker of poor survival in patients with breast, esophageal, lung and colorectal carcinomas (30) . Given its role in promoting p27 degradation, Cks1 may function as a novel proto-oncogene. The discovery of elevated expression levels of Cks1 in all PyMT-initiated mammary tumors in our study suggests that the molecule might play a key role in the mammary epithelial cell transformation.
Both Cks1 and Cdc25C are up-regulated in PyMT-transgenic mammary tumors. The dual protein phosphatase Cdc25 family members regulate intracellular signaling through the mitogen-activated and stress-activated kinases and control the cell transition through G1-S and G2-M in cell circle checkpoints by affecting cyclin-dependent kinase activities. The Cdc25 phosphatases are overexpressed in many human tumors, and this increased expression is associated with a poor prognosis (31) . Cdc25A acts on Cdk2 and Cdk4 at the G1-S-phase and promotes cell entrance into S-phase (32) , whereas Cdc25B and Cdc25C act on cdc2 at the G2-M phase and play a role in the onset of mitosis. The biological significance of Cdc25A in the PyMT-transgenic mice has been studied recently. The epistatically interacting modifier loci in chromosomes 9 and 15 accelerate PyMT-induced mammary tumorigenesis in the I/LnJ background (33) . Further analysis revealed that FVB/NJ chromosome 15 was associated with tumor acceleration and was conditioned on the presence of the I/LnJ allele on chromosome 9 (loci that have been designated as Apmt1 and Apmt2; ref. 17 ). A combined genetics, genomics, and bioinformatics approach identified c-myc and Cdc25A as strong candidates for the tumor latency modifier Apmt1 and Apmt2, respectively (34) . Taken together, Cks1 plays an important role in controlling the G1-S transition, whereasCdc25C is a key positive regulator of mitosis.
Lastly, our results demonstrated that gelsolin is significantly reduced in all PyMT mammary tumors. Gelsolin has been considered to be a prognostic factor for a variety of human tumors (35) , especially for erbB2+ and EGFR+ breast cancer patients (36) . Gelsolin is an actin-binding protein that participates in and regulates dynamic changes in the actin cytoskeleton and is widely expressed by normal cells and down-regulated with transformation of multiple cell types, including breast epithelium (37 , 38) . Recently, gelsolin has been shown to have tumor-suppressive activity (37 , 39) .
Uniquely, we have addressed genetic elements that underlie strain differences in tumor behavior. Hierarchical clustering or multidimensional scaling using these sets of cancer genes was unable to separate the tumor samples into particular subgroups based on the specific tumor phenotypes such as tumor latency, tumor growth rate, and metastatic potential. We suspect that this is because of phenotypic characteristics used to describe the tumors cannot be mechanistically well separated. We defined a virulence index (Table 1)
that takes into account contributions from each tumor phenotype to a general virulence metric for the tumor. When we compared the array clusters to the virulence indices, we found a natural separation in that the least virulent (MOLF/Ei F1 and LP/J F1) constitutes a set that is distinct from the most virulent (FVB/NJ and I/LnJ F1) tumors, with NZB/B1NJ F1 showing a profile associated with an intermediate phenotype. Taken together, these results suggest that the standard division of tumor behavior into growth, latency, and metastatic potential may not reflect reality and that all three characteristics may be inextricably integrated in biology. We suggest that gene functions affecting metastatic behavior will be difficult to dissect from those influencing tumor growth and latency. More interestingly, however, we have identified genes that might influence tumor virulence modulated by host genetic factors.
Our replication of the human metastasis predictive expression pattern in the FVB/NJ versus NZB/B1NJ F1 comparison has several important implications. First, it suggests that many of the factors present in humans breast cancers that induce the metastatic expression signature may also be present in these inbred murine strains, further validating this model system to study metastasis in human breast cancers. Moreover, earlier models predicted that only a rare subpopulation of primary tumor cells would acquire the numerous genetic alterations required for metastatic spread (i.e., a progression of secondary mutations). However, our findings suggest that the primary tumor may already clearly express a signature set of genes that increases the chance for metastatic dissemination and that secondary genetic mutations may be necessary but not sufficient for this metastatic progression. Finally, other genetic models for metastases suggest that the different metastatic potential of tumors is directly caused by a variety of genetic mutations within the tumor. Our observations, however, raise the importance of non-tumor factors that modulate metastatic potential (17) because all of the tumors studied here were initiated by the same oncogenic event. In our study, the association between metastatic efficiency and the metastatic signature gene expression profiles is most likely because of genetic background effects rather than different combinations of oncogenic mutations.
Thus, intriguingly, there may be human subpopulations that are "susceptible" to metastatic dissemination. Furthermore, if the predictive metastatic signature profile is the result of differences in background allelic composition, it predicts that a metastatic "signature" gene expression profile may be obtainable from virtually any tissue of the body. In support of this possibility, we have demonstrated recently that several of the human 17 gene metastasis signature genes are differentially expressed in normal, transgene-negative mammary, and lung tissue from the high-metastatic FVB/NJ versus two low-metastatic F1 genotypes (NZB/B1NJ and DBA/2J9 ). These data suggest that easily accessible non-tumor tissue such as skin may be useful for identifying those patients at highest risk for latent disseminated disease. Such an approach could stratify patients who may benefit from aggressive neoadjuvant therapy to eliminate micrometastases or potential chemoprevention regimes to prevent development of clinically detectable disease.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Note: T. H. Qiu and G. V. R. Chandramouli contributed equally to this work. Supplemental data for this article can be found at Cancer Research Online (http://cancerres@aacrjournals.org).
Requests for reprints: Jeffrey E. Green, Laboratory of Cell Regulation and Carcinogenesis, Cancer Research Center, National Cancer Institute, Bethesda, Maryland 20892. E-mail: JEGreen{at}nih.gov
6 http://nciarray.nci.nih.gov. ![]()
7 URL: http://dc.nci.nih.gov/. ![]()
8 http://www-stat-class.stanford.edu/SAM/SAMServlet. ![]()
9 K. Hunter, unpublished results. ![]()
Received 1/24/04. Revised 4/27/04. Accepted 7/ 6/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. M. Fluck and B. S. Schaffhausen Lessons in Signaling and Tumorigenesis from Polyomavirus Middle T Antigen Microbiol. Mol. Biol. Rev., September 1, 2009; 73(3): 542 - 563. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. La Merrill, D. S. Baston, M. S. Denison, L. S. Birnbaum, D. Pomp, and D. W. Threadgill Mouse breast cancer model-dependent changes in metabolic syndrome-associated phenotypes caused by maternal dioxin exposure and dietary fat Am J Physiol Endocrinol Metab, January 1, 2009; 296(1): E203 - E210. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Barkan, H. Kleinman, J. L. Simmons, H. Asmussen, A. K. Kamaraju, M. J. Hoenorhoff, Z.-y. Liu, S. V. Costes, E. H. Cho, S. Lockett, et al. Inhibition of Metastatic Outgrowth from Single Dormant Tumor Cells by Targeting the Cytoskeleton Cancer Res., August 1, 2008; 68(15): 6241 - 6250. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Martin, K. J. Carter, S. R. Jean-Philippe, M. Chang, S. Mobashery, S. Thiolloy, C. C. Lynch, L. M. Matrisian, and B. Fingleton Effect of Ablation or Inhibition of Stromal Matrix Metalloproteinase-9 on Lung Metastasis in a Breast Cancer Model Is Dependent on Genetic Background Cancer Res., August 1, 2008; 68(15): 6251 - 6259. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. K. Deeb, A. M. Michalowska, C.-y. Yoon, S. M. Krummey, M. J. Hoenerhoff, C. Kavanaugh, M.-c. Li, F. J. Demayo, I. Linnoila, C.-x. Deng, et al. Identification of an Integrated SV40 T/t-Antigen Cancer Signature in Aggressive Human Breast, Prostate, and Lung Carcinomas with Poor Prognosis Cancer Res., September 1, 2007; 67(17): 8065 - 8080. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |