| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Regular Articles |
1 Immunogenetics Section, Department of Transfusion Medicine, Clinical Center, National Institutes of Health, Bethesda, Maryland; 2 III Department of Internal Medicine, The Johannes Gutenberg University, Mainz, Germany; and 3 Martin Luther University, Institute of Medical Immunology, Halle, Germany
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
In this study, we entertained an analysis of the role that ontogeny plays in cancer using as a model primary renal cell cancer (RCCs) tissues paired with normal renal tissue subjected to identical surgical manipulation and experimental preparation. The transcriptional analysis of the paired specimens was subsequently compared with archival frozen samples of melanoma metastases, representing a putative extreme of diversity with regard to ontogenesis and neoplastic progression, and with various primary epithelial cancers to frame the boundaries of similarities and discrepancies in the transcriptional program.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Similarly, total RNA from peripheral blood mononuclear cells pooled from six normal donors was extracted and amplified to serve as constant reference (7 , 11, 12, 13) . Test and reference RNAs were labeled with Cy5 (red) and Cy3 (green) and cohybridized to a costum-made17.5K cDNA microarray.4 Microarrays were printed at the Immunogenetics Section, Department of Transfusion Medicine, Clinical Center, National Institutes of Health (Bethesda, MD) with a configuration of 32 x 24 x 23 and contained 17,500 elements. Clones used for printing included a combination of the Research Genetics RG_HsKG_031901 8k clone set and 9,000 clones selected from the RG_Hs_seq_ver_070700 40k clone set. The 17,500 spots included 12,072 uniquely named genes, 875 duplicated genes, and about 4,000 expression sequence tags.
Data Analysis
Quality Validation.
All statistical analyses were performed using the log2-based ratios normalizing the medial log2 ratio value across the array equal to zero. Validation and reproducibility were performed using our internal reference concordance system, based on the expectation that results obtained through the hybridization of the same test and reference material in different experiments should perfectly collimate. The level of concordance was measured by periodically rehybridizing the same arbitrarily selected test sample (A375 melanoma cell line) with the reference sample. High concordance in gene expression predicts that ratios in different experimental conditions are highly reproducible. With this goal, we analyzed seven forward and seven reciprocally labeled replicate array experiments that were hybridized periodically every other 25 cDNA array slides within each printing. SDs across those 14 arrays were analyzed after labeling swap, and discordant genes due to consistent labeling bias were excluded. This analysis demonstrated a >95% concordance level. Nonconcordant genes due to reproducible or random biases were excluded from subsequent analysis (15)
.
Unsupervised Analysis.
Principal component analysis (PCA) was performed with the Partek Pro software program (Partek Inc., St. Charles, MO). Unsupervised clustering was performed according to the Pearson correlation method of Eisen et al. (16)
and visualized with Treeview software (Stanford University, Stanford, CA). Genomic portraits were depicted according to the central method for display, using a normalization factor as suggested by Ross et al. (17)
. Details about different unsupervised tests are discussed in the respective results section.
Supervised Analysis.
Identification of class-specific genes was performed using a paired or an unpaired two-tailed Students t test, as appropriate. The same analyses were performed using unpaired Wilcoxons nonparametric assessment and provided the same conclusions (data not shown). Identification of relatively consistent gene expression in various data sets was performed by comparing the median log2 ratio values over mean log2 ratio values for each subset of samples among the genes found to be significantly overexpressed in cancer specimens compared with normal kidney samples. For each subset analyzed, consistently expressed genes were considered those that had a median/mean ratio
1 because at least 50% of the samples exhibited an average expression value equal to or above the average expression value for that subset. Sarcoma samples were excluded from the analysis, and their statistics are presented separately. Three subsets of genes were explored. The first subset included genes preferentially expressed by RCCs compared with other tumors (
> 2 Oth, where
= median log2 ratio for RCCs median log2 ratio for other tumors, and Oth = median log2 ratio for other tumors). The second subset included genes concordantly expressed by RCCs and other tumors (
2 Oth, where
= median log2 ratio for RCCs median log2 ratio for other tumors, and Oth = median log2 ratio for other tumors). The third subset included genes preferentially expressed by tumors other than RCCs (
RCCs, where
= median log2 ratio for RCCs median log2 ratio for other tumors, and RCCs = median log2 ratio for RCCs).
| RESULTS |
|---|
|
|
|---|
|
All other primary cancers of nonrenal lineage segregated from the transcriptional profile of renal samples (Fig. 1B)
. Therefore, primary RCCs clustered closer to normal kidney tissue than to other epithelial nonurologic cancers, underlining the predominance of ontogeny on their global expression pattern. As predictable, melanoma metastases segregated from all other specimens either because of the dedifferentiation that accompanies the metastatic process or because of their neuroectodermal origin. Melanoma metastases were combined with the data set only in this preliminary analysis to portray a "biological extreme" that could help frame the transcriptional profile of the primary cancers studied here between the highly differentiated profile of normal kidney tissue and the highly undifferentiated profile of metastatic tumors. Melanoma metastases were not further analyzed because this study was aimed at analysis of the differentiation pattern of primary RCCs compared with that of other primary epithelial nonurologic cancers.
To separate ontogeny from oncogenesis, a supervised analysis was performed to identify genes among the complete data set that were differentially expressed between RCCs included in cluster a and normal kidney. Genes were identified at an arbitrarily set significance threshold of P2 < 0.001 (two-tailed, unpaired Students t test). RCCs samples included in groups b and c were excluded with the assumption that a partial overlap of expression of RCCs-specific and renal tissue-specific genes was at the basis of their clustering in proximity to normal renal samples. This analysis identified 350 clones responsible for the segregation of RCCs and normal kidney specimens. Obviously, other genes related to ontogenesis still expressed by RCCs in group a could not be identified by this strategy. Therefore, the information portrays a partial picture still likely to be sufficient to provide a sense of the effect of lineage-specific gene expression. A high percentage of the genes appeared to be up-regulated in clusters b and c (green vertical bars in Fig. 1B
) that included mostly normal kidney samples. Thus, genes progressively down-regulated during neoplastic dedifferentiation were most likely related to ontogeny. Another set of genes was up-regulated predominantly in RCCs lesions and coordinately expressed by other epithelial cancers of nonrenal lineage, therefore representing genes associated with oncogenesis (red vertical bars in Fig. 1C
). Unsupervised reclustering of the experimental samples based on the 350 clones enhanced the separation between RCCs and normal kidney approximating RCCs to other epithelial cancers, emphasizing the predominance of shared patterns of gene expression when genes responsible for lineage descent are removed (Fig. 2A)
.
|
An analysis using the same significance cutoff was performed on 10 paired RCCs lesions and normal kidney samples derived from the same surgical specimens. This paired analysis identified 170 differentially expressed genes (the smaller number is due to the smaller sample size), mostly overlapping those identified with the previous analyses, and included a majority of genes overexpressed in normal renal tissue (data not shown).
Influence of the General Process of Oncogenesis on the Transcriptional Program of RCCs.
To identify genes associated with oncogenesis independently of lineage, we looked for genes expressed differentially between all primary tumors (RCCs and non-RCCs) and normal kidney with the same arbitrary significance cutoff P2 of <0.001 (two-tailed Students t test). This process identified 1,347 genes. Some of the differences identified could be attributed to a predominant effect of genes differentially expressed between nonrenal tumors and normal kidney tissue still due to lineage diversity. Therefore, we separated the recovered 1,347 genes into those that were differentially expressed between RCCs and other cancers (possibly because RCCs retained lineage-specific expression patterns) and those that were not differentially expressed between the same two groups. This process identified 276 genes that were differentially expressed at the <0.001 level between RCCs and other primary cancers, whereas the remainder were above that threshold. Further analysis of these genes demonstrated that they were related to renal lineage specificity, and their inclusion in the data set was due to the strong differences between epithelial cancers and normal kidneys. However, few genes were truly specific for renal oncogenesis, and those were rescued with a separate analysis discussed later. Because the range of significance between <0.001 and 0.01 was felt to be ambiguous for the separation of genes belonging to either group, these genes were also excluded from the analysis. Thus, another 233 genes were excluded from the data set, and the remaining 850 genes were left as putative oncogenesis-specific genes. No information about normal tissues from the other tumors was available. Therefore, some of the differences found might still be due to the presence of lineage-specific genes related to ontogeny of other cancers; however, this is unlikely because these cancers originated from different epithelial tissues.
Among the 850 clones differentially up-regulated in cancer independently of histologic background, 176 clones included expression tag sequences or other genes with unknown function. Therefore, we eliminated them from further analysis and focused our attention on the remaining 674 clones with functional annotations (Fig. 2B)
. As expected, unsupervised reshuffling of tissue samples based on these genes (see the dendrogram in Fig. 2C
) segregated all normal renal samples into a separate cluster that also included one RCCs (R2838 TU; red bar). Primary epithelial tumors segregated into two main subclusters, one exclusively including seven RCCs (blue bar) and a second one that included all other tumors including sarcomas and the remaining six RCCs lesions (maroon bar). Therefore, even the elimination of genes differentially expressed between RCCs and other epithelial cancers did not completely abrogate the lineage specificity, at least for some RCCs. The first cluster exclusive for RCCs included all of the RCCs samples from groups b and c (R2834 TU, R2839 TU, and R2828 TU) except for R2838 TU that clustered together with normal renal tissue. The other cluster included exclusively RCCs that belonged to the least differentiated cluster a and R2833 TU that also clustered in the previous analyses with tumors other than RCCs, emphasizing the lower level of differentiation of these samples.
Of the 674 genes specific for oncogenesis, 329 were down-regulated in cancer tissues compared with normal kidney, and most of them represented lineage-specific genes associated with renal function. In the end, 345 genes were identified that were specifically up-regulated in cancerous tissues compared with normal renal tissue.
Interestingly, we noted substantial heterogeneity in the expression of these genes in different cancers independent of lineage. Thus, we examined which of the oncogenesis-related genes were consistently expressed in all cancers or more frequently expressed in RCCs lesions or cancers of different histology. This was achieved by identifying within each group the genes with the highest median ratio. Due to their ambiguous biological behavior, sarcomas were excluded from the analysis. Those genes with a log2 ratio between median and mean value
1 within each category were consistently expressed. This parameter identified genes that were expressed above the threshold level in at least 50% of cancers within a given category.
This exercise identified 132 genes whose median expression was similar or higher than their mean level of expression. Of those, 41 were categorized as expressed in RCCs samples, 43 were concordantly expressed by most cancers, and 48 were preferentially expressed by nonurologic epithelial tumors (Supplementary Tables 13, respectively).
Influence of RCCs-specific Oncogenesis on the Transcriptional Program of RCCs.
A separate analysis was performed to identify genes uniquely expressed by RCCs and unrelated to their renal heritage. This separate analysis was performed to rescue genes possibly missed by the previous analysis due to their down-regulation in RCCs or due to the dilution of their significance level by combining in the analysis the total number of primary cancers. The complete data set was reanalyzed by comparing 14 RCCs samples against the other 12 epithelial cancers. First, this was performed by studying all RCCs cancers independently of the relationship among themselves according to the unsupervised clustering shown in Fig. 1B
. The comparison between the two sets of cancers was performed based on a two-tailed unpaired Students t test. This analysis identified 1,114 clones differentially expressed by the two sets of cancers. Ontogenesis-related genes were subsequently excluded by subtracting genes coexpressed between RCCs and normal renal tissue (P2
0.001). The large majority of genes dissipated with this test, suggesting that differences between RCCs and other cancers are predominantly due to lineage specificity. However, 43 clones remained that were highly specific for RCCs and differentially expressed between RCCs and other cancer as well as normal renal tissue.
Genes Commonly Expressed by Cluster a RCCs and Sarcomas.
To explain the close association between RCCs in cluster a and sarcomas, genes differentially expressed by these tissues were compared with those of other tumors excluding melanoma metastases and other RCCs that did not cluster in cluster a. This analysis identified 597 genes differentially expressed at the 0.001 level (two-tailed t test). The large majority of these genes were coordinately expressed by normal renal tissue, suggesting that the proximity of sarcomas to RCCs is due to a particular expression profile in common with normal tissues (compared with other cancer) or specifically to normal renal tissue. For identification of genes specifically coordinately expressed by RCCs in cluster a and sarcomas, the expression profile of both tissue types was compared with normal renal tissue at the P2 value of 0.001 (two-tailed t test). In this fashion, 48 clones were identified, of which only a few were up-regulated (Fig. 3)
. The genes almost totally overlapped the RCCs-specific genes identified with the previous analyses.
|
|
|
| DISCUSSION |
|---|
|
|
|---|
The first finding was that primary RCCs segregate into at least two molecular subgroups. Such subgroups are portrayed in Fig. 1A
by cluster a containing only RCCs samples and the closely related clusters b and c with a mixture of RCCs and normal renal specimens including the chromophobe RCCs (R2828 TU) consistent with the relatively high level of differentiation of this tumor (4)
. Interestingly, both normal kidney and tumor tissues of the samples 2838 and 2839 clustered into group b, whereas other normal and tumor samples are quite distant in Fig. 1B
. We truly have no exact explanation for the specific pairing of these two tumor/normal tissue samples, although we and others have seen it before in other experimental situations. The best explanation we have is that in conditions of relatively good differentiation, primary tumors tend to align genetically with the normal tissue of origin because of the individuals genetic background. This hypothesis has never been properly tested, and a much larger population would be necessary to demonstrate this point. We are, however, quite confident that the similarities noted are not artifactual, based on RNA quality analysis and pathological testing (the samples from which RNA was extracted were composed almost exclusively of tumor or normal cells, respectively, in a proportion similar to other samples). The genes responsible for the differences noted between the RCCs of cluster a and those of the other two clusters belonged predominantly to the renal lineage and were mostly coexpressed by the latter groups of RCCs and normal renal tissues, suggesting that global transcript separated RCCs based on their level of differentiation. Conversely, genes overexpressed by RCCs of group a represented a minority and were coordinately expressed by the other RCCs and other primary epithelial cancers. Thus, one could conclude that if no information was available about the transcriptional profile of paired normal tissues, it would have been difficult to explain the separation of the two major classes of RCCs (cluster a compared with the close subcluster b or c), and it might have been tempting to attribute such differences to separate taxonomies of disease as described previously in the context of metastatic melanoma (2)
before serially analyzing identical lesions during disease progression (7)
. In conclusion, the identification of molecular subcategories of disease may be influenced by the level of dedifferentiation of a given cancer rather than portray a distinct disease taxonomy.
The second observation suggests that removal of lineage-specific genes segregated primary RCCs together with other nonurological cancers because at least some mechanisms of oncogenesis are shared by most cancers. Thus, this study suggests that most genes responsible for segregating cancers of various histology into molecular subclasses are related to the ontogeny of the individual cancers, whereas only a small proportion is responsible for the subclassification of cancers into different molecular entities when ontogeny is removed.
The third important observation demonstrated that although the exclusion of lineage-specific genes closely approximates RCCs with other nonurologic epithelial cancers, several genes (signatures) could be identified as specific for RCCs and, surprisingly, soft-tissue sarcomas. Several of the genes have been described previously by suppression subtractive hybridization (20)
and functional genomics (19)
to be strongly associated with RCCs oncogenesis, such as, for instance, lysyl oxidase and ceruloplasmin. Ceruloplasmin is secreted by human clear cell carcinoma cells in patients (21)
and relevant experimental models (22)
. Caveolin-1 overexpression has been associated with poor prognosis in RCCs (23)
. Enolase 2 (neuronal
-enolase) is a specific serum marker for RCCs (24)
, and experimental models have shown that an isozyme switch from
-enolase to
-enolase occurs during carcinogenesis (25)
, an observation that we confirmed in this study comparing normal renal tissue with RCCs. Similarly, complement component 1 was characterized a long time ago as a risk factor for the development of RCCs (26)
. Collagen type V has been directly associated with the VHL/hypoxia pathway, but its role in oncogenesis has not been determined (27)
, whereas collagen type VI is commonly expressed in RCCs and minimally expressed by normal kidney after development (28)
. Interestingly, both collagen type V and VI were preferentially expressed in sarcomas, although they were coexpressed by RCCs (27)
. Annexin IV has been recently identified as selectively expressed by RCCs using combined two-dimensional gel electrophoresis (29)
. CD68 has been associated with extensive infiltration of RCCs tissue by macrophages (30)
and, together with the expression of major histocompatibility complex molecules, has been previously demonstrated to be strongly increased in RCCs tissues compared with normal renal tissue and benign renal tumors (31)
. In addition, insulin-like growth factor-binding protein 3 coexpressed specifically by RCCs in group a and sarcomas (Fig. 3)
has been described previously by gene expression profiling (8)
.
Recent studies suggested that GST A is expressed in the proximal convoluted tubules of normal renal parenchyma and that expression of GST A is retained mostly by clear cell RCCs (8) . In fact, GST A 2 and 3 were coordinately expressed in most clear cell RCCs (8 of 13) but were not expressed by the chromophobe (R2828 TU) and chromophilic (R2858) RCCs as predicted by others (8) . GST was also expressed by normal renal tissue but not by sarcomas, in line with the anamnestic expression of these genes.
CA IX is an independent predictor of survival in RCCs (18) , whereas CA II is a reliable marker specific for chromophobe RCCs and erratically expressed by clear cell RCCs (30%) (8) . CA II is also expressed in a small proportion of distal convoluted tubules of normal renal parenchyma. Indeed, CA II transcript could be found in normal kidney tissue, suggesting that expression of CA II transcript is related to the ontogenesis of RCCs. Although the highest level of CA II expression was noted in chromophobe RCCs, several clear cell RCCs expressed this gene, suggesting that, at least at the RNA level, there is overlap between the two types of RCCs (8) . Interestingly, CA IX expression was specific for RCCs compared with normal renal tissue, confirming that this gene may be a better target for drug or immune therapy (32) . Interestingly, the expression of CA IX was shared by other primary epithelial cancers but not melanoma metastases, suggesting that the expression of this gene is at least partly associated with oncogenesis and that it may not be solely responsible for the immune responsiveness of these diseases to immune therapy.
Other genes such as phosphofructokinase have been associated with childhood nephroblastomas, but not with adult RCCs (33) . In addition, scavenger receptors have been identified on RCCs cell lines in the past, although no further work has been performed on this subject (34) . Surprisingly, tapasin appeared to be overexpressed in RCCs, which is in contrast with immunohistochemical findings using paraffin-embedded tumor material (35) . It is possible that the increased mRNA tapasin levels are not associated with enhanced protein expression or that the mRNA is produced by normal cells infiltrating RCCs lesions.
Other genes that have been identified in clear cell RCCs were found in this study to also be coexpressed by primary tumors of other histology such as pro-collagen-lysine, 2-oxoglutarate 5-dioxygenase 2 (8) .
Others have compared the transcriptional profile of clear cell RCCs with clinical staging and outcome (19)
. Surprisingly, transcriptional profiling did not differentiate between stage I/II and stage III/IV lesions. On the contrary, and most relevantly, it was found that genetic profiling could segregate clear cell RCCs into two categories, one with a very poor 5-year survival, and the other with a good prognosis. This a very interesting finding suggesting that tumor biology is better represented at the transcriptional level. We also found no correlation between clinical staging and transcriptional profiling; however, analysis of genes whose expression is associated with good prognosis in RCCs (Fig. 5)
suggested that such genes are preferentially expressed by cancers with a genetic profile closer to normal kidney.
In summary, this study is largely confirmatory of other studies in which RCCs-associated markers have been described. However, the three-way comparison between RCCs, normal renal tissue, and assorted nonurologic primary epithelial cancers allowed the differentiation of genes expressed by RCCs as a remnant of their origin from normal kidney from those directly related to the general process of oncogenesis. This differentiation is important because different categories may have diverse biological, diagnostic, and therapeutic value. Genes responsible for lineage specificity may represent poor molecular targets for immune or drug therapy. Most genes associated with oncogenesis are shared with other cancers and may represent better therapeutic targets. Finally, a subset of gene associated with oncogenesis is lineage specific and may provide information regarding the specific biological behavior of individual cancers and facilitate their diagnostic classification. More generally, this study suggests that in large proportion, the molecular portraits of primary tumors are dependent on their ontogeny, and therefore caution should be applied when defining distinct molecularly defined subcategories of cancer.
| FOOTNOTES |
|---|
Note: Supplementary data for this article can be found at Cancer Research Online (http://cancerres.aacrjournals.org).
Requests for reprints: Francesco M. Marincola, Department of Transfusion Medicine, Clinical Center, Building 10, Room 1C-711, 10 Center Drive MSC 1502, Bethesda, MD 20892-1502. Phone: 301-496-9702; Fax: 301-594-1981; E-mail: FMarincola{at}mail.ml.nih.gov
4 http://nciarray.nci.nih.gov/gal_files/index.shtml. ![]()
Received 5/ 6/04. Revised 8/ 4/04. Accepted 8/13/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. F. Basil, Y. Zhao, K. Zavaglia, P. Jin, M. C. Panelli, S. Voiculescu, S. Mandruzzato, H. M. Lee, B. Seliger, R. S. Freedman, et al. Common cancer biomarkers. Cancer Res., March 15, 2006; 66(6): 2953 - 2961. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Panelli, E. Wang, and F. M. Marincola The Pathway to Biomarker Discovery: Carbonic Anhydrase IX and the Prediction of Immune Responsiveness Clin. Cancer Res., May 15, 2005; 11(10): 3601 - 3603. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |