| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Molecular Biology, Pathobiology, and Genetics |
1 Department of Hematology-Oncology, Mayo Clinic, Scottsdale, Arizona; 2 Division of Hematology, Mayo Clinic, Rochester, Minnesota; 3 Translational Genomics Research Institute, Phoenix, Arizona; 4 Department of Computer Science and Engineering, Arizona State University, Tempe, Arizona; and 5 Millenium Pharmaceuticals, Inc., Cambridge, Massachusetts
Requests for reprints: Wee J. Chng, Division of Hematology-Oncology, Mayo Clinic, Johnson Research Building, 13400 East Shea Boulevard, Scottsdale, AZ 85260. Phone: 480-301-6363; Fax: 480-301-9162; E-mail: Chng.wee{at}mayo.edu.
| Abstract |
|---|
|
|
|---|
B signaling and antiapoptosis. These patients have better response to bortezomib as compared with patients within other clusters (70% versus 29%; P = 0.02). Furthermore, for a group of patients generally thought to have better prognosis, a cluster of patients with short survival (cluster 1; median survival, 27 months) could be identified. This analysis illustrates the heterogeneity within H-MM and the importance of defining specific cytogenetic prognostic factors. Furthermore, the signatures that defined these clusters may provide a basis for tailoring treatment to individual patients. [Cancer Res 2007;67(7):29829] | Introduction |
|---|
|
|
|---|
H-MM constitutes
50% to 60% of multiple myeloma (1, 4), yet little of its biology is known. Overall survival is favorable for patients with H-MM, but some H-MM patients clearly have a more aggressive variant of the disease (5, 6). In view of this, we hypothesize that H-MM is likely to be biologically heterogeneous, and understanding this heterogeneity may provide a better understanding of the underlying biology, identify possible molecular therapeutic targets, and allow better risk stratification of patients.
In this study, we use gene expression profiling (GEP) to characterize the molecular profile of H-MM, with a view of gaining some insights about its biology, and to study the heterogeneity within H-MM.
| Materials and Methods |
|---|
|
|
|---|
13) were assessed by fluorescence in situ hybridization (FISH) with concurrent immunofluorescent staining for clonotypic cytoplasmic immunoglobulin (7) using probes previously described (8). Plasma cell labeling index (PCLI) was assessed using a slide-based method as previously described (9). Where data were available, patients were also staged according to the International Staging System (10). A summary of clinical information of H-MM patients is presented in Supplementary Table S1. Survival data were available from 50 of the H-MM patients (37 newly diagnosed and 13 relapsed). Twenty-seven of the 50 patients have died. The median and minimum durations of follow-up from time of sampling for the surviving patients are 30 and 13 months, respectively. The study was approved by the Mayo Clinic Institutional Review Board. Patients entered into bortezomib trials (group 2). For validation of subtype signatures detected in group 1 patients, we used a cohort of 86 H-MM entered into bortezomib trials (11). For the assessment of response and progression-free survival to bortezomib, we analyzed a subset of 63 H-MM patients entered into the international phase 3 clinical trial (APEX) of bortezomib versus dexamethasone. In total, 29 patients received single-agent bortezomib whereas the other 34 patients were randomized to dexamethasone. These patients all had relapsed disease. The European Blood and Marrow Transplant group criteria were used to defined response and progression in these patients (12). In this analysis, we considered patients with minimal response or better as having a response. For response analysis, only patients treated with bortezomib were included.
University of Arkansas patients (group 3). As an additional validation cohort, 91 newly diagnosed patients with H-MM from the University of Arkansas with publicly available gene expression data were assessed (13).
Assignment of Hyperdiploid Cases
For group 1 patients, hyperdiploidy was determined by a FISH-based trisomy index that is highly specific for H-MM (14). For group 2 and 3 patients, the H-MM subset is identified using the TC (Translocation and Cyclin D) classification (13) based on the expression of only seven genes (FGFR3, MMSET, ITGB7, MAF, CCND3, CCND1, and CCND2). Only those patients assigned the D1 TC group (aberrant low level of CCND1 expression) that has been shown to unambiguously correspond to H-MM (13, 15) were selected for further analysis. (For details, refer to Supplementary methods).
Gene Expression Profiling
RNA isolation, purification, and microarray hybridization have previously been reported for group 1 (16), group 2 (Mulligan et al. Blood, in press), and group 3 (13). Gene expression analysis was done on CD138+ selected (groups 1 and 3) and negative-selection enriched (group 2) plasma cell RNA using the Affymetrix U133A (group 1), the U133A and B (group 2), and U95Av2 chips (group 3; Affymetrix, Santa Clara, CA). For the Mayo cohort, gene expression intensity values were generated using the Affymetrix MAS 5.0 software. For normalization, the data were scaled by global methods to a target value of 500, log transformed, median centered, and analyzed using GeneSpring 7 (Agilent Technologies, Palo Alto, CA). The gene expression data are deposited on the Gene Expression Omnibus6 (accession no. GSE 6477).
Gene Expression Analysis
To define the molecular signature of H-MM, we did a Student's t test with Benjamini and Hochberg multiple testing corrections between hyperdiploid and nonhyperdiploid cases in group 1. Differentially expressed genes with adjusted P < 0.01 were selected for further Gene Ontology analysis using the Gene Ontology mining tool available in GeneSpring and pathway/network analysis using a web-based software tool, MetaCore (GeneGo, Inc., St Joseph, MI). MetaCore contains an interactive, manually annotated database derived from literature publications on proteins and small molecules that allows for representation of biological functionality and integration of functional, molecular, or clinical information. Several algorithms to enable both the construction and analysis of gene networks are integrated as previously described (17). The output P values reflect scoring, prioritization, and statistical significance of networks according to relevance of input data.
To investigate possible molecular heterogeneity within H-MM, we did a recursive analysis using a combination of supervised and unsupervised methods that allows identification of patient clusters defined by nonoverlapping genes. Genes that varied across the individual H-MM samples were first identified by Welch ANOVA using variance computed by applying the Cross-Gene Error Model based on Deviation from 1 available within GeneSpring. This method overcomes the lack of replicates and variance associated with the individual samples and is similar in principle to variance filtering. The filtered gene list (see Supplementary Table S2 for gene list used for each recursive iteration) was then used to cluster group 1 H-MM samples using the hierarchical agglomerative algorithm (18). Pearson's correlation coefficient and centroid linkage were used as similarity and linkage methods, respectively. The most highly correlated gene cluster (at a distance of 0.7) that characterized the most tightly clustered patients (at a distance of 0.7) was selected and used to recluster the patients. Patients with 2-fold higher (than median) expression of 50% of genes within the selected gene cluster were then assigned to a cluster. These patients were removed from the overall cohort and the analysis repeated on the remaining patients until no obvious large clusters could be identified (Fig. 1 ). Subsequently, genes that distinguished these clusters, requiring that the distinguishing genes have 2-fold higher expression than in patients not in the cluster, and significantly different expression by t test with Benjamini and Hochberg multiple testing corrections, were generated. For validation, the same probe sets that distinguished these clusters were used to cluster group 2 samples. For group 3 samples, the best match probe sets on the Affymetrix U95Av2 chip compared with U133A chip were identified using Affymetrix mapping information7 and used for clustering.
|
Statistical Analysis
Nonparametric Mann-Whitney U test was used for comparison of continuous variables. Categorical variables were compared using the
2 test. The distribution for overall survival was estimated using the method of Kaplan and Meier. The log-rank test was used to test for differences in survival between groups. Overall survival was calculated from diagnosis to death for group 1 patients. For group 2 patients, progression-free survival (PFS) was calculated from day of commencing bortezomib until disease progression. P < 0.05 was considered significant.
| Results |
|---|
|
|
|---|
B (NF-
B), c-myc, and mitogen-activated protein kinase (MAPK)], raising the possibility that one consequence of this protein biosynthesis profile in H-MM is deregulation and overproduction of oncogenic proteins (Supplementary Fig. S1A).
|
Of the 48 overexpressed genes involved in protein biosynthesis, only 6 were not located on the common trisomic chromosomes (chromosomes 3, 5, 7, 9, 11, 15, 19, and 21), suggesting that chromosomal copy number amplification is the most common mechanism for up-regulation of these genes. The overexpression of ribosomal protein genes, which constitute a subset of the protein biosynthesis genes, is particularly striking. Twenty-nine of 74 ribosomal proteins genes whose probes were present in the U133A chip were overexpressed in H-MM at a false discovery rate of 1% by the Benjamini and Hochberg method. Sixty ribosomal protein genes were overexpressed if a false discovery rate of 5% was used. Interestingly, most of these overexpressed ribosomal protein genes are located on the commonly trisomic chromosomes. When the ratio of normalized expression of each ribosomal protein gene in H-MM compared with NH-MM was calculated, these were higher for those genes present on the commonly trisomic chromosomes with a ratio close to 1.5 (Supplementary Fig. S1B). These findings suggest the overexpression of ribosomal protein/protein biosynthesis genes in H-MM relative to NH-MM may be primarily due to a gene dosage effect.
Recursive analysis reveals four putative clusters of H-MM. Our recursive analysis identified four putative clusters of H-MM characterized by the expression of largely mutually exclusive gene sets (Fig. 2A ). When the same 73 probe sets were used to cluster the 63 relapsed H-MM patients entered into bortezomib trials, four clusters were again apparent (Fig. 2B). When the 42 best matching probe sets for the U95Av2 chip were used to cluster group 3 patients, the same four clusters can be identified (Fig. 2C), suggesting the validity and reproducibility of the four clusters. When the same gene list was used to cluster the 37 NH-MM in the Mayo data set or 13 asymptomatic hyperdiploid tumors (monoclonal gammopathy of undetermined significance and smoldering multiple myeloma), clusters expressing similar set of genes could not be clearly identified (Supplementary Fig. S2), suggesting that these clusters identified by the specific gene expression signatures seem to be unique to H-MM. Furthermore, a similar recursive analysis of t(11;14) samples identifies subsets defined by genes that are completely different from the genes that define the different H-MM clusters.8
|
B signaling and antiapoptosis (NFKB2, CFLAR, BIRC3, CKIP-1, TNFAIP3, and RRAS2). Cluster 4 was less well defined except for lack of high expression of the above genes with low expression of HGF in particular. This was not the result of differences in trisomies because there is significant overlap in the composite of trisomies between these two clusters of patients.
Gene set enrichment analysis was done to derived greater biological insights into the different clusters. Cluster 1 was enriched for genes involved in cell cycle and proliferation as well as genes with E2F binding motifs; cluster 2 was enriched for genes involved in receptor-ligand signaling [e.g., insulin-like growth factor I (IGF-I), vascular endothelial growth factor (VEGF), and platelet-derived growth factor (PDGF)]; cluster 3 was enriched for genes involved in activation of NF-
B pathways [e.g., tumor necrosis factor (TNF) receptor 2, TNF
, NF-
B, and inflammatory response pathways] and regulation of apoptosis (e.g., apoptosis and death pathways), as well as genes with NF-
B binding motif; and cluster 4 was enriched by genes involved in Wnt signaling pathways (e.g., activin receptorlike kinase pathway and Wnt targets; Supplementary Tables S5S8). Of interest, the enriched gene sets for the different clusters also seem to be mutually exclusive, further suggesting that these different GEP-defined clusters are likely to be biologically distinct.
GEP-identified H-MM subclasses have unique clinical associations. Cluster 1 tumors were more proliferative (median PCLI, 3.8; P < 0.05). Cluster 3 consisted of predominantly newly diagnosed cases with none of the patients having International Staging System stage III disease. There was no obvious association between any of the clusters with bone disease and
13 (Table 2
). To assess if these signatures represented transcriptional switches at progression, we compared the prevalence of these signatures in the pooled newly diagnosed (group 1 and group 3 patients) and relapsed cases (group 1 and group 2 patients). The prevalence of all four clusters was similar in newly diagnosed compared with relapsed cases (P = 0.16), suggesting that these profiles do not represent progression signatures.
|
B signature, we hypothesized that patients belonging to this cluster should have better response to bortezomib. As predicted, cluster 3 patients had better response to bortezomib compared with patients belonging to other clusters (70% versus 29%; P = 0.02). On the other hand, there was no statistically significant difference in response to dexamethasone between cluster 3 patients and the others (36% versus 50%; P = 0.24). The PFS of cluster 3 patients treated with bortezomib was significantly longer (median PFS, 253 versus 90 days; log-rank P = 0.0002), whereas if they were treated with dexamethasone, the survival is not significantly different (median PFS, 94 versus 168 days; log-rank P = 0.17). When considering only cluster 3 patients, the difference in PFS between bortezomib- and dexamethasone-treated patients was highly significant (median PFS, 253 versus 94 days; log-rank P = 0.0002). Furthermore, the PFS of cluster 3 H-MM patients was almost twice as long as the PFS of all other patients, including the NH-MM patients (median PFS, 253 versus 127 days; log-rank P = 0.13), although this did not reach statistical significance probably due to the relatively small number of cluster 3 patients (Fig. 3
).
|
|
13 and chromosome 1q amplification and lower prevalence of trisomy 11 have poorer survival (6). Among our GEP-defined clusters, the prevalence of
13 and trisomy 11 were similar. However, 1q amplification (assessed by gene expressionderived pseudo-CGH; see Supplementary methods) was significantly more common in cluster 1 (cluster with worst prognosis). It was therefore possible that the poor prognosis H-MM group identified by Carrasco et al. is mainly mediated through association with 1q amplification (and not
13 or lack of trisomy 11) and may correspond to an underlying cluster 1 molecular signature. The lack of association between
13 and the different H-MM clusters with different survival in our data set is consistent with recent findings that
13 is not associated with poorer outcome in H-MM (20). | Discussion |
|---|
|
|
|---|
4 years with no plateau in the survival curve (20). Recent studies have suggested possible clinical and molecular heterogeneity within H-MM (6, 15). However, there has been no comprehensive study that characterizes H-MM in terms of its molecular phenotype and heterogeneity. In this study, we use global GEP to first characterize the molecular signature of H-MM and then to dissect the heterogeneity within H-MM. We found that a protein biosynthesis signature that is primarily driven by copy number changes characterizes H-MM. Furthermore, four biologically relevant subtypes with prognostic implications were identified. More importantly, the results of our study provide insights into the biology of H-MM and the deregulated pathways underpinning these subtypes that can be used to direct therapy. The protein biosynthesis signature in H-MM is striking, comprising a third of genes differentially expressed between H-MM and NH-MM and constituting most of the genes overexpressed in H-MM relative to NH-MM. Importantly, this signature has now been reproduced in two other studies by different groups using different analysis methods. In a recently published study, in an attempt to find gene signatures that define the different TC groups, a similar signature was found in nine patients with low (but abnormal) CCND1 expression and lacking IgH translocations, which has been shown to correspond to H-MM patients. This signature was reproduced by meta-analysis of two other data sets (21). In another meeting report, a German group, in an attempt to find genes that discriminate between multiple myeloma tumors with and without chromosome 13 deletion, found that protein biosynthesis genes predominate in tumors lacking chromosome 13 deletion, which are mainly H-MM (22). In addition, there is significant overlap between our H-MM signature and the top 50 genes overexpressed and underexpressed in the H-MM cluster in a recent study from the University of Arkansas for Medical Sciences (22 of 50 overexpressed and 19 of 50 underexpressed genes; ref. 15).
Overexpression of protein biosynthesis genes, especially ribosome protein genes, may be the result of rapid cellular proliferation and an indirect consequence of cellular transformation. This is unlikely to be the case in H-MM because these tumors are not more proliferative than NH-MM. Instead, overexpression of these genes seemed to be primarily driven by gene copy number. It has been proposed that cells that contain more ribosomes and translational initiation factors would have an increased translation rate and promote transformation by actually promoting the translation of proteins involved in cellular growth control (e.g., proto-oncogenes) or control of apoptosis (23). Consistent with this view is our network analysis showing that many of the overexpressed genes in H-MM involved in ribosome/protein biosynthesis represented end points of c-myc, NF-
B, and MAPK signaling. Proteomic analysis will provide more insights into the functional consequence of this protein biosynthesis gene expression signature, which may be important for maintenance of the tumor phenotype and survival. As a result, this may represent an avenue for therapeutic intervention of this subtype of multiple myeloma. In our analysis, several components of the mammalian target of rapamycin/P70S6 kinase pathways are deregulated in H-MM compared with NH-MM (4EBP1, EIF4A, S6K1, RPS6, and EEF2), suggesting that this pathway may be exploited therapeutically with mammalian target of rapamycin inhibitors (24).
Within H-MM, we identified four clusters that are reproducible in two independent cohorts of patients with GEP done on different generation of Affymetrix chips. In addition, these clusters are highly reproducible despite using different patient populations (newly diagnosed and relapsed) and different methods of defining hyperdiploidy (although these methods are complementary). Furthermore, these clusters are not related to differences in genetic abnormalities or composite of trisomies or to changes relating to progression.
The signature for each cluster provides insights into its biology. Cluster 1 is defined by overexpression of various CTA genes and mitosis/proliferation-related genes. Consistent with the latter, the tumors seem to be more proliferative when assessed by PCLI. Epigenetic mechanisms (hypomethylation) are implicated in the expression of CTAs in tumors (25). It is therefore possible that the expression of these genes is a surrogate for a deregulated epigenetic mechanism that underlies cluster 1 tumors. Of interest, hypomethylation and decondensation of centromeric heterochromatin have been implicated in the development of 1q amplification in multiple myeloma (26), and 1q amplification is very common in cluster 1 tumors. Indeed, epigenetic silencing of cell cycle inhibitors (2729) or increased copy number of critical cell cycle gene on chromosome 1q such as CKS1B (30) may be responsible for the proliferative phenotype. Cluster 2 is defined by the overexpression of the HGF and IL6 genes. Deregulated HGF signaling induced migration, survival, growth, and invasion of cancer cells via multiple downstream pathways (31). Stimulation of multiple myeloma cells with HGF led to the activation of the Ras/MAPK and phosphatidylinositol 3-kinase/Akt pathways (32). HGF also induces angiogenesis via VEGF. IL-6 mediates similar effects on cellular phenotype and signals through similar pathways (33), all of which are critical in multiple myeloma. Cluster 3 is defined by the overexpression of genes involved in NF-
B signaling, especially genes involved in NF-
Binduced antiapoptosis. NF-
B signaling is constitutively active in multiple myeloma and is important in mediating many downstream effects of the multiple myeloma-bone marrow interaction (33). One of the important consequences is the up-regulation of antiapoptotic molecules resulting in drug resistance and survival of myeloma cells. Therefore, these signatures may represent divergent pathways in H-MM.
More importantly, these disparate and distinct deregulated pathways provide a useful means for targeting subset of patients with specific therapy. A proof of this principle was provided by the analysis of response and progression of bortezomib-treated patients (group 2) belonging to the different clusters. The NF-
B pathway is one of the key therapeutic targets of bortezomib (34), and, as predicted, cluster 3 patients have better response and PFS when treated with bortezomib compared with dexamethasone, whereas this difference is not observed for the other clusters. Although the number of patients included in this analysis is small, the results are rather striking and are probably real. Our results will have to be confirmed in larger cohorts of H-MM patients in the future.
Many CTAs contain epitopes recognized by CTLs and are potential targets for active immunotherapy (24). Several of these antigens have been identified as potential targets for immunotherapy in multiple myeloma (35, 36). A possible treatment strategy for cluster 1 patients may therefore involve immunotherapy to eradicate residual clone following standard induction with stem cell transplantation. For cluster 2 patients, critical activated pathways (Ras/MAPK and phosphatidylinositol 3-kinase/Akt) could be targeted (37, 38).
The clinical outcome of these clusters is different, adding credence to the biological validity and distinction of these subtypes. Cluster 1 patients have the worst prognosis. Expression of CTA has been associated with poor prognosis in other cancer, and the more proliferative nature of these tumors may contribute to the poor prognosis. However, when all patients with proliferative tumors (PCLI
1) are considered, cluster 1 patients still have significantly worse survival (data not shown). Importantly, although H-MM patients are generally thought to have good prognosis, a cohort of patients (cluster 1) with survival of only
2 years could be identified.
Recently, Zhan et al. (15) have published an unsupervised molecular classification of myeloma. They identified a hyperdiploid cluster that contained most of the hyperdiploid cases and a proliferation cluster that also contained some hyperdiploid cases. This proliferation cluster (also express some CTAs) is associated with 1q amplification and poor prognosis and, hence, is similar to the cluster 1 identified in the current study. As their study aimed to derive a molecular classification of all multiple myeloma, they did not identify additional subtypes of H-MM. In the current study, we specifically wanted to molecularly dissect H-MM and have identified additional clusters including one with NF-
B signature that respond very well to bortezomib.
In summary, H-MM is characterized by a protein biosynthesis signature. In addition, four subclasses of H-MM with distinct clinical and biological associations could be identified, including a subgroup that have very bad prognosis and a subgroup that response particularly well to bortezomib. The gene signature associated with each subclass provides a molecular basis for the different clinical behavior and further insights into the deregulated pathways in H-MM that could potentially be exploited therapeutically.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank the investigators from the University of Arkansas for Medical Sciences, Little Rock, AR, as well as investigators involved in the bortezomib pharmacogenomics studies and Millenium Pharmaceuticals for making available the various gene expression analysis data sets for analysis.
| Footnotes |
|---|
R. Fonseca is a Clinical Investigator of the Damon Runyon Cancer Research Fund.
6 http://www.ncbi.nlm.nih.gov/geo/ ![]()
7 http://www.affymetrix.com/support/technical/comparison_spreadsheets.affx ![]()
Received 11/ 3/06. Revised 1/16/07. Accepted 1/18/07.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |