| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Molecular Biology, Pathobiology, and Genetics |
1 Veridex, LLC, a Johnson & Johnson company, San Diego, California; Departments of 2 Surgery, 3 Biostatistics, and 4 Pathology, and 5 Comprehensive Cancer Center, University of Michigan, Ann Arbor, Michigan; and 6 Department of Pathology, University of Virginia, Charlottesville, Virginia
Requests for reprints: Mitch Raponi, Veridex LLC, a Johnson & Johnson company, Molecular Diagnostics, 3210 Merryfriend Row, San Diego, CA 92121. Phone: 858-320-3319; E-mail: mraponi1{at}vrxus.jnj.com.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
13% of all cancer diagnoses. Nonsmall-cell lung cancers (NSCLC) compose the majority (
80%) of bronchogenic carcinoma with a lesser fraction being small-cell lung carcinomas (SCLC). The three main subtypes of NSCLC are adenocarcinoma (40%), squamous cell carcinoma (SCC; 40%), and large-cell cancer (20%). Adenocarcinoma has replaced SCC as the most frequent histologic subtype over the last 25 years. The treatment options for most patients with NSCLC are not different between either of these histologic subtypes. Currently, many early-stage lung cancer patients, such as those with stage IA disease, receive only surgery whereas other patients generally receive adjuvant chemotherapy and, possibly, radiation therapy following surgery (Lung Cancer Treatment Guidelines for Patients, National Comprehensive Cancer Network, May 2005 version 2). The overall 10-year survival rate of patients with NSCLC is a dismal 8% to 10%. Approximately 25% to 30% of patients with NSCLC have stage I disease and, of these, 35% to 50% will relapse within 5 years after surgical treatment (1, 2). It is currently not possible to identify those patients that are of high risk of relapse. The ability to identify high-risk patients among the stage I disease group will allow for the consideration of additional therapeutic intervention. This potentially could lead to an improved survival in these patients. Indeed, recent clinical trials have shown that adjuvant therapy following resection of lung tumors can lead to improved survival in early-stage NSCLC (3, 4). Specifically, Kato et al. (3) showed that adjuvant chemotherapy with uracil-tegafur improved survival among patients with completely resected pathologic stage I lung adenocarcinoma, particularly with T2 disease. It was also found that early-stage patients who received vinorelbine and cisplatin after surgery had an overall survival of 94 months compared with 73 months in those patients who did not receive the adjuvant therapy (4).
Microarray gene expression profiling has recently been used to define prognostic signatures in patients with lung adenocarcinomas (59); however, whereas several studies have investigated diagnostic profiles in lung SCC, there have been no similar large studies that have investigated gene expression profiles of prognosis in this subtype of NSCLC (10). Here, we report the profiling of a large set of 130 lung SCC samples using Affymetrix U133A GeneChips. Hierarchical clustering and Cox modeling identified genes that correlate with patient prognosis. Unsupervised clustering has also identified a SCC subtype showing an aggressive clinical profile. In addition, we have combined this SCC signature with our previous prognostic signature for lung adencarcinoma to generate a classifier to capture the majority of NSCLC (5). The combined signature generated from both SCC and adenocarcinoma data sets was validated in an independent set of 72 NSCLC samples (11). These signatures and further understanding of their associated biological information could be used to better manage patient treatment following initial surgery.
| Materials and Methods |
|---|
|
|
|---|
Patient population. One hundred-thirty fresh frozen, surgically resected SCC lung tissue from 129 individual patients (LS-71 and LS-136 were duplicate samples from different areas of the same tumor) from all stages of lung SCC were evaluated in this study. Supplementary Table S1 lists the clinical data associated with all lung SCC samples used in this study. The 86 lung adenocarcinoma samples have previously been described (5). Table 1 shows the clinical information associated with both adenocarcinoma and SCC samples used for identifying and testing the prognostic signatures. Patients were censored from statistical analysis if they were alive but had <3 years of clinical follow-up. Censored patients were excluded from the receiver operating characteristic (ROC) analysis and the calculation of sensitivity and specificity but they were included in the Cox regression for marker discovery. The independent NSCLC data set was composed of 36 lung adenocarcinoma (27 stage I) and 36 lung SCC samples (25 stage I) with at least 3 years of follow-up (Gene Omnibus data set: GSE3141; ref. 11).
|
Real-time quantitative reverse transcription-PCR. Total RNA samples were first quality tested by capillary electrophoresis using a Bioanalyzer (Agilent, Palo Alto, CA). For aRNA, the Ribobeast 1-Round Aminoallyl-aRNA Amplification Kit (Epicentre, Madison, WI) was used. Real-time quantitative reverse transcription-PCR (RT-PCR) analyses were done on the ABI Prism 7900HT sequence detection system (Applied Biosystems, Foster City, CA).
Immunohistochemistry. Immunohistochemistry was done on tissue microarrays containing 60 lung SCCs. These tumors were from a subset of the ones analyzed by microarray. The 60 samples chosen for the tissue microarray were randomly selected and included 31 stage 1, 9 stage II, and 19 stage III SCC samples. Areas of the tumor that best represented the overall morphology were selected for generating a tissue microarray block as previously described (13). All controls stained negative for background.
Statistical analysis. Three statistical methods were done to identify the optimal prognostic gene signature: Cox proportional-hazard regression modeling, bootstrapping, and a 5-fold cross-validation. Hierarchical clustering was done to identify major clusters of patients and investigate their association with patient co-variates. The method for developing a risk index is similar to that previously described (14). Pathway analysis was done by first mapping genes to the Biological Process categories of Gene Ontology and then calculating the significance of overrepresented categories in the selected gene list.
| Results |
|---|
|
|
|---|
As expected, Kaplan-Meier analysis indicated that stage III patients had significantly worse overall survival compared with stages I and II (Supplementary Fig. S1A). Stage II patients could be seen to have a worse overall survival compared with those patients with stage I only within the first year of follow-up. It was also noted that the overall survival rate in this population of older patients (mean age, 68 ± 10 years) converged beyond 5 years for all stages (data not shown). This is likely due to confounding diseases in this population. Because the majority of patients relapse within the first 3 years, we used the 3-year time point as a cutoff in our survival analyses (15).
We asked the question whether there are gene expression profiles that correlate with outcome irrespective of stage of disease. To this end, we did both unsupervised and supervised statistical analyses. The 130 lung SCC samples were clustered based on unsupervised hierarchical clustering of the remaining 11,101 genes. Samples that originated from different areas of the same tumor (LS71 and LS136) clustered next to each other and their correlation was 0.92, indicating that the gene expression profiling measures were reproducible (data not shown). The mean value of these duplicate samples was used in further analyses. Interestingly, the resulting 129 lung SCC samples clustered into two main groups: one containing 55 patients and the other with 74 patients (Fig. 1A ). Similar clusters were observed when the analysis was done with 14K, 11K, 9K, or 7K genes (data not shown). No significant association between the two clusters and tumor stage, differentiation, or patient gender was identified. There were approximately equal proportions of each stage present in both clusters (cluster 1 consisted of 31 stage I, 15 stage II, and 9 stage III patients; cluster 2 consisted of 42 stage I, 18 stage II, and 14 stage III patients). However, the patients in clusters 1 and 2 showed significantly different overall survival (Fig. 1B; P = 0.036), indicating that gene expression profiles, irrespective of stage, existed that were associated with overall survival (Fig. 1B).
|
|
Identification of a SCC prognostic gene signature. To identify a robust set of mRNA transcripts that could stratify patients into good and poor prognostic groups, we employed Cox proportional hazard modeling. The 129 SCC samples were analyzed using a 5-fold cross-validation to determine the optimal number of gene transcripts to be used in the prognostic signature (Fig. 2A ). When increasing numbers of gene transcripts were plotted versus the mean performance from 100 five-fold cross-validations, as measured by the average area under the curve (AUC) of the Receiver Operating Characteristic (ROC) curves and using the death within 3 years as the defining point, it could be seen that the signature performance began to plateau at around 50 probe sets (Fig. 2B). Therefore, the top ranked 50 probe sets were used in the classifier (Supplementary Table S4).
|
|
To summarize the prognostic value of these two classifiers in NSCLC, we analyzed the independent SCC and adenocarcinoma samples as a single data set. Thus, taken together, the SCC and adenocarcinoma classifiers were tested in the complete set of 72 independent NSCLC samples. The resulting ROC curve showed an AUC of 0.71 (Fig. 4A ). Kaplan-Meier analysis showed a significant stratification in overall survival (log-rank P = 0.0002; HR, 3.54; 95% CI, 1.74-7.19; Fig. 4B). The overall specificity and sensitivity of the two classifiers were 77% (95% CI, 0.59-0.88) and 64% (95% CI, 0.43-0.80), respectively. We then tested the performance of the two classifiers in the subset of 52 independent NSCLC stage I samples and the ROC curve showed an AUC of 0.70 (Fig. 4C). Similarly, Kaplan-Meier analysis showed a significant stratification in overall survival in this clinically relevant group of patients (log-rank P = 0.0012; HR, 3.86; 95% CI, 1.61-9.24; Fig. 4D). These data indicate that the combination of the two classifiers can identify early-stage NSCLC patients with either a good or poor prognosis.
|
| Discussion |
|---|
|
|
|---|
We first analyzed a large set of 129 lung SCC tumors and found evidence of a novel subset of tumors associated with a more aggressive clinical behavior. Hierarchical clustering showed two major subtypes of SCC, one of which was at a significantly higher risk of death within 3 years. There was no association with the SCC subtypes and tumor grade (differentiation). However, the expression profile associated with the poor prognostic subgroup was indicative of a dedifferentiated phenotype. Specifically, genes related to epidermal differentiation were down-regulated in the poor prognostic group, including many cytokeratins and small proline-rich proteins. Immunohistochemistry showed that several of the cytokeratins and tyrosine kinase receptor FGFR2 were associated with tumor cells and not the stroma. We also found that for FGFR2, protein expression correlated with clinical outcome in our SCC samples. Our finding that some tyrosine kinase receptors are down-regulated in aggressive SCC is consistent with the finding of Muller-Tidow et al. (19) who mapped expression of 56 known tyrosine kinase receptors in 70 early-stage NSCLC samples, including 31 SCCs. In that study, reduced expression of FGFR2 and NTRK2 showed a trend towards decreased risk of distant metastasis.
Recently, Inamura et al. (18) also found two subclasses present in 48 lung SCC samples using a cDNA array platform. In their analyses, several biological process categories were identified as being enriched in SCC compared with normal lung tissue, including epidermal development. They also showed that the two subgroups showed different clinical outcomes; however, their analysis did not associate the epidermal development category with these clusters. This may be due to the different platforms used (different genes analyzed), their small data set, and the different algorithms for selecting functional categories. Further, the authors did not identify a prognostic classifier and, unfortunately, the raw data were not available from their study for independent analysis. Nevertheless, their report does confirm our finding of two expression-based SCC subtypes with distinct clinical outcomes.
To develop the most robust prognostic signature for SCC, we did Cox proportional hazard modeling on all 129 SCC samples. An optimal set of 50 gene transcripts was identified and then tested in a completely independent set of 36 SCC samples from an independent institution (11). The performance of this classifier provided an overall predictive accuracy of 68%. Using a cutoff defined in the training set, a specificity of 84% and a sensitivity of 41% were achieved and Kaplan-Meier analysis showed significant stratification of high-risk and low-risk patients (log-rank P = 0.04; HR, 2.66; 95% CI, 1.01-7.05). Whereas only 25 patients samples were available that had stage I SCC and 3-year follow-up, a similar trend was also seen when analyzed with this classifier (P = 0.09). We believe that further analysis of larger lung SCC test sets will allow a refinement of this classifier and provide better understanding of the biological basis for associations with clinical outcome.
To determine whether an expression-based classifier can capture the majority of all NSCLCs, we combined the classifier obtained from the 129 SCC samples with our previously identified 50-gene classifier for 86 lung adenocarcinomas (5). The two classifiers were tested in an independent set of 52 stage I NSCLC samples (36 adenocarcinoma and 36 SCC; ref. 11). This dual signature clearly stratified good and poor prognostic patients with a HR of 3.86 (95% CI, 1.61-9.24; log-rank P = 0.0012).
It is interesting to note that the performance of the adenocarcinoma classifier showed a better predictive accuracy than the SCC classifier (adenocarcinoma AUC = 0.83, SCC AUC = 0.68). This could be due to the heterogeneity of the SCC samples as indicated by the two distinct subgroups showing differing clinical outcomes in this tumor type. The independent SCC population may not have captured this heterogeneity because it was a relatively small sample size. Indeed, we did not find evidence of the two subgroups in this set of 36 samples (data not shown). Therefore, testing in a larger independent set of SCC samples will be required to further validate this signature.
This is the first large set of well pathologically and clinically annotated lung SCC to undergo gene expression profiling. Similarities and differences between several previous analyses of SCC are apparent; however, the value of large data sets of similarly analyzed samples will allow more complete analyses of groups and subgroups of these tumors and genes most associated with them. Ongoing collaborative studies on large numbers of lung adenocarcinomas by National Cancer Institute Director's Challenge investigators will provide similar data sets. The many specific individual genes and their potential involvement in lung tumor biology and clinical outcome will require more extensive analyzes as we have recently shown for several such candidates (2022). Prognostic genes acting in both SCC and adenocarcinoma is not surprising but defining the similarities and differences will allow greater understanding of this deadly disease and possibly better diagnostic or therapeutic approaches. In conclusion, our lung SCC and adenocarcinoma prognostic signatures, when used together in a combinatorial approach, identified early-stage NSCLC patients who have a poor prognosis. Subsequent studies will examine the predictive value of our classifier for possible recommendation of adjuvant therapy in those patients with early-stage lung cancer but seem to have biologically aggressive disease.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank Drs. John F. Palma and Dimitri Talantov for critical comments on the manuscript.
| Footnotes |
|---|
The data discussed in this publication have been deposited in National Center for Biotechnology Information Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) and are accessible through GEO Series accession number GSE4573.
Received 3/30/06. Revised 5/ 9/06. Accepted 5/18/06.
| References |
|---|
|
|
|---|
B, perturbs cell cycle, and is associated with poor outcome in lung adenocarcinomas. Proc Natl Acad Sci U S A 2005;102:1250712.This article has been cited by other articles:
![]() |
K. A. Hassan, G. Chen, G. P. Kalemkerian, M. S. Wicha, and D. G. Beer An Embryonic Stem Cell-Like Signature Identifies Poorly Differentiated Lung Adenocarcinoma but not Squamous Cell Carcinoma Clin. Cancer Res., October 15, 2009; 15(20): 6386 - 6390. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ocak, M. L. Sos, R. K. Thomas, and P. P. Massion High-throughput molecular analysis in lung cancer: insights into biology and potential clinical applications Eur. Respir. J., August 1, 2009; 34(2): 489 - 506. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Raponi, L. Dossey, T. Jatkoe, X. Wu, G. Chen, H. Fan, and D. G. Beer MicroRNA Classifiers for Predicting Prognosis of Squamous Cell Lung Cancer Cancer Res., July 15, 2009; 69(14): 5776 - 5783. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Hsu, C. R. Acharya, B. S. Balakumaran, R. F. Riedel, M. K. Kim, M. Stevenson, S. Tuchman, S. Mukherjee, W. Barry, H. K. Dressman, et al. Characterizing the developmental pathways TTF-1, NKX2-8, and PAX9 in lung cancer PNAS, March 31, 2009; 106(13): 5312 - 5317. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. Boutros, S. K. Lau, M. Pintilie, N. Liu, F. A. Shepherd, S. D. Der, M.-S. Tsao, L. Z. Penn, and I. Jurisica Prognostic gene signatures for non-small-cell lung cancer PNAS, February 24, 2009; 106(8): 2824 - 2828. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Broet, S. Camilleri-Broet, S. Zhang, M. Alifano, D. Bangarusamy, M. Battistella, Y. Wu, M. Tuefferd, J.-F. Regnard, E. Lim, et al. Prediction of Clinical Outcome in Multiple Lung Cancer Cohorts by Integrative Genomics: Implications for Chemotherapy Selection Cancer Res., February 1, 2009; 69(3): 1055 - 1062. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bulk, B. Sargin, U. Krug, A. Hascher, Y. Jun, M. Knop, C. Kerkhoff, V. Gerke, R. Liersch, R. M. Mesters, et al. S100A2 Induces Metastasis in Non-Small Cell Lung Cancer Clin. Cancer Res., January 1, 2009; 15(1): 22 - 29. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Roepman, J. Jassem, E. F. Smit, T. Muley, J. Niklinski, T. van de Velde, A. T. Witteveen, W. Rzyman, A. Floore, S. Burgers, et al. An Immune Response Enriched 72-Gene Prognostic Profile for Early-Stage Non-Small-Cell Lung Cancer Clin. Cancer Res., January 1, 2009; 15(1): 284 - 290. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L. Guo, Y.-W. Wan, K. Tosun, H. Lin, Z. Msiska, D. C. Flynn, S. C. Remick, V. Vallyathan, A. Dowlati, X. Shi, et al. Confirmation of Gene Expression-Based Prediction of Survival in Non-Small Cell Lung Cancer Clin. Cancer Res., December 15, 2008; 14(24): 8213 - 8220. [Abstract] [Full Text] [PDF] |
||||
![]() |
E.-S. Lee, D.-S. Son, S.-H. Kim, J. Lee, J. Jo, J. Han, H. Kim, H. J. Lee, H. Y. Choi, Y. Jung, et al. Prediction of Recurrence-Free Survival in Postoperative Non-Small Cell Lung Cancer Patients by Using an Integrated Model of Clinical Information and Gene Expression Clin. Cancer Res., November 15, 2008; 14(22): 7397 - 7404. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Raz, M. R. Ray, J. Y. Kim, B. He, M. Taron, M. Skrzypski, M. Segal, D. R. Gandara, R. Rosell, and D. M. Jablons A Multigene Assay Is Prognostic of Survival in Patients with Early-Stage Lung Adenocarcinoma Clin. Cancer Res., September 1, 2008; 14(17): 5565 - 5570. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Skrzypski, E. Jassem, M. Taron, J. J. Sanchez, P. Mendez, W. Rzyman, G. Gulida, D. Raz, D. Jablons, M. Provencio, et al. Three-Gene Expression Signature Predicts Survival in Early-Stage Squamous Cell Carcinoma of the Lung Clin. Cancer Res., August 1, 2008; 14(15): 4794 - 4799. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bulk, A. Hascher, R. Liersch, R. M. Mesters, S. Diederichs, B. Sargin, V. Gerke, M. Hotfilder, J. Vormoor, W. E. Berdel, et al. Adjuvant Therapy with Small Hairpin RNA Interference Prevents Non-Small Cell Lung Cancer Metastasis Development in Mice Cancer Res., March 15, 2008; 68(6): 1896 - 1904. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. C. Finger, R. S. Turley, M. Dong, T. How, T. A. Fields, and G. C. Blobe T{beta}RIII suppresses non-small cell lung cancer invasiveness and tumorigenicity Carcinogenesis, March 1, 2008; 29(3): 528 - 535. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Sun, D. A. Wigle, and P. Yang Non-Overlapping and Non-Cell-Type-Specific Gene Expression Signatures Predict Lung Cancer Survival J. Clin. Oncol., February 20, 2008; 26(6): 877 - 883. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Lau, P. C. Boutros, M. Pintilie, F. H. Blackhall, C.-Q. Zhu, D. Strumpf, M. R. Johnston, G. Darling, S. Keshavjee, T. K. Waddell, et al. Three-Gene Prognostic Classifier for Early-Stage Non Small-Cell Lung Cancer J. Clin. Oncol., December 10, 2007; 25(35): 5562 - 5569. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Janes and F. McCaughan Where next for lung cancer gene expression profiling? Eur. Respir. J., July 1, 2007; 30(1): 5 - 6. [Full Text] [PDF] |
||||
![]() |
J. E. Larsen, S. J. Pavey, R. Bowman, I. A. Yang, B. E. Clarke, M. L. Colosimo, N. K. Hayward, and K. M. Fong Gene expression of lung squamous cell carcinoma reflects mode of lymph node involvement Eur. Respir. J., July 1, 2007; 30(1): 21 - 26. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Larsen, S. J. Pavey, L. H. Passmore, R. V. Bowman, N. K. Hayward, and K. M. Fong Gene Expression Signature Predicts Recurrence in Lung Adenocarcinoma Clin. Cancer Res., May 15, 2007; 13(10): 2946 - 2954. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Larsen, S. J. Pavey, L. H. Passmore, R. Bowman, B. E. Clarke, N. K. Hayward, and K. M. Fong Expression profiling defines a recurrence signature in lung squamous cell carcinoma Carcinogenesis, March 1, 2007; 28(3): 760 - 766. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. Herbst and S. M. Lippman Molecular Signatures of Lung Cancer -- Toward Personalized Therapy N. Engl. J. Med., January 4, 2007; 356(1): 76 - 78. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |