| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Regular Articles |
Discovery Medicine [N. C. T., J. A. S., A. J. D., W. L. T., M. E. B.] and Expression Profiling Informatics [D. K. S.], Wyeth Research, Cambridge, Massachusetts 02140; Clinical Research and Development, Wyeth Research, Collegeville, Pennsylvania 19426 [B. M., G. D.]; University of Texas Health Science Center, San Antonio, Texas 78229 [M. H.]; University of Chicago, Chicago, Illinois 60637 [W. S.]; Indiana University, Indianapolis, Indiana 85012 [T. L.]; Our Lady of Mercy Medical Center, New York Medical College, Bronx, New York 10466 [J. D.]; and Fox Chase Cancer Center, Philadelphia, Pennsylvania 19111 [G. H.]
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
| MATERIALS AND METHODS |
|---|
|
|
|---|
PBMC Preparation, Isolation of RNA, and Hybridization of Targets to Microarrays.
PBMCs from individuals were isolated from whole blood samples (8 ml) collected into cell purification tubes according to the standard procedure. All normal and RCC blood samples were shipped or stored overnight before processing. Total RNA was isolated from PBMC pellets using the RNeasy mini kit (Qiagen, Valencia, CA), and labeled probe for oligonucleotide arrays was prepared using a modification of the procedure described by Lockhart et al. (6)
. Labeled probes were hybridized to oligonucleotide arrays comprising over 12,600 human sequences (HgU95A, Affymetrix), according to the Affymetrix Expression Analysis Technical Manual (Affymetrix).
Gene Expression Data Reduction.
Data analysis and absent/present call determination were performed on raw fluorescent intensity values using GENECHIP 3.2 software (Affymetrix). "Present" calls were calculated by GENECHIP 3.2 software by estimating whether a transcript is detected in a sample based on the strength of the signal of the gene compared with background. The "average difference" values for each transcript were normalized to "frequency" values using the scaled frequency normalization method (7)
, in which the average differences for 11 control cRNAs with known abundance spiked into each hybridization solution were used to generate a global calibration curve. This calibration was then used to convert average difference values for all transcripts to frequency estimates, stated in units of parts per million ranging from 1:300,000 (
3 ppm) to 1:1,000 (1,000 ppm).
Statistical and Clustering Analyses.
Unsupervised hierarchical clustering of genes and/or arrays on the basis of similarity of their expression profiles was performed using the procedure of Eisen et al. (8)
. Nearest neighbor analysis and supervised prediction were performed using Genecluster version 2.0,4
which has been described previously (9)
. For hierarchical clustering and nearest neighbor analysis, data were log transformed and normalized to have a mean value of zero and a variance of one. To identify the disease-associated transcripts, a Students t test was used to compare normal PBMC expression profiles to renal carcinoma PBMC profiles.
Additional Samples from the GeneLogic GX2000 Bioexpress Database and Fold Change Analysis.
Expression profiles measured on HgU95 chips of renal carcinoma biopsies (n = 47) and nonmalignant normal kidney tissues (n = 60), WBCs from nondiseased volunteers (n = 4) and WBCs from non-RCC end-stage renal failure patients (n = 9), or unstimulated CD4 T cells in culture (n = 3) and anti-CD3/anti-CD28-stimulated CD4 T cells in culture (n = 3) were accessed from the GX2000 BioExpress database (GeneLogic, Gaithersburg, MD). Data were processed in Affymetrix Micro Array Suite 4 and then normalized using the GeneLogic normalization algorithm. Fold changes were calculated in the GX2000 Fold Change analysis tool, which uses a geometric mean to calculate average changes in the expression of a gene between groups of samples.
| RESULTS |
|---|
|
|
|---|
An initial unsupervised cluster analysis approach, which hierarchically groups samples and genes based on correlation coefficients (8)
, was performed using the 5249 genes passing the main filtering criteria (Fig. 1A)
. The dendrogram describing sample relationships grouped the majority of RCC PBMCs (42 of 45) into a single RCC-specific cluster, whereas expression patterns of normal PBMCs and a small subset of RCC PBMCs (3 of 45) formed a separate cluster (Fig. 1B)
. A fold change analysis and Students t test (two-tailed distribution; two-sample unequal variance) identified transcripts defined as differentially expressed between RCC PBMCs and normal PBMCs. In total, transcript levels of 184 genes differed, on average, by 2-fold or greater between normal and RCC PBMCs, with an unadjusted P below 0.001 in a t test between the groups. Of these, 132 transcripts were expressed in at least 15% of the PBMC samples (present in 10 or more of the 65 profiles) and are presented in Table 1
.
|
|
In a second approach, we compared the differentially expressed genes in RCC PBMCs with genes differentially expressed between unstimulated CD4+ T cells (n = 3 normal donors) and CD4+ T cells (n = 3 normal donors) stimulated ex vivo with anti-CD3 and anti-CD28 in culture. Stimulated CD4+ T cells possessed 14 transcripts that were greater than 2-fold changed in the same direction (induced or repressed) as the disease-associated transcripts in RCC PBMCs (Table 1)
.
In a third approach, we compared the differentially expressed genes in RCC PBMCs with genes differentially expressed between PBMCs from non-RCC end-stage renal failure patients (n = 9 individuals) and PBMCs from normal volunteers (n = 4 individuals). Of these, nine transcripts differentially expressed in PBMCs from renal failure patients were also disease-associated transcripts in RCC PBMCs (Table 1)
. Thus, our marker gene list from PBMCs of RCC patients contains a subset of markers commonly involved in immune responses measured ex vivo (CD4+ T-cell activation) and in responses of circulating leukocytes to renal dysfunction observed in vivo.
Classification of RCC and Normal Status Using Patterns of Expression in Peripheral Blood Cells.
We next sought to apply our results by determining the ability of a minimal gene set(s) to classify RCC versus normal status using expression patterns in the peripheral blood. To initially build and subsequently train the classifiers, 70% of the RCC PBMCs (n = 31) and 70% of the normal PBMCs (n = 14) were selected randomly and used as the training set. We used the Genecluster default correlation metric (9)
to identify genes with expression levels most highly correlated with the classification vector characteristic of the training set. All 5249 genes meeting the main filter criteria were screened using this approach.
Prediction was also performed in Genecluster using the weighted voting method. In this method, the expression level of each gene in the classifier set contributes to an overall vote on the classification of the sample (10) . The prediction strength is a combined variable that indicates the support for one class or the other and can vary between 0 (narrow margin of victory) and 1 (wide margin of victory) in favor of the predicted class.
Predictor sets containing between 2 and 20 genes were evaluated by leave-one-out cross-validation to identify the predictor set with the highest accuracy for classification of the samples in the training set (Fig. 2A)
. An eight-gene classifier set containing the four top genes up-regulated in RCC and the four top genes down-regulated in RCC was found to yield the highest cross-validation prediction accuracy on the training set. The relative expression levels for this eight-gene classifier set across the training set are shown in Fig. 2B
, and the individual prediction confidence scores for each sample in the training set using this eight-gene classifier set are presented, in the same order, in Fig. 2C
. For illustrative purposes, we assigned a positive sign to the prediction strengths resulting in votes for RCC and a negative sign to prediction strengths resulting in votes for normal PBMCs.
|
| DISCUSSION |
|---|
|
|
|---|
In the present study we identified 132 genes with transcripts detected as present in at least 15% of the samples that varied greater than 2-fold between RCC and normal PBMCs with significance at the 99.9% confidence level or greater. Among the most significantly elevated transcripts (P < 10-7) in RCC PBMCs were several inflammatory-related genes, including Toll-like receptor 2, galectin-3, interleukin 1 receptor antagonist, and aquaporin-9, a water channel implicated in leukocyte migration. The unchanged levels of many other cytokines, chemokines, and their respective receptors between normal and RCC PBMCs suggested that a specific, rather than global, activation of PBMCs constituted an important part of the disease signature in RCC peripheral blood.
It is interesting to note that the vast majority of the transcripts detected as significantly changed in PBMCs from RCC patients also possessed the highest variability across the RCC PBMC profiles. This finding indicates that although the levels of these transcripts were significantly distinct from levels in normal PBMCs, there was relative heterogeneity of expression of these transcripts across RCC patients. This observation is similar to the recent finding by Whitney et al. (11) that variation in normal PBMC expression is lower than the variation in patients with diffuse large B-cell lymphomas and chronic lymphocytic leukemias. The results here demonstrate that this may also be the case for diseases of solid tumors. It will be of great interest to determine whether any of these disease-associated, yet highly variable, transcripts in RCC PBMCs will be correlated with any clinical categories of response, once clinical indices of outcome and follow-up are satisfactorily measured in these patients.
Our comparisons clearly identified transcripts, the expression levels of which were altered in PBMCs from RCC patients relative to PBMCs of disease-free volunteers. However, it was unknown whether PBMC profiles of patients with RCC tumors of different origin might be further distinct. In a subsequent analysis, we used a multiclassification approach to determine whether transcripts in PBMCs from RCC patients were significantly distinct between patients with renal tumors of different cellular origin (clear cell versus mixed versus papillary). Significance testing by random permutation revealed that no transcripts were significantly correlated (P < 0.05) in PBMCs of patients with specific tumor types within the RCC population (data not shown). Therefore, in this small study, expression profiles in the surrogate tissue PBMCs were not sufficiently distinct to allow classification of RCC tumor type on the basis of PBMC transcriptional profiles alone.
Previous studies have used high-sensitivity arrays or RT-PCR in the peripheral blood to diagnose tumor status (12 , 13) , based on the detection of transcripts derived from metastatic cells in the circulation. Although this was unlikely to contribute to the disease-associated gene set in these studies, we nonetheless formally explored this possibility and other potential biological bases for the disease associated gene set in PBMCs of RCC patients. None of the most highly induced transcripts in RCC tumors were detected in PBMCs from RCC patients, consistent with the hypothesis that shed tumor cells did not contribute to the disease-associated transcripts observed in RCC PBMCs. We also compared RCC PBMC profiles measured in vivo with those of ex vivo-activated CD4+ T cells in culture. RCC has been characterized as an immunogenic tumor similar to melanoma (14 , 15) . Comparison of profiles measured in CD4+ T cells stimulated ex vivo with anti-CD3/anti-CD28 antibodies to profiles measured in PBMCs isolated from RCC patients identified 14 transcripts commonly induced or repressed 2-fold or more. The genes commonly regulated in RCC PBMCs and in activated CD4+ T cells often showed quite significant induction ex vivo. These results support a hypothesis that the expression of at least a subset of the disease-associated genes observed in RCC PBMCs may result from an activation of circulating T cells and/or other leukocytes in response to the presence of the tumor. In a final comparison, we also identified another set of nine genes that were commonly regulated in PBMCs from advanced RCC patients and in PBMCs from patients with end-stage renal failure. Because the patients in this trial possessed adequate renal function (serum creatine <1.5 x upper limit of normal or a calculated creatinine clearance >60 ml/min) at the time of trial entry, we anticipated that very few, if any, alterations in PBMC profiles would be due to renal dysfunction. It is, however, possible that the regulation of this small subset of disease-associated transcripts detected in RCC PBMCs could be due to alterations in leukocyte expression profiles in response to early (as yet undetectable) renal dysfunction onset in the RCC patients.
Ongoing studies in our laboratory have demonstrated that PBMCs from RCC patients can be accurately distinguished not only from PBMCs of normal volunteers but also from PBMCs of patients with other types of solid tumors (prostate and head and neck cancer; data not shown). Using a multiclass approach, we have predicted RCC status using profiles in PBMCs with moderately high accuracy (70%), comparable with the overall accuracy achieved by Ramaswamy et al. (16) across a large database of primary tumor biopsy profiles. If these preliminary results are confirmed in a larger patient population and across multiple tumor types in individuals with nonadvanced cases of disease, it is possible that expression profiles in PBMCs could ultimately be used for diagnostic purposes.
As clinical pharmacogenomic analyses gain acceptance and become more commonplace in clinical trials, it is increasingly evident that microarrays will eventually be used as diagnostic devices. Recent consideration has been given to the use of microarrays as medical devices (17) . One of the important issues will be to establish a rigorous and numerically-based method for reporting expression "pattern" results from a diagnostic assay and how an associated reference range for that pattern will be calculated and reported (18) . We are currently using the weighted voting method described herein to collapse expression pattern results from many genes into a single numerical confidence score (9) . One important advantage of this method is that it reports a prediction strength score, indicative of the confidence in the prediction for each patient. A confidence threshold can, thus, be established to optimize the accuracy of prediction and minimize the incidence of both false positive and false negative results. In the future, average confidence scores collected for the accumulating pool of correctly diagnosed patients and correctly nondiagnosed disease-free individuals could be calculated, and a reference range of values, for the particular predictive gene set diagnostic in question, could be reported. Alternatively, a reference disease-free RNA standard could be run alongside the clinical RNA sample in question, although the requirements and the source of such a standard remain to be defined.
In summary, the present study has established that there appear to be disease-associated genes in the PBMCs of patients with RCC. On the basis of these data, it is possible that because PBMCs circulate throughout the bodily tissues, their expression profiles may serve as a sensitive indicator and physiological monitor of disease and health. If additional analyses bear these findings out, the genes identified here represent the foundation on which to build disease-specific gene sets that can be used as part of a molecular diagnosis of disease using peripheral blood. We suggest that global expression profiling of the peripheral blood will identify gene sets of limited size that may ultimately be developed into clinical assays. Additional research and larger patient populations will ultimately determine the exact identities of the transcripts in circulating leukocytes with the greatest predictive power in clinical diagnosis, and establish the limits and caveats associated with the ability to predict/distinguish each disease type using expression profiles in peripheral blood.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
1 Present address: Millennium Pharmaceuticals, 75 Sidney Street, Cambridge, MA 02139. ![]()
2 To whom requests for reprints should be addressed, at Wyeth Research, 1 Burtt Road, Andover, MA 01810. Phone: (978) 247-1156; Fax: (978) 247-1133; E-mail: mburczynski{at}wyeth.com ![]()
3 The abbreviations used are: RCC, renal cell carcinoma; PBMC, peripheral blood mononuclear cell; ppm, parts per million. ![]()
4 Internet address: www-genome.wi.mit.edu/cancer/software/genecluster2.html. ![]()
Received 3/26/03. Revised 6/10/03. Accepted 6/20/03.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
O. Galamb, F. Sipos, N. Solymosi, S. Spisak, T. Krenacs, K. Toth, Z. Tulassay, and B. Molnar Diagnostic mRNA Expression Patterns of Inflamed, Benign, and Malignant Colorectal Biopsy Specimen and their Correlation with Peripheral Blood Results Cancer Epidemiol. Biomarkers Prev., October 1, 2008; 17(10): 2835 - 2845. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Bosse, P. Mathieu, and P. Pibarot Genomics: The Next Step to Elucidate the Etiology of Calcific Aortic Valve Stenosis J. Am. Coll. Cardiol., April 8, 2008; 51(14): 1327 - 1336. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Kim, D. J. Dix, K. E. Thompson, R. N. Murrell, J. E. Schmid, J. E. Gallagher, and J. C. Rockett Effects of Storage, RNA Extraction, Genechip Type, and Donor Sex on Gene Expression Profiling of Human Whole Blood Clin. Chem., June 1, 2007; 53(6): 1038 - 1045. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Osman, D. F. Bajorin, T.-T. Sun, H. Zhong, D. Douglas, J. Scattergood, R. Zheng, M. Han, K. W. Marshall, and C.-C. Liew Novel blood biomarkers of human urinary bladder cancer. Clin. Cancer Res., June 1, 2006; 12(11): 3374 - 3380. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Kim, D. J. Dix, K. E. Thompson, R. N. Murrell, J. E. Schmid, J. E. Gallagher, and J. C. Rockett Gene expression in head hair follicles plucked from men and women. Ann. Clin. Lab. Sci., March 1, 2006; 36(2): 115 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Burczynski, R. L. Peterson, N. C. Twine, K. A. Zuberek, B. J. Brodeur, L. Casciotti, V. Maganti, P. S. Reddy, A. Strahs, F. Immermann, et al. Molecular Classification of Crohn's Disease and Ulcerative Colitis Patients Using Transcriptional Profiles in Peripheral Blood Mononuclear Cells J. Mol. Diagn., February 1, 2006; 8(1): 51 - 61. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Burczynski, N. C. Twine, G. Dukart, B. Marshall, M. Hidalgo, W. M. Stadler, T. Logan, J. Dutcher, G. Hudes, W. L. Trepicchio, et al. Transcriptional Profiles in Peripheral Blood Mononuclear Cells Prognostic of Clinical Outcomes in Patients with Advanced Renal Cell Carcinoma Clin. Cancer Res., February 1, 2005; 11(3): 1181 - 1189. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-H. Tan, C. G. Rogers, J. T. Cooper, J. A. Ditlev, T. J. Maatman, X. Yang, K. A. Furge, and B. T. Teh Gene Expression Profiling of Renal Cell Carcinoma Clin. Cancer Res., September 15, 2004; 10(18): 6315S - 6321S. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Xu, C.-T. Shu, E. Purdom, D. Dang, D. Ilsley, Y. Guo, J. Weber, S. P. Holmes, and P. P. Lee Microarray Analysis Reveals Differences in Gene Expression of Circulating CD8+ T Cells in Melanoma Patients and Healthy Donors Cancer Res., May 15, 2004; 64(10): 3661 - 3667. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |