| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Immunology |
1 Division of Hematology, Stanford University School of Medicine, Stanford; 2 Department of Statistics, Stanford University, Stanford; 3 Agilent Technologies, Palo Alto; and 4 University of Southern California/Norris Cancer Center, Los Angeles, California
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
DNA microarray technology provides a powerful tool to study differences in transcript abundance in parallel (6) . Typical microarray labeling procedures require 36 µg polyadenylated RNA or 20100 µg total RNA per cDNA microarray. This amount of RNA can only be reliably obtained from cell lines or tissue samples, which contain heterogeneous cell types. Because it is difficult to obtain sufficient material for microarray analysis from pure primary cells, RNA amplification methods have been developed. Among the currently available amplification methods, the most commonly used is T7 promoter-based linear amplification first developed by Eberwine (7) . In this report, we used a modified T7-based amplification protocol (developed by Agilent Technologies Inc.). Systematic evaluation of the reproducibility of this amplification protocol demonstrated strong correlations of >0.9, thus ensuring a high quality of amplification. This protocol enabled us to study small sort-purified populations of CD8+ T-cell subsets. We found subtle gene expression differences between T cells from melanoma patients versus those from healthy controls. These subtle differences were revealed using a new data transformation that stabilizes the variance across the whole set of expression levels. These results were confirmed via quantitative real-time PCR (RQ-PCR) analysis of the original, unamplified RNA samples. This represents an important step toward the elucidation of the mechanism of immune dysfunction in cancer.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Flow Sorting and Cell Treatment.
PBMCs were thawed and cultured (RPMI 1640 supplemented with 10% fetal bovine serum, penicillin, streptomycin, and glutamine) overnight at 37°C, 5% CO2. CD8+ T cells were enriched using RosetteSep (StemCell Technologies, Vancouver, British Columbia, Canada). The resulting cells were incubated with the antibodies CD8-phycoerythrin (Caltag, San Francisco, CA), CD45RA-Cy5PE (Phar-Mingen, San Diego, CA), and CD27-FITC (Caltag) at room temperature for 30 min. After staining, cells were washed and sorted using a FACSVantage (Becton Dickinson, San Jose, CA). CD8+CD45RA+CD27+, CD8+CD45RA+CD27, and CD8+CD45RA-CD27+ cells were collected separately using gates shown in Fig. 1
. Cell purity was confirmed after the sorting by analyzing each sorted fraction using a FACSCalibur (Becton Dickinson). Cells (100,000) from each fraction were homogenized into 1 ml of TRIzol (Invitrogen) and frozen at 80°C, with the addition of 10 µg of linear acrylamide (Ambion).
|
RNA Amplification, Labeling, and Hybridization.
To minimize potential amplification bias, we elected to use only one round of amplification in this study. The typical yield of amplified RNA (aRNA) using the Agilent amplification protocol is between 400 ng and 10 µg. The polyadenylated RNA was amplified using the Agilent Low Input Linear Amplification kit following the protocol described in the users manual for RNA Amplification and Fluorescent cDNA Synthesis. Briefly, total RNA (7.8 µl) was mixed with 1.2 µL T7 promotor primer. The primer and template were annealed by incubating the reaction at 70°C for 10 min and then placing the reaction on ice. The cDNA synthesis was performed by adding 1x First Strand Buffer, DTT, RNaseOut, deoxynucleoside triphosphates, and Moloney murine leukemia virus reverse transcriptase. The reaction was incubated at 40°C for 2 h. The enzyme was then inactivated by heating the reaction to 65°C for 15 min and transferring the reaction to ice. In vitro transcription was initiated by the addition of 60 µl of Transcription Master Mix containing 1x Transcription Buffer, DTT, nucleotide triphosphates, polyethylene glycol, RNaseOut, Inorganic Pyrophosphatase, and T7 RNA polymerase to the cDNA synthesis reaction (total volume 80 µl). The reaction was incubated at 40°C for 4 h. The aRNA was purified using Qiagen RNeasy mini spin columns, as described in the Agilent user manual. The eluted aRNA was dried under vacuum in a rotary dessicator. The pellet was resuspended in 15 µl of RNase-free water. The aRNA concentration was determined using the Nanodrop ND-1000, and aRNA was visually qualified using the Agilent 2100 BioAnalyzer (RNA 6000 Nano Labchip kit). The reference sample was Stratagene human universal reference and was amplified at the same time as the sample RNA using the same method. Individual reference aRNA reactions were pooled before labeling. After amplification, each aRNA sample was checked using the RNA Nano Labchip before being converted to fluorescently labeled cDNA (data not shown). The patient samples were labeled using Cy5-dCTP and the reference sample was labeled using Cy3-dCTP. After RNase A digestion, the Cy3- and Cy5-labeled cDNA targets were pooled and purified using Qiaquick PCR purification kit (Qiagen) following the Agilent recommended protocol. The eluted targets were dried under vacuum in a rotary dessicator and resuspended in 7.5 µl of water. Cot 1 DNA, control targets, and hybridization buffer were added to the target solution. After incubating at 98°C for 2 min, the target was cooled. Fluorescently labeled cDNA targets were hybridized onto Agilent Human 1 cDNA microarrays following the Agilent cDNA Microarray user manual.
Microarray Imaging and Data Analysis.
The microarrays were washed and dried following the user manual instructions and scanned on the Agilent dual laser DNA microarray array scanner. Data were corrected with regards to local background as implemented by the Agilent feature extraction software. Microarrays that were hybridized on the same day were labeled as belonging to the same batch. The median red and green background and foreground variables were transferred into the R software package (8)
. The data were first transformed batch by batch using a variance stabilizing procedure as described in the next section. Remaining batch effects were eliminated through subtraction of the median from each batch, thus aligning all of the batch medians to a common value. In some analyses, the cell type effect (naïve, effector, and memory) was also removed through an additive model to make the averages of all of the genes for each of the cell types zero in the residual data. Features that show little overall variability across arrays were filtered out by excluding the features that had a t-statistic of <1.6 in all of the cell type groups. This left a subset of 2150 features in the data set.
Data Normalization.
Many microarray studies have shown that the usual log ratios tend to have variances that change with gene expression intensity (9)
. This makes inference based on fold difference (10)
alone difficult due to heteroscedasticity. Also, the features in which the intensity of one or two channels is nonpositive have to be dismissed. The variance stabilizing normalization (9)
procedure we used avoids these drawbacks. It applies a transformation h to the raw intensities of both green and red channels to remove the dependency of the variance on the mean. The differences between the transformed values can be viewed as "generalized log ratios" because for high intensities, xi and xj,
![]() |
Using Bioconductors multitest, adjusted Ps were calculated using an implementation of the step-down procedure for multiple testing of Westfall and Young (11) . The adjusted Ps given to the genes that have marked differential expression between melanoma and healthy donors were rank ordered, and the 50 genes with the smallest were retained and submitted to the choice of discriminatory genes procedure detailed below.
Real-Time RQ-PCR.
To validate the hypotheses generated by the microarray study, quantitative RQ-PCR was performed. Five genes were chosen according to multiple testing procedures. The PCR primers specific to these genes were designed using Primer 3 software.5
All of the primers were designed with melting temperature 58-60°C and resulting products between 100 and 150 bp. For each healthy donor and melanoma patient, a small aliquot of total RNA extracted from each CD8+ subset was saved for quantitative PCR (separate from the total RNA used for linear amplification). cDNA was transcribed from total RNA using oligodeoxythymidylate primer and Superscript II (Invitrogen) in 20 µl of reaction volume. One µl of cDNA was then carried out in triplicate using iQ SYBR Green Supermix (Bio-Rad). The PCR conditions were as follows: 7 min at 95°C for initial denaturing, followed by 40 cycles of 95°C for 30 s, 55°C for 30 s, and 72°C for 30 s in the Bio-Rad iQ real-time sequence detection system. Levels of each gene were normalized to expression levels of a housekeeping gene, glyceraldehyde-3-phosphate dehydrogenase (GAPDH).
The primers for each gene are: FLJ22059 clone, 5'-GGAGACCATAG-CAGCGAGTC-3'5'-TCTTCCCACATGGACATGAA-3'; DDB2, 5'-CTCCTCAATGGAGGGAACAA-3'5'-GTGACCACCATTCGGCTACT-3'; N-myristoyltransferase 2, 5'-GAACATTGATGAGGCTGCAA-3'5'-ACCC-TGTGGCAAAGAATACG-3'; Galectin-1, 5'-GGAACATCCTCCTGGAC-TCA-3'5'-CAGGTTGTTGCTGTCTTTGC-3'; Hypothetical Protein 669, 5'-ATGGAAAAGCCATCAAGGTG-3'5'-GGTTCCTCCACTTCCTCCT-C-3'; Paraspeckle 1, 5'-CTACGGATTCGCTTCGCTAC-3'5'-CGATCAT-CCACAACCACAAC-3'; and GAPDH, 5'-CAGCCTCAAGATCATCAGCA-3'5'-GTCTTCTGGGTGGCAGTGAT-3'.
| RESULTS |
|---|
|
|
|---|
Quality of RNA Extraction, Linear Amplification, and Labeling.
To determine the amount, quality, and reproducibility of RNA extracted from small numbers of cells, we sorted various numbers of CD8+ T cells from healthy donor PBMC ranging from 50,000 to 500,000 cells.
We optimized the cell sorting conditions to maintain RNA integrity and the RNA isolation method to minimize RNA degradation and genomic DNA contamination. Genomic DNA contamination can become problematic when isolating RNA from small cell numbers because the fraction of genomic DNA relative to the percentage of RNA increases with decreasing cell number. One of the main problems with genomic DNA contamination is elevation of the RNA concentration when using UV spectrophotometry, because RNA and genomic DNA absorb at the same wavelength. Because the RNA yield was too low to measure by UV spectrophotometry, an estimation of RNA concentration was determined using the Bioanalyzer Pico assay software, which enabled us to compare RNA yields from each patient sample to ensure that the yields were comparable (data not shown). It was estimated that the average yield of total RNA from 100,000 cells was 2060 ng based on the concentration reported by the RNA Pico Assay software and generating a standard curve from a sample of known concentration. The limited amount of sample RNA did not allow for replicate hybridizations of each sample. Thus, we performed control experiments to test the reproducibility of our sample labeling method. RNA was isolated from 200,000 T cells (unsorted) and divided into several aliquots. Each sample was linearly amplified and then converted to fluorescently labeled cDNA. The system noise was determined using self-self hybridizations in which the same sample was labeled in both cyanine-3 and cyanine-5. In Fig. 2A
, data points represent the red signal versus the green signal for each feature on the microarray. The data converged to a tight line corresponding to the expected log ratio of 0, indicating that the signal from the two individual labeling reactions is nearly identical. Less than 1.5% of the genes on the microarray were identified as genes differentially expressed (P < 0.01), indicating the low level of system noise. To additionally check that the difference in the red and green dyes was not introducing additional variability, we performed dye swap experiments and analyzed the combined plot using the Rosetta Resolver system shown in Fig. 2B
. The scatter plot shows that 97% of the genes were correlated, with 2773 genes showing strong correlations in both polarities. The anticorrelation was <1% (pink crosses).
|
Discriminant Analysis between Healthy and Melanoma T Cells.
Gene expression data were extracted from each hybridized array using the Agilent Feature Extraction software with Lowess background correction. After the variance stabilizing normalization transformation, an initial analysis of these data showed no statistically significant differences in gene expression between T cells from healthy and melanoma patients. Additional analysis suggested that this may have occurred due to large gene expression differences among naïve, effector, and memory T cells, which could mask potentially more subtle differences between melanoma and healthy T cells. To maximize our ability to detect subtle differences in gene expression, we thus removed the cell type effects and pooled the gene expression data from all three of the T-cell subsets from each subject using a simple linear model on the transformed data. Next, the Westfall and Young multiple testing procedure was applied to detect any differences between T cells from melanoma patients and healthy donors. A subset of 50 genes with the smallest Ps was selected by taking the 50 features with smallest adjusted Ps. These were used as the input into a discriminant analyses (a simple, efficient supervised learning procedure; see Ref. 12
), with cross-validation that chose the best subset for discriminating the arrays into healthy and melanoma groups with low classification error.
Eleven features (10 genes) were found to consistently discriminate between T cells from melanoma patients versus healthy controls with adjusted Ps
0.05 (Table 1)
.
|
2-fold for the galectin-1 gene.
Hierarchical Clusterings.
A hierarchical clustering of the data was performed using the mva package in R. The results are displayed here for the 11 retained features.
Fig. 3A
shows that this smaller subset of genes classifies healthy donors and melanoma patients into two groups. To find the most discriminatory genes, we used a cross-validation method (13)
. This method consists in removing one observation (melanoma patient or healthy patient), calculating a new discriminating rule on the 29 remaining patients, and then using the new rule to attribute the deleted observation to a class (melanoma or healthy) and recording whether the observation was actually well classified by the rule. This is repeated 30 times, once for each observation, and the prediction error of the discriminating rule is assigned the percentage of improperly classified observations. We took all subsets of genes of size <20 from the 50 genes prefiltered to have the smallest multiple testing Ps and chose the subset with the best cross-validated prediction score. This turned out 11 features (10 genes). This particular subset gave a cross-validation score of 100% of the observations well classified, encouraging us to think that these 10 genes were indeed good predictors of T cells from melanoma versus healthy subjects. The expression for each gene in this subset was significantly different between healthy donors and melanoma patients (P < 0.05). These data confirm that CD8+ T cells in melanoma patients have subtle differences in gene expression from CD8+ T cells from healthy donors.
|
Gene Confirmation with Real-Time Reverse Transcription-PCR.
To confirm the significance of the genes identified to be differentially expressed using microarrays, we performed RQ-PCR analysis using the original, unamplified RNA materials. CD8+ T-cell subsets were not combined in these experiments. Due to limited quantities of unamplified RNA, 6 of 10 genes were selected for RQ-PCR analysis: DDB2, PSPC1, FLJ10955, Galectin-1, N-myristoyltransferase, and Hypothetical Protein 669. A housekeeping gene, GAPDH, was also amplified for normalization of data. The threshold cycle for each sample is calculated by iCycler Real Time Detection System. After normalization with GAPDH, the threshold cycle of the 6 genes was analyzed using a Wilcoxon test. As shown in Table 2
, significant differences (P < 0.05) were found in at least one CD8+ T-cell subset between melanoma patients and healthy donors confirming results from the microarray data. The only exception was expression of gene FLJ22059, which did not show significant differences between healthy and melanoma in any of the three subsets.
|
| DISCUSSION |
|---|
|
|
|---|
Most systematic studies use serial dilution of standard RNA or serial dilution of total RNA extracted from large numbers of cells, so the RNA quality is easy to control. To keep the integrity of RNA, we optimized the sorting condition to prevent RNA degradation in our small, individually sorted T-cell samples. From 100K sorted CD8+ T cells, 2060 ng total RNA were isolated. Of the extracted RNA, 10% was measured using Bioanalyzer, ensuring the quality of the RNA used for amplification. After one round of amplification, the yield of antisense RNA from the 100K CD8+ T cells was
1 µg available for the labeling and hybridization steps.
Another important aspect of amplification is the degree of reproducibility. We showed that the correlation coefficient between individual hybridization for aRNA amplified from the same total RNA is high using the Agilent amplification protocol. As for the fidelity of T7-based linear amplification, there are several publications showing that the correlation coefficient between amplified and unamplified samples range from 0.83 to 0.86 (22) . We found similar results in our experiments (data not shown). Bias as compared with unamplified materials is unavoidably induced when samples are amplified. Importantly, bias introduced into gene expression by amplification is reproducible and systematic (17 , 23) , hence allowing for reliable comparisons of amplified samples against each other. Gene expression profiling using aRNA provides an approximation of the true expression profile of the original sample. Although duplication is always desirable, for clinical samples it is often difficult to obtain sufficient aRNA to hybridize onto two arrays. Data from our dye swap and self-to-self experiments show high correlation coefficients.
Most microarray experiments performed thus far have been on heterogeneous cell populations, such as tumor biopsies (Ref. 6 ; which may contain tumor cells, stromal cells, immune cells, and other contaminants), and PBMC (Refs. 24 , 14 ; which contain CD4+ T cells, CD8+ T cells, B cells, natural killer cells, monocytes, and some granulocytes). With such heterogeneity in cells, it is impossible to determine from which cells certain gene expression differences arise. In this study, to maximize the resolution of our data, we focused on CD8+ T cells only, and sorted them into naïve, effector, and memory subsets and hybridized these individually on microarrays. However, because our sample size was small (5 healthy donors and 5 melanoma patients), we removed the cell subset effect in our initial microarray data analysis, that is, we normalized the median of expression of all three of the subsets. This eliminated the subset effect and allowed us to pool the three subsets to look for other more subtle differences than those depending on cell type. Importantly, we confirmed our microarray data using quantitative (real-time) PCR on unamplified RNA extracted from individual T-cell subsets.
On the basis of the analytical model in this study, 11 features (10 genes) provide a robust classifier between CD8+ from melanoma patients and CD8+ from healthy donors. In other words, the combination of these 10 genes is more reflective of molecular behavior of the CD8+ in melanoma patients. Because T cells in cancer patients are known to undergo higher rates of apoptosis, we specifically looked for differential expression of genes classical linked to apoptosis. We generated a list of 86 genes commonly associated with apoptosis, including caspases 19, bcl-2, Fas (CD95), and many others. Importantly, the same discriminant analysis applied to this sublist of apoptosis genes did not yield a set that could differentiate between the melanoma and healthy patients.
Possible mechanism(s) underlying T-cell abnormalities in cancer patients include factors secreted by tumor cells (e.g., tumor necrosis factor
), cell-cell contact (e.g., Fas-Fas ligand), and repeated stimulation leading to activation-induced cell death. The classical apoptosis pathway involves a family of cysteine proteases, known as caspases, which cleave many vital cellular proteins and proteolytically activate enzymes involved in cell death (25)
. Several genes we found to be differentially expressed in T cells of melanoma versus healthy subjects may be linked directly or indirectly to apoptosis. Galectin-1 is a galactoside binding protein that has been linked to apoptosis in T cells (26, 27, 28)
. Damage-specific DNA binding protein 2 mediates damage repair to DNA; recent evidence suggests that DNA binding protein 2 interacts with p53 in regulating apoptosis (29)
. The addition of the carbohydrate moiety N-myristoyl to certain apoptosis regulators, such as BID (30)
, could modulate their activity. Hence, the enzyme N-myristoyltransferase 2 may play an indirect role in apoptosis. Ezrin (villin) is a key protein in membrane-cytoskeleton interaction, and is involved in membrane polarization and shape modulation. Ezrin has been shown to be involved in CD95-mediated apoptosis by modulating CD95 linkage to the actin cytoskeleton (31)
. Whereas the function of PSPC1 is unknown, it belongs to the same family of proteins as p54/nrb, which has been linked to CD95-induced apoptosis in T cells (32)
.
Importantly, we could not find evidence for the differential expression of classical apoptosis mediators such as caspases, Fas/Fas ligand, bcl-2, bcl-xL, BH3, Bim, Bax, Bak, FLIP, FADD, and Apaf-1, by T cells from melanoma versus healthy subjects (as shown in Fig. 3
). This raises the possibility that a secondary (alternate) apoptosis pathway may be triggered in circulating T cells of cancer patients. Of note, as many caspases are already present in cells as catalytically dormant proenzymes (zymogens). which are activated in response to stimuli that trigger apoptosis, perhaps it is not unexpected that transcription of some of these genes is not altered in cells undergoing apoptosis. Nonetheless, tumor necrosis factor
(which is known to be secreted by many cancers) has been reported to trigger a secondary (noncaspase mediated) apoptosis pathway. Tumor necrosis factor
seems to trigger cathepsin B and L (not caspase), which activates apoptosis through the mitochondria.
Microarrays may emerge to be a powerful new tool to elucidate the mechanism of cancer-induced immune dysfunction. Our approach aimed to minimize spurious sources of variation such as heterogeneity in cell populations and dye effects. By using a variance stabilization transformation, we were able to detect subtle differences in the expression of a small set of genes that differentiate CD8+ T cells from healthy and melanoma patients at an
2-fold level. Importantly, we were able to confirm these microarray results by RQ-PCR experiments. Taken together, our data support previous observations of T-cell apoptosis in cancer patients and raise the possibility of an alternative apoptosis pathway being triggered in T cells by cancer. Our data also suggest that different T-cell types may be impacted differently by cancer, and thereby illustrates the benefit of gene expression analysis of purified cell populations.
| FOOTNOTES |
|---|
Requests for reprints: Peter P. Lee, Stanford University, Center for Clincal Sciences Research, Room 1155, 269 Campus Drive, Stanford, CA 94305. Phone: (650) 498-7942; E-mail: ppl{at}stanford.edu
5 Internet address: http://www-genome.wi.mit.edu/cgi-bin/primer/primer3_www.cgi. ![]()
Received 10/29/03. Revised 2/11/04. Accepted 3/ 5/04.
| REFERENCES |
|---|
|
|
|---|
T cell subsets defines distinct immunoregulatory phenotypes and unexpected gene expression profiles. J Immunol, 170: 356-64, 2003.This article has been cited by other articles:
![]() |
C. A. Learn, P. E. Fecci, R. J. Schmittling, W. Xie, I. Karikari, D. A. Mitchell, G. E. Archer, Z. Wei, H. Dressman, and J. H. Sampson Profiling of CD4+, CD8+, and CD4+CD25+CD45RO+FoxP3+ T Cells in Patients with Malignant Glioma Reveals Differential Expression of the Immunologic Transcriptome Compared with T Cells from Healthy Volunteers Clin. Cancer Res., December 15, 2006; 12(24): 7306 - 7315. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Holmes, M. He, T. Xu, and P. P. Lee Memory T cells have gene expression patterns intermediate between naive and effector PNAS, April 12, 2005; 102(15): 5519 - 5523. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |