| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Clinical Research |
1 Radiation Oncology Branch, National Cancer Institute; 2 National Cancer Institute-Food and Drug Administration Clinical Proteomics Program, Laboratory of Pathology; 3 Laboratory of Tumor Immunology and Biology, Center for Cancer Research, National Cancer Institute, NIH-Department of Health and Human Services; 4 National Cancer Institute-Food and Drug Administration Clinical Proteomics Program, Office of Cellular and Gene Therapy, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, Maryland; 5 Radiation Medicine Program, Princess Margaret Hospital, University of Toronto, Toronto, Ontario, Canada; 6 Department of Radiation Oncology, McGill University, Montreal, Quebec, Canada; and 7 Science Applications International Corporation-Frederick, Inc., Clinical Proteomics Reference Laboratory, National Cancer Institute Frederick, Gaithersburg, Maryland
Requests for reprints: Kevin Camphausen, Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH-Department of Health and Human Services, Room B3B69, Building 10, 9000 Rockville Pike, Bethesda, MD 20892. Phone: 301-496-5457; Fax: 301-480-5439; E-mail: camphauk{at}mail.nih.gov.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The long-established gold standard biomarker of radiation exposure, cytogenetic analysis of peripheral blood lymphocytes, requires complex analysis by skilled workers (5, 11, 12). A number of alternative minimally invasive strategies have been explored, including the measurement of specific cytokines and metabolites in tissues and body fluids (4), micronucleus assays (13), and germ line radiationinduced apoptosis rates (2). More recently, a need for comprehensive profiling in the field of biomarker discovery has been recognized (10, 14). Clinical studies to date have focused on genomic profiling of circulating peripheral blood lymphocytes using microarray technology (1517).
Although the ability to distinguish cancers from the profile of circulating protein constituents of serum was first shown in 1978 (18), only recent technological advances have led to robust techniques that permit high-throughput and comprehensive analysis of serum proteins (19). These techniques, including surface-enhanced laser desorption and ionization time of flight (SELDI-TOF) mass spectrometry (MS; ref. 20) and complementary mass spectroscopic fractionation (21), enable the study of global expression changes following pathologic events and physiologic stresses. As such, they hold promise as a comprehensive technique for biomarker discovery. Recently, it has been shown that most of the low-molecular-weight molecules that underpin the SELDI-TOF MS profiles exist bound to larger highly abundant proteins, such as albumin (22).
Thus, we sought to explore the merit of proteomic profiling strategies in patients with cancer before and during radiotherapy in an effort to discover clinical biomarkers of radiation exposure. We also sought to further characterize the information content of the spectral components using an albumin enrichment strategy of the same serum samples used in the high-throughput discovery series. There are three hypotheses tested in this study: First, proteomic profiling methods can discover a discriminating set of low-molecular-weight biomarkers that, in turn, can be used as a pattern-based diagnostic of human exposure to ionizing radiation. Second, a serum proteomic pattern diagnostic can discriminate a low dose-volume radiation exposure from a high dose-volume exposure. Last, MS identification of the same information archive that underpins the observed MS peaks will reveal important clues to the identity and nature of the discriminatory entities.
| Materials and Methods |
|---|
|
|
|---|
Specimen processingSELDI-TOF. Aliquots of fresh frozen sera were sent to the National Cancer Institute (NCI)-Food and Drug Administration Clinical Proteomics laboratory. The pretreatment and posttreatment samples all had been collected and handled in an exact identical manner, not previously thawed to minimize any potential for collection and sample handling bias. Samples were thawed and 10 µL aliquots were obtained and used immediately for high-resolution SELDI-TOF analysis. Ciphergen (Fremont, CA) immobilized metal ion-affinity chromatography 3 protein arrays were used for this study. All steps were carried out on a Tecan Genesis 2000 robotic processor (Tecan, Research Triangle Park, NC) equipped with a 96-position Temo unit to reduce any potential for bias due to variation in sample preparation. The Ciphergen Bioprocessor unit was used to hold the protein arrays in place for all steps in the process. The patient-matched preexposure and postexposure samples were randomized and comingled on the same chip surface to reduce any bias that may occur due to run order and daily spectrometer fluctuations. The surfaces were activated with two applications of 100 µL per spot of 50 mmol/L nickel sulfate (Sigma, St. Louis, MO) incubated for 5 minutes per application. After aspiration, the surfaces were washed twice with 100 µL of deionized water. Dulbecco's PBS without calcium and magnesium chloride (Invitrogen, Grand Island, NY) with 0.1% Triton X-100 (Sigma) was added to each spot with a volume of 100 µL per spot and incubated for 5 minutes, aspirated, and discarded. This was repeated a second time. The chip surfaces were then dried using vacuum aspiration to remove any excess liquid. A total of 5 µL of undiluted serum was added to each spot and incubated for 20 minutes. The wash steps were carried out on the Tecan processor as follows: 100 µL of PBS was added and mixed by aspiration and dispensing. This was done for three cycles. Sterile deionized water was added and mixed by aspiration and dispensing. Remaining liquid was removed by vacuum aspiration and the chips were allowed to dry completely before the bioprocessor gasket was removed. Once removed, 1 µL matrix consisting of 12 mg/mL cinnamic acid (Fluka, St. Louis, MO) in 50% acetonitrile (Sigma) and 0.5% trifluoracetic acid (Sigma) was applied by the Tecan processor using a fixed pipette tip and a serpentine path to each spot on the arrays. The arrays were then dried at room temperature for 5 minutes and the matrix was reapplied. After the chips were completely dried, they were read on an ABI Q-Star mass spectrometer equipped with a Ciphergen PCI 1000 Protein Chip Interface as previously described (23).
Data analysisSELDI-TOF. High-resolution MS data were derived by the unexposed and exposed blood draws from 68 cancer patients for an initial total of 136 spectra. Regions of spectra known to be dominated by matrix effect (<1,000 Da) or terminal measurement effects (>11,000 Da) were excluded a priori. Quality assurance/quality control (QA/QC) analyses resulted in the elimination of one spectrum from the unexposed group due to low average amplitude and total ion counts. The QA/QC tools and procedures have been previously described (23). Quartile values were computed for each spectra. Initial aggregate pattern analyses using visual data mining techniques showed interesting groupings involving the most abundant ions (third quartile or greater). Spectral data below this threshold were excluded. High-resolution mass spectral data were then binned as a function of the nominal mass values, while generating a variety of statistical measures for the captured elements (mass and amplitude values), including, means, SDs, variances, and sums.
An amplitude-based concept hierarchy was constructed for each spectra based on percentage contributions. Normalization was achieved through transforming the summed amplitude values through the concept hierarchy mapping scheme (24). Aggregate pattern analyses were done by three-dimensional visual data mining techniques to explore for regions of interest. An example of this is illustrated in Fig. 1A. Given the heterogeneity present in the data due to large variety of diseases/cancers (18), as well as a wide range of both anatomic sites (9) and degree of radiation received, the visual data mining activities aided greatly in the identification of interesting ion patterns for additional pursuit (25). Custom T-SQL stored procedures were written to further aid in feature selection through the implementation of a minimax algorithmic procedure to maximize the differences between candidate ion groups while minimizing their coefficient of variation.
|
These computational techniques were also used to construct supervised predictive classifiers to discriminate high (n = 32) versus low (n = 36) dose-volume radiation exposure from the exposed group. The method for dividing exposure dose-volumes into high and low was done by a statistical stratification method using quartile analysis of dose-volume measurements coupled to aggregation of both cancer type and site of radiation. This revealed a natural cut in the dose-volume measurements at 190 Gy/m2; therefore, measurements >190 Gy/m2 were recorded as a high exposure and the remainder as low. In the low dose-volume classification, the mean dose-volume was 126 Gy/m2 (range 44-190); in the high dose-volume classification, the mean dose-volume was 350 Gy/m2 (range 200-1,050). Attention to disease/cancer type and anatomic site of radiation was taken to avoid bias. For instance, there are 18 prostate cancer patients in the low dose-volume group, and 15 in the high dose-volume group. In this study, the units for dose-volume measurements are Gy x fraction body volume exposed to >50% radiation dose. This definition was chosen a priori and assumes that both radiation dose and irradiated volume contribute equivalently to changes in the serum proteome.
Deliberate steps were taken to produce the most general predictive models and to avoid overtraining. To this end, strategies used included the selection of overtraining prevention options, the use of global pruning with a pruning severity of 75%, requiring minimum records per child branch to be greater than or equal to a threshold value (i.e., 4), and winnowing of attributes for a more parsimonious solution. To enhance predictive accuracy of the decision tree models, boosting was used.
Visual data mining techniques were also used in an iterative fashion as candidate ions were meticulously winnowed. In many cases, candidate ion sets, whether they were considered for promotion or elimination, were further analyzed by high-resolution visual data mining methods. These techniques use a high-end PC graphics card (NVIDIA Quardro FX 3000, NVIDIA Corp., Sunnyvale, CA) coupled to 9-megapixel high-resolution IBM T221 computer graphics monitor (IBM, Armonk, NY). Thus, high-resolution mining of extremely complex data sets may be addressed. Figure 1B illustrates aspects of a three-dimensional visual data mining exercise involving an intermediate set of candidate ions.
Both a high-end computer workstation and server were used to accomplish the aforementioned computational tasks. These are located at the NCI, Center for Cancer Research, Clinical Proteomics Reference Laboratory (CPRL, Gaithersburg, MD). The server consists of dual processor AMD Opertron CPU module (AMD Corporation, Sunnyvale, CA) housed on a Newisys motherboard (Newisys Corp, Austin, TX) fitted with a total of 16 GB of main memory. Server disc capacity consists of an Adaptec RAID housing assembly containing twelve 146 GB IBM disc drives in a RAID 5 configuration, allowing one disc for parity checking and one disc for a hot spare. Thus, the effective storage capacity for this server at the CPRL is
1.3 TB. Following CPRL specification, the server system was custom built by Colfax International (Sunnyvale, CA).
The server computer operating system is the Windows 2003 Advanced Server (Microsoft Corp, Redmond, WA). Database activities are done through the use of the Microsoft SQL Server 2000 Enterprise Edition. Computer workstations at the CPRL consist of one to two Xeon processors (Intel Corporation, Santa Clara, CA), with 2 to 4 GB of memory, running the Windows XP Professional operating system. Visual data mining activities are done on workstations outfitted with high-end graphics cards and displays. Data intensive computational tasks are off-loaded to the server.
MS-MS protein identification of pooled sera. From the original study set, a subset of 25 patients was randomly selected and their serum was pooled for MS-based sequencing and peptide identification from samples obtained before and during radiation therapy.
Albumin and associated low-molecular-weight bound peptide purification was done in the following fashion and has been further described previously (22). Forty microliters of preradiation or postradiation treatment pooled serum (
5 mg protein) were diluted to 200 µL with an equilibration buffer (Millipore) and run twice through a Montage (Millipore, Billerica, MA) albumin-specific affinity column. The bound protein was washed thoroughly and eluted from the column by equilibrating with 70% ACN/30% H2O/0.2% trifluoroacetic acid for 30 minutes, followed by a slow spin-through of the elution mixture. The eluate was lyophilized to <10 µL in a HetoVac rotofor (CT 110) and reconstituted in a 95% H2O/5% ACN/0.1% formic acid buffer (buffer A). Samples were sometimes desalted with a ZipTip cleanup or with Vivaspin 500 centrifugal membranes and always reconstituted in a 1:1 mixture of water and SDS sample buffer (20 µL total volume).
Analysis then proceeded with one-dimensional gel separation and digestion. Twenty microliters of sample in SDS sample buffer (40 µL of original serum) were boiled for 5 minutes at 95°C and run on a one-dimensional precast gel (4-20% Tris-glycine or 4-12% bis-Tris) to separate albumin from the low abundant proteins/peptides/fragments of interest. The gel was stained with Coomassie blue for 1 hour and destained overnight in 30% methanol/10% acetic acid solution. The entire lane containing stage-specific serum proteins was excised from the gel and finely sliced into very small molecular weight regions (
40 slices/lane). Gel bands were reduced and alkylated with 10 mmol/L DTT and 55 mmol/L iodoacetamide, incubated at 4°C for 1 hour in porcine trypsin (20 ng/µL; Promega) and allowed to digest overnight at 37°C in 25 mmol/L NH4HCO3. The following morning, peptides were extracted from the gel with a 70% ACN/5% formic acid solution.
Samples were lyophilized to near dryness and reconstituted in 6.5 µL of buffer A for mass spectrometric analysis. Microcapillary reverse phase liquid chromatography (LC)/MS/MS analysis was done with Dionex's LC Packings LC system (Dionex, Sunnyvale, CA) coupled online to a ThermoFinnigan LCQ Classic ion trap mass spectrometer (ThermoFinnigan, San Jose, CA) with a nanospray source. Reverse phase separations were done with an in-house, slurry-packed capillary column. The C18 silica-bonded column is 75 µm ID, 360 µm OD, 10-cm-long fused silica packed with 5 µm beads with 300 Å pores (Vydac, Hesperia, CA). A µ-precolumn PepMap C18 cartridge (Dionex) acts as a desalting column. Sample is injected in microliter pick-up mode and washed with buffer A for 5 minutes before a linear gradient elution with buffer B (95% ACN/5% H2O/0.1% formic acid) up to 85% over 95 minutes at a flow rate of 200 nL/min. Full MS scans are followed by three MS/MS scans of the most abundant peptide ions (in a data-dependent mode) and collision-induced dissociation is done at a collision energy of 38%.
Data analysis was done by searching MS/MS spectra against the European Bioinformatics Institute of the nonredundant proteome set of Swiss-Prot, TrEMBL, and Ensembl entries through the Sequest Bioworks Browser (ThermoFinnigan). Peptides were considered legitimate hits after filtering the correlation scores and manual inspection of the MS/MS data (2630).
Accepted peptide hits are required to have an Xcorr ranking = 1 relative to all other peptides in the database. The albumin extraction, gel electrophoresis, protein digestion/extraction, and LC/MS/MS analysis was repeated in six distinct trials (three for preradiation treatment and three for postradiation treatment)each time yielding diminishing returns of new identifications for low abundance peptide hits. Repetitive sequencing of identical peptides in multiple trials further validates our experimental procedureboth within and between radiation treatment stages.
| Results |
|---|
|
|
|---|
|
|
|
MS-MS protein identification of pooled sera. A total of 182 protein fragments were identified in this study, 82 of which were uniquely identified in pooled serum before patients were exposed to ionizing radiation, and 23 of which were uniquely identified in pooled serum while patients were being exposed to ionizing radiation (Table 4ions listed in alphabetical order). The remaining 77 protein fragments were identified both before and during radiation exposure (data not shown).
|
| Discussion |
|---|
|
|
|---|
In this study, we opted to analyze serum samples in an aggregate manner from a population of patients with known exposure to a wide range of therapeutic doses of ionization radiation, over varying time intervals, and at different body sites and volumes of irradiation. This study design was chosen with the overarching goal of identifying serum protein markers of radiation exposure that are independent of volume and body site of exposure at dose levels that may have relevance to moderate dose whole body exposures.
Challenges unique to this particular study include a large degree of inherent heterogeneity due to two factors. First, the 68 patients exhibit 18 different medical diagnoses, which are predominately different forms of cancer. Second, there are nine different anatomic sites of radiation exposure/delivery. Nonetheless, deliberate computational strategies were used to attempt to model and understand the innate heterogeneity, while also maximizing robustness of classification methods. This was accomplished mainly through visual data mining activities that allowed for aggregate pattern analysis and discovery of key ion groupings, including an initial observation indicating a discriminatory ability in the most abundant ions (likely influenced by injury response and immune mediated events), as well as the use of n-fold cross-validation strategies for the testing of classifier error statistics. Novel visualization techniques facilitated aggregate spectra pattern analyses as well as reduced data set strategies, thus enabling study of cohort heterogeneity.
As a secondary objective, we provide preliminary evidence of a dose-response relationship to the proteomic changes induced with radiation exposure. This early data must be interpreted with caution given the small sample size and lack of a patient control study design for this end point. Future studies in larger groups of patients are warranted to determine the linearity of dose-volume effects and the time course of serum proteome changes after exposure.
To further characterize the nature and identify of the SELDI-TOF profiles, we used a new method of analyzing the albumin-bound peptide archive (22). The molecules that comprise the SELDI portraits have been recently shown to emanate from carrier protein binding, which serves to amplify and enrich low-molecular-weight peptides that would otherwise be efficiently cleared by glomerular filtration (22). Two pooled sera sets, representing a randomly obtained subset from patients before and during exposure, were used for this study. The lower abundance proteins uniquely identified in pooled serum may be secreted or shed by cells as a result of signaling, necrosis, and/or apoptosis of cells in the high-dose region. In fact, one of the entities identified by this approach and that may be a component of the SELDI-TOF spectra is a fragment of interleukin (IL)-6 that was uniquely measured only in the postexposure pools. This finding is consistent with known cellular effects of ionizing radiation at the transcriptional level. A major transcription factor known to be activated by ionizing radiation is nuclear transcription factor-
B (NF-
B; refs. 33, 34). Genes induced by NF-
B following irradiation include intercellular adhesion molecule-1, galectin-3, tumor necrosis factor-
(TNF-
), IL-1ß, and IL-6. In a study by Fedorocko et al. (35), mice exposed to a single 9 Gy dose of radiation were found to have elevated serum levels of IL-6 and TNF-
peaking at 5 days following exposure. This is further supported by a clinical study of patients receiving radiotherapy to their liver, where IL-6 was found to be variably elevated in serum at 24 and 48 hours postexposure (7).
The SELDI-TOF method described in this study has been optimized and adapted to a robot for the purpose of reproducibility and enabling of a high-throughput test. This gives precision of pipetting, reproducibility of pipette dispensing position, and ensures accurate timing. In addition, the use of barcodes on both samples and Ciphergen arrays allows for traceability of sample location. The time for processing a serum sample is
2.5 hours. Finally, once a classifier is formulated and validated, the computational or wall time required to classify an unknown spectrum is a few milliseconds.
The low-molecular-weight serum proteome is a newly discovered information archive reflective of metabolism and pathologic events. In the recent past, it has been successfully exploited for early diagnosis of ovarian cancer as well as diagnosis of other cancers. This study adapted these established principles for the successful discrimination of both radiation exposure as well as the degree of radiation exposure. Applications of such a high-throughput blood test may include personalized proteomic measures of clinical response to radiation therapy and rapid response measures for populations exposed to moderate doses of radiation. Further refinement of the methods reported in this study may lead to personalized molecular medicine approaches directly applicable to adjuvant cancer care (36). Instead of categorical radiation dosing techniques, a strategy using a serum proteomic test, with the aim to better quantify the biological effect of serial exposures, may better avoid toxicity while maximizing therapeutic efficacy.
| Conclusion |
|---|
|
|
|---|
| Acknowledgments |
|---|
Received 9/27/05. Revised 11/10/05. Accepted 11/29/05.
| References |
|---|
|
|
|---|
irradiation. Mol Cancer Res 2003;1:44552.
B. J Clin Invest 1991;88:6915.[Medline]This article has been cited by other articles:
![]() |
Y. Miura, M. Kano, M. Yamada, T. Nishine, S. Urano, S. Suzuki, T. Endo, and T. Toda Proteomic Study on X-irradiation-responsive Proteins and Ageing: Search for Responsible Proteins for Radiation Adaptive Response J. Biochem., August 1, 2007; 142(2): 145 - 155. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |