| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Cell, Tumor, and Stem Cell Biology |
1 Department of Obstetrics and Gynecology, 2 Microarray Facility, Department of Medical Genetics, University of Tuebingen, Tuebingen, Germany; 3 Department of Surgery, Indiana University School of Medicine, Indianapolis, Indiana; and 4 Center of Bioinformatics, and 5 Department of Pathology, University of Tuebingen, Tuebingen, Germany
Requests for reprints: Hans J. Neubauer, Tumor Progression Lab, Gynecological Hospital, Calwerstrasse 7/6, 72076 Tuebingen, Germany. Phone: 49-7071-298-5380; Fax: 49-7071-29-4947; E-mail: hans.neubauer{at}med.uni-tuebingen.de.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The goal of this study was to identify genes that provide an understanding of the molecular biology of DCIS and to obtain genes marking the transition of stationary epithelial cells to migrating invasive cells. In order to reduce the false-positive rate caused by genetic variability, we used patient-matched DCIS/IDC tumor samples.
| Materials and Methods |
|---|
|
|
|---|
Laser capture microdissection. Serial frozen tissue sections (10 µm) were fixed and rehydrated sequentially in decreasing concentrations of ethanol and RNase-free water followed by H&E staining. Finally, sections were dehydrated in 95% and 100% ethanol and 8 minutes in xylene. The PixCell IIe LCM System (Arcturus Engineering, Mountain View, CA) was used according to the manufacturer's instructions. For microarray experiments,
20,000 cells (7,000 laser pulses, 30 µm diameter laser) were dissected. LCM caps containing the isolated tumor cells were stored at 80°C in 40 µL RLT buffer (Qiagen, Hilden, Germany) + 1% ß-mercaptoethanol until RNA extraction.
RNA extraction and linear amplification. Lysates of dissected tissue from several LCM caps were pooled into a final volume of 350 µL RLT buffer (Qiagen) + 1% ß-mercaptoethanol. Total RNA extraction was carried out using RNeasy Micro Kit (Qiagen) according to the manufacturer's instructions, combined with the protocol for LCM-derived small samples (15). Two-round linear amplification was carried out according to the GeneChip Eukaryotic Small Sample Target Labeling Assay version II protocol (Affymetrix, High Wycombe, United Kingdom), including 0.25 µL T4gp32-protein in every reverse transcriptase reaction (16). The quality of total RNA of tumor sections, of microdissected tissue and of antisense RNA (aRNA) was monitored by Agilent 2100 Bioanalyzer using the RNA 6000 Nano LabChip Kit (Agilent Technologies, Boeblingen, Germany) following the manufacturer's instructions.
Oligonucleotide microarrays and acquisition of data. After the second round of amplification, 5 µg of biotinylated fragmented aRNA was hybridized to Test3 arrays (Affymetrix). Affymetrix high-density oligonucleotide microarrays (GeneChip HG U133A and GeneChip HG U133 plus 2.0; both Affymetrix) were used for gene expression analysis. Hybridization experiments and evaluation were done by the Microarray Facility Tuebingen. The arrays were scanned at 3 µm resolution using a GeneChip System confocal scanner (Agilent Technologies). Scanned images were subjected to visual inspection and analyzed using the Affymetrix's Microarray Suite version 5.0 (MAS 5.0) algorithms (Affymetrix) to generate report files for quality control. MAS 5.0 was used to compute detection P values. Transcripts with P < 0.04 were classified as present, whereas they were classified as absent at P > 0.04. RMAExpress (17) was applied for normalization of raw data (CEL files) independently of the two different microarrays. The data discussed in this publication have been deposited in National Center for Biotechnology Information's Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) and are accessible through Gene Expression Omnibus Series accession number GSE3893.
Hierarchical cluster analysis. RMA-normalized expression values of the arrays of the HG U133A and the HG U133 plus 2.0 series were z scorenormalized independently. A z score is computed by subtracting the probe's profile mean value from its expression value and subsequently dividing it by its SD. These normalized values were then combined into one data set, considering only genes represented on both Affymetrix arrays. Hierarchical clustering was done using the neighbor-joining method (18) applied to the Pearson correlation distance matrix. In addition, other hierarchical clustering methods, such as UPGMA, and other distance-generating functions, such as Euclidean, were used. All analyses were done within the Mayday Software Package.8
Statistics. Multiple statistical methods were conducted to obtain differentially expressed genes between DCIS and IDC. The HG U133A and HG U133 Plus 2.0 probe sets were combined, and only those probes for which at least one in all HG U133A samples was characterized as present were considered (as determined by the MAS 5.0 software using P < 0.04). Thus, a total of 13,431 probes were assembled into a common expression matrix. Each test was applied to the RMA-normalized expression values of these probes. (a) Rank product (19) was conducted by using R library package RankProd distributed by the Bioconductor project (http://www.bioconductor.org). The rank product is a new nonparametric type of statistical method suggested by Breitling et al. It detects genes that are consistently found among the most strongly up-regulated or down-regulated genes in a number of replicate experiments. First for each gene or transcript, the log ratio of the matched samples expression values were determined, and genes were then ranked with respect to decreasing log ratio. Then, for each gene, the geometric mean of its ranks was determined. From these means, two sorted lists for putative up-regulated and down-regulated genes and their rank product values were generated. To evaluate the significance of a rank product value, the percentage of false-positive predictions also known as the false discovery rate is calculated using a permutation-based procedure (100,000 permutations were conducted). Genes with a false-positive prediction < 0.01 were regarded as significantly differentially expressed. (b) Paired t test was used to detect genes that are up-regulated and down-regulated in the IDC samples in comparison with the DCIS samples (P < 0.01). (c) Paired Mann-Whitney test was used to detect genes that are up-regulated and down-regulated in the IDC samples in comparison to the DCIS samples (P < 0.01). The respective lists of all methods (a-c) were combined, and the intersection was taken resulting into a "master gene list." This data set was analyzed through the use of Ingenuity Pathways Analysis (Ingenuity Systems, http://www.ingenuity.com). Functional Analysis identified the biological functions and/or diseases that were most significant to the data set. Genes from the data set that met the negative logarithmic significance cutoff of 5 or higher, and were associated with biological functions and/or diseases in the Ingenuity Pathways knowledge base were considered for the analysis. Fischer's exact test was used to calculate a P value determining the probability that each biological function and/or disease assigned to that data set is due to chance alone.
RT-PCR. A 1 µg aliquot of aRNA after the first round of amplification was reverse-transcribed for single-stranded cDNA using random primers and Superscript II kit (Invitrogen, Karlsruhe, Germany). No separate microdissection of tissue was carried out for the RT-PCR experiments to enable direct validation of the microarray experiments, avoiding discrepancies due to technical variations. Classical PCR was done using Taq DNA polymerase kit (Invitrogen). Real-time PCR was done with the LightCycler System (Roche, Mannheim, Germany) and the Platinum SYBR Green qPCR Supermix UDG (Invitrogen) according to the manufacturer's instructions (primer sequences and specific annealing temperature; Supplementary data 1). For relative quantification of gene expression, triplicate reactions were set up. Expression of pyruvate dehydrogenase ß-subunit (PDH) was used to determine the relative regulation of candidates employing the efficiency-corrected equation (20). The efficiency of every single PCR reaction was determined with LinRegPCR software (version 7.5, February 2004; ref. 21).
In situ hybridization. PCR products (primer sequences and specific annealing temperature; Supplementary data 1) were cloned into TOPO II vector (Invitrogen). Hybridization experiments on 4-µm paraffin sections were carried out as described previously using digoxigenin-labeled RNA probes of
300 bp in length targeting the 3'-end of the transcripts (RNA labeling kit, Roche; refs. 3, 22). The following improvements were accomplished: during the acetylation step, following proteinase K digestion, slides were rinsed in 0.5% acetic anhydride/0.1 mol/L triethanolamine for 10 minutes. Furthermore, an RNase One digest (10 U/mL; Promega, Mannheim, Germany) was carried out for 30 minutes during the washing steps after hybridization. The riboprobe (1 ng/µL) was hybridized. Hybridizations were considered successful if the sense probe gave no significant signal.
Immunohistochemistry. Cryosections (10 µm) were fixed for 10 minutes in acetone, followed by incubation with the primary antibody for 30 to 60 minutes. Further steps were carried out using the respective Vectastain kit (Vector Laboratories, Burlingame, CA) according to the manufacturer's instructions. HistoGreen (Linaris, Wertheim-Bettingen, Germany) was used as the chromogen. Slides were counterstained with hematoxylin and mounted in pertex (Medite, Burgdorf, Germany). The following primary antibodies were used: polyclonal rabbit anti-human DACT1 antibody (Orbigen, San Diego, CA), polyclonal goat anti-human GREM1 antibody and polyclonal goat anti-human MEF2C (both from Santa Cruz Biotechnology, Heidelberg, Germany). Polyclonal rabbit anti-human BPAG1 antibody was a kind gift from Dr. J.R. Stanley (Department of Dermatology, University of Pennsylvania, Philadelphia, PA).
| Results |
|---|
|
|
|---|
Tissue selection, LCM and RNA extraction. Twenty breast tumors, which were diagnosed to carry both DCIS and IDC, of histological grades 2 and 3, were selected from the tissue bank of the Department of Gynecology and Obstetrics, Tuebingen, Germany. None of the patients included in this study had received preoperative systemic treatment. Finally, nine specimens with significant DCIS and IDC components were selected for further experiments (Table 1 ), based on repeated pathologist's evaluations of high-quality H&E-stained sections, and the integrity of its total RNA. All specimens passed through the RNA quality controls successfully, as is exemplified in Supplementary data 2. Twenty-four biological specimens were isolated from the nine tumors (nine matched DCIS/IDC and replicate DCIS/IDC preparations from three of nine tissues; Table 1). A representative example depicting different phases during the microdissection procedure of DCIS cells is shown in Fig. 1 . Only purified epithelial cells adhering to the LCM cap after LCM were processed further (Fig. 1). At least 125 ng of total RNA was retrieved after RNA extraction of all microdissected specimens and checked for high quality (see Supplementary data 2). Exclusively good quality RNA was used for linear RNA amplification.
|
|
20,000 captured cells. Before hybridization, the quality of aRNA was verified as described in Materials and Methods (data not shown). Gene expression patterns were measured by hybridization of biotinylated aRNA to the Affymetrix GeneChip arrays HG U133A and HG U133 Plus 2.0, respectively. The mean 3'/M ratio retrieved from the high-density arrays for glyceraldehyde-3-phosphate dehydrogenase (HUMGAPDH/M33197) was 1.61 (range, 1.11-2.79; Table 1). For ß-actin (HSAC07/X00351), it was 3.72 (range, 1.76-8.5; Table 1). Hierarchical clustering for experiments. Hierarchical clustering was done using the neighbor-joining method applied to the Pearson correlation distance matrix. In addition, other hierarchical clustering methods, such as UPGMA, and other distance-generating functions, such as Euclidean, were used. Only marginal deviations between the methods and the distance matrices used were observed. We found most stable results for the neighbor-joining method applied to the Pearson correlation distance matrix. In Fig. 2 , the clustering results for the combined data set is depicted, indicating that different tumor stages do not form distinct groups. Instead, the different synchronous stages of progression within an individual patient cluster more closely to one another than to their respective stage from different patients. Three matched tumor specimens (T661, T787, and T808), microdissected and hybridized independently as technical replicates, cluster to their matched tumor. In between these clusters, the respective tumor stages cluster together, corroborating our approach of combining the gene expression results retrieved from two different Affymetrix microarrays. Additionally, tumors can be stratified into a homogenous ER-negative tumor cluster, separated from distinct subgroups represented by ER-positive tumors.
|
Validation of candidate genes by RT-PCR. Evaluation of gene expression for two invasion-related candidates, matrix metallopeptidase 11 (MMP11) and urokinase-type plasminogen activator (PLAU), by classical RT-PCR confirmed their up-regulation in IDC compared to matched DCIS, thus verifying microarray results (data not shown).
For 16 additional differentially regulated candidate genes, real-time PCR was applied in triplicate experiments. In Table 2
, relative gene expression values (fold changes) between DCIS and corresponding invasive components are listed. In agreement with the microarray results, real-time PCR shows the up-regulation of pleckstrin homology domain containing member 1 (PLEKHC1), MADS box transcription enhancer factor 2 (MEF2C), biglycan (BGN), periostin (POSTN), olfactomedin-like 2B (OLFML2B), fibroblast activation protein
(FAP), W46291 (Affymetrix-ID, 213790_at), ras-induced senescence 1 (RIS1), tumor necrosis factor-
-induced protein 6 (TNFAIP6), leucine-rich repeat containing 15 (LRRC15), gremlin 1 homologue (GREM1), squamous cell carcinoma antigen recognized by T cell 2 (SART2), and dapper homologue 1 (DACT1) and down-regulation of bullous pemphigoid antigen 1 (BPAG1) in IDC (see Supplementary data 3). Up-regulation for large tumor suppressor homologue 2 (LATS2) and W74476 (Affymetrix-ID, 226997_at), not included in the list derived from the statistical computations because they are only represented on the HG U133 2.0 Plus array, also corroborate microarray data.
|
|
|
| Discussion |
|---|
|
|
|---|
In the first part of the research, the major focus was to establish and standardize protocols to enable microarray analysis of small sample RNA from microdissected tissue. The involvement of the pathologist, documentation of LCM and stepwise RNA quality controls are fundamental attributes of this study. 3'/M ratios, a measure of quality for the labeled aRNAs after two-round amplification (GeneChip eukaryotic small sample target labeling assay, version II, Affymetrix), are in the recommended range for glyceraldehyde-3-phosphate dehydrogenase (3'/M
3). For ß-actin, the ratios are slightly increased. Thus, the quality of the hybridized specimens, the comparability of matched-pair samples, and the robustness of the approach were finally determined by hierarchical cluster analysis. Here, different progression stages derived from the same patient cluster together, which was also previously observed (9). Additionally, replicate DCIS and IDC cluster to their equivalent from the same patient. These findings corroborate the robustness and stringency of our established experimental approach. The observation that different progression stages derived from the same patient cluster more closely to one another than the corresponding stages from different patients suggests, that "interpatient" genetic variability is overriding the differential gene expression between DCIS and IDC. These concordant findings emphasize the need to investigate pathological lesions from the same patient to reduce false-positive gene expression differences due to genetic variability between individual patients.
Recently, discussions have arisen regarding the cross-platform comparability of gene expression data and regarding reproducibility and the demand for standardization of microarray experiments (23, 24). Therefore, attention was focused on statistical analyses. A "master gene set" was composed by applying three independent statistical tests to increase the chance of identifying a robust gene set. Four hundred and forty-five probe sets (360 genes) were identified as significantly up-regulated in IDC (rank product, false-positive prediction <0.01; t test and Mann-Whitney: P < 0.01), making
2% of the total amount of probe sets analyzed (22,283 probe sets; HG U133A). In the comparable microarray- and LCM-based study done by Ma et al., 85 genes are significantly up-regulated in IDC (
0.1%, P < 0.01; ref. 9). An explanation for the increased number of differentially up-regulated genes in our experiment might be that Ma et al. used cDNA arrays that are more prone to cross-hybridization, and marginal differentially expressed genes might be lost in the background of signals (9, 23). However, these marginal differentially expressed genes may be the most significant or biologically causative (25). Many of the genes represented in our "master gene list" have relatively small but statistically significant fold changes which were validated for 18 candidate genes by RT-PCR.
It is known that comparison of gene expression data obtained from different array platforms results in a marginal overlap (24). Thus, it is not surprising that only four genes from our set are also represented in the gene list of Ma and colleagues: adipocyte enhancer-binding protein 1 (AEBP1), syndecan 2 (SDC2), chromosome 18 open reading frame 1 (C18orf1), and collagen type XV
1 (COL15A1). AEBP1 interacts with PTEN, a known tumor suppressor in breast cancer, whose inactivation is correlated with invasiveness and metastasis in breast cancer (26, 27).
We observe a disproportion of up-regulated versus down-regulated probe sets in IDC compared to DCIS (445:101). A possible explanation might be the clonal expansion of cell lineages during tumor progression, thereby increasing the number of mRNA species (28). Furthermore, some genes might be present in IDC samples because of co-isolation of nonepithelial cells in the invasive component during microdissection, although isolation of contaminating cells can be reduced to 0.6% using LCM (11). There might also be a biological explanation for the identification of genes supposed to be "stromal" or "fibroblast": tumor progression includes processes like epithelial-mesenchymal transition. Therein epithelial cells lose epithelial polarity and acquire a fibroblastoid phenotype, making it a correlative for metastasis (29). Our "master gene set" contains TWIST1, which was recently published to be a master regulator promoting epithelial-mesenchymal transition contributing to metastasis in breast cancer (30), and several genes (e.g., DCN, SDC2, SPARC, MMP13, PDGFRB, INHBA, and DAB2)9 represented in a gene list associated with epithelial-mesenchymal transition in mammary epithelial cells (31).
The master gene set also contains genes whose relative up-regulation in IDC was validated by RT-PCR and which are already correlated with invasion: MMP11/STR-3 (32, 33), PLAU/uPA (34), LRRC15/Lib (35), and BPAG1 (36). In addition, MMP11 is also a predictor for poor prognosis (37). PLAU is a novel prognostic factor and is validated at the highest level of evidence regarding its clinical utility in breast cancer (34). LRRC15 was found to be highly and uniquely expressed in breast tumors compared to their matched normal tissue (38). It is present at the leading edge of migrating cells and siRNA-mediated mRNA suppression in highly invasive Hs467T breast carcinoma cells leads to abrogation of invasiveness in these cells (35). Being a transmembrane protein makes it an easily targetable protein. BPAG1 is expressed in hemidesmosomes connecting epithelial cells to the basement membrane. Invasive breast cancer cells do not express hemidesmosomes or most of the component proteins (36).
The presence of MMP11, PLAU, LRRC15, and BPAG1 in our master gene set suggests that our approach might have detected more transcripts involved but not yet correlated with migration and invasion. Further support is provided by Functional Analysis using Ingenuity software, showing that a significant fraction of our master gene set is involved in cell-to-cell signaling and interaction. This emphasizes the importance of changing intercellular communication during the process of invasion in breast cancer. According to Nagaraja et al., a small set of nine transcripts playing a role in cell-to-cell signaling and interaction is sufficient to distinguish invasive breast cancer cell lines from noninvasive (39). Among this set, annexin A1 (ANXA1), cadherin 11 (CDH11), and claudin 3 (CLDN3) show the same pattern of expression in invasive and noninvasive cell lines as we observe for them in DCIS to IDC transition.
Using in situ hybridization and/or immunohistochemistry, several candidates which have thus far not been correlated with breast cancer invasion can be localized to the epithelial tumor cells, a result also corroborating the benefit of LCM for our experimental approach. These genes are DACT1, GREM1, MEF2C, SART2, and TNFAIP6. DACT1 is not yet correlated with breast cancer. Its homologue, DAPPER1 in Xenopus, modulates WNT signaling, which is important in the development of breast cancer (40, 41). GREM1 is a secreted antagonist of bone morphogenic proteins, which are essential in initiating epithelial-mesenchymal signaling (42). Secreted proteins are potential interesting targets for diagnosis and therapy (43). MEF2C is a transcriptional enhancer whose biological function in human breast cancer is unknown. However, it was shown that its chromosomal localization is assigned to the mammary cancer susceptibility 1 locus (Mcs1) on chromosome 2q1 segregating with the sensitivity to mammary cancer development in rats (44). TNFAIP6/TSG-6 is not yet correlated with cancer. Published data suggest that it might down-regulate inflammatory response, potentially enabling invasive cells to escape immune response (45). SART2, belongs to a protein family encoding antigenic peptides capable of inducing tumor-reactive CTLs in HLA-A24+/2+ patients. Although SART2 could not be detected in MCF-7 cells by Northern blotting (46), we were able to detect its expression in different breast cell lines and tumor tissue using RT-PCR methods and in situ hybridization. Because SART2-derived peptides were used in vaccination studies in patients with lung, colon, and cervical cancers (47), it might also be considered to be applied in breast cancer, based on our results.
In conclusion, we identified progression-specific candidate genes using a quality-controlled approach, combining LCM and microarray analysis. We provided a deeper insight into the molecular biology of DCIS to IDC transition by characterizing the set of differentially regulated genes and single validated candidates. Among these candidates are gene products with potential clinical importance such as GREM1, SART2, or LRRC15.
| Acknowledgments |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
The skillful technical assistance of U. Hilcher, B. Kootz, K. Abt, and Y. Wachtel is gratefully acknowledged. We thank Dr. U. Vogel for providing us with paraffin-embedded breast cancer tissue. The polyclonal rabbit anti-human BPAG1 (no. 1133) antibody was a kind gift from Dr. J.R. Stanley and his colleagues.
| Footnotes |
|---|
R. Kurek and H.J. Neubauer share senior authorship.
6 We use traditional terminology although it was replaced by ductal intraepithelial neoplasia classification because the histopathology for the tumors used in our experiments is based on the previous nomenclature. ![]()
7 A. Endress, S. Clare, T. Fehm, H. Neubauer, E. Solomayer, S. Lessmann, C. Schuetz, H. Loeffler, K. Abt, D. Wallwiener, R. Kurek. TumorAGENTan interdisciplinary tumor database for translational research in breast cancer. Submitted for publication. ![]()
8 J. Dietzsch, N. Gehlenborg, and K. Nieselt, 2006. Mayday: a microarray data analysis workbench. Bioinformatics Advance Access published online on February 24, 2006 Bioinformatics, doi.10.1093/bioinformatics/bt1070. ![]()
9 DCN, decorin; SDC2, syndecan 2; SPARC, osteonectin; MMP13, matrix metallopeptidase 13; PDGFRB, platelet-derived growth factor receptor ß polypeptide; INHBA, inhibin ßA; DAB2, disabled homologue 2. ![]()
Received 12/27/05. Revised 2/22/06. Accepted 3/ 2/06.
| References |
|---|
|
|
|---|
-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet 2003;34:26773.[CrossRef][Medline]This article has been cited by other articles:
![]() |
J. O'Prey, S. Wilkinson, and K. M. Ryan Tumor Antigen LRRC15 Impedes Adenoviral Infection: Implications for Virus-Based Cancer Therapy J. Virol., June 15, 2008; 82(12): 5933 - 5939. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Axelsen, J. Lotem, L. Sachs, and E. Domany Genes overexpressed in different human solid cancers exhibit different tissue-specific expression profiles PNAS, August 7, 2007; 104(32): 13122 - 13127. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |