Abstract
Overexpression of the epidermal growth factor receptor (egfr) gene is a common feature in breast cancer. We demonstrated recently that the expression of EGFR in breast cancer strongly correlates with the length of a CA simple sequence repeat within the first 2000 bases in intron 1 of the egfr gene [CA simple sequence repeat (CA-SSR) I; H. Buerger et al., Cancer Res., 60: 854–857, 2000].
Using a standardized semiautomated method of microsatellite analysis for loss of heterozygosity detection, we identified an allelic imbalance (AI) at the egfr locus in 55 of 163 primary breast cancer cases. Fine mapping of the chromosomal region at 7p12–15 around the egfr gene using 10 CA-SSR markers showed that mutations of egfr in breast cancer are frequently restricted to the first intron of egfr. Thereby, the simple sequence repeat CA-SSR I in intron 1 was affected in 84% of the patients with AI. Reverse transcription-PCR analysis of 23 breast cancer tissues with AI excluded the presence of in-frame deletions between exon 2 and exon 7. For additional characterization of the underlying phenomenon leading to the detection of an AI in microsatellite analysis, a quantitative 5′-nuclease assay for the first CA-SSR I in intron 1 was established. In breast cancer cases with AI the presence of amplifications of this sequence was shown. Kaplan-Meier analysis revealed a statistically significant worse prognosis for patients with AI in the cancer tissue at the egfr locus compared with patients without AI. Interestingly, 75% of the patients bearing AI of CA-SSR I in the tumor also showed AI at normal, nontumorous breast tissue. Our data strongly support the assumption that distinct amplifications in intronic sequences of the egfr gene, which enhance the basic transcription activity of the gene, represent one of the first steps in breast carcinogenesis. Furthermore, they point to the presence of prognosis-associated markers for breast cancer already in morphological normal breast tissue.
Introduction
The egfr 3 is a prominent member of the erbB gene family, which encodes a receptor tyrosine kinase with a pivotal role in the regulation of cell growth and differentiation in response to specific ligands such as epidermal growth factor and transforming growth factor α. Overexpression of EGFR, which has been shown in a variety of tumors, resulted in cellular transformation as shown in NIH-3T3 cells (1 , 2) . In breast cancer patients, EGFR overexpression was associated with a poor clinical outcome and no benefit from adjuvant endocrine therapy (3 , 4) . Amplifications of egfr in breast cancer, as described for c-erbB2 (3) , account for <5% of all breast cancer cases. Therefore, it was broadly accepted that overexpression of EGFR in breast cancer is commonly regulated on the transcriptional level. Furthermore, in a few breast cancer cases an in-frame deletion comprising exon 2 and exon 7 has been described, encoding a constitutively activated EGFR (egfr-vIII), which was found frequently in glioblastomas (5, 6, 7) .
Premature termination of transcription between exons 1 and 2 of the egfr gene occurring proximal to an enhancer region in intron 1 has been described (8) . The transcription activity of the egfr gene depends strongly on the cooperation of this enhancer in intron 1 with an enhancer sequence upstream of the transcriptional start side (9) . The enhancer in intron 1 is located in close proximity to a polymorphic CA-SSR containing 14–21 CA dinucleotides (10) . For this CA-SSR we have shown that egfr transcription is modulated in vitro and in vivo (11) . Decreasing numbers of CA dinucleotides in CA-SSR I correlated with an increased egfr transcription and EGFR protein expression (12) .
Our data indicate that CA-SSR I is the frequent target for mutations within the egfr gene in breast cancer. For the first time we show that the mutations are amplifications. Furthermore, these amplifications are likely to represent early steps in breast carcinogenesis, moreover thereby inducing a breast cancer subtype associated with poor clinical outcome. The discovery of one of the putative, first genetic hits in a subtype of breast cancer might serve as a starting point in the definition of new genetic pathways in breast cancer.
Materials and Methods
Patients.
In total, 163 Caucasian patients from the Department of Gynecology of the University Hospital of Muenster primarily diagnosed with breast cancer who had undergone mastectomy with full dissection of axillary lymph nodes were included. The average age at time of primary diagnosis was 57.3 years (range, 34–82 years). For 82 patients the time of follow-up exceeded 5 years. The patient stages were as follows: I, 29.5%; IIa, 25.6%; IIb, 15.4%; IIIa, 2.6%; IIIb, 6.4%, and IV, 20.5%. Lymph node involvement was observed in 52% of the cases. Histologically proven tumor tissues were frozen immediately after surgery.
Microsatellite PCR.
Genomic DNA was isolated from peripheral blood and fresh-frozen tumor tissue of patients with breast cancer. DNA isolation from peripheral blood was performed using the QIAamp Blood kit (Qiagen), and tumor tissue DNA was extracted using the QIAamp DNA Mini kit (Qiagen) according to the manufacturer’s instructions. As a control for PCR fragment length DNA from the tumor cell line MDA-MB-468 was analyzed.
PCR amplification was performed using AmpliTaq DNA Polymerase (Applera) in a 25-μl reaction volume containing 200 nm of each primer, 1 × GeneAmp buffer II, 2 mm MgCl2, 100 μm of each GeneAmp deoxynucleotide triphosphate (Applera), and 20 ng of sample DNA. The primer sequences specific for microsatellite markers near the chromosomal locus of egfr on chromosome 7p are given in Table 1 ⇓ . Downstream primers were labeled with a fluorescent dye (FAM). Separation was done with a four-color laser-induced fluorescence capillary electrophoresis system (ABI PRISM 310 DNA Analyser and ABI PRISM 3700 DNA Analyser). One to 2 μl of the amplified PCR products were diluted in 20 μl of water (high-performance liquid chromatography grade) containing 0.5 μl GENESCAN 500 TAMRA or 400 HD (Rox) fluorescent size standard (Applera). Denatured PCR fragments were separated on the ABI PRISM 310 and ABI PRISM 3700 DNA Analyser (Applera). Evaluation of the collected data was accomplished with the GeneScan Analysis Software (Applera). All of the analyses were performed at least in duplicates of independent PCRs.
Primer sequences of microsatellite markers and CA-SSR markers in the egfr region
All of the primers are shown in 5′-3′ orientation, and the order of 7p loci is shown from centromere (top) to telomere (bottom). The localization of D7S499 is highly controversial and may be regarded preliminary.
For standardization of the analysis, the AI score (which is similar to the loss of heterozygosity score defined by Canzian et al.; Ref. 13 ) was calculated according to the equation published by Canzian et al. (13) , which represents the ratio of the intensities of the two alleles in the tumor tissue corrected by that in nontumor. The AI score estimates the imbalance of the two allele intensities in tumor. The reliability of peak area and lengths for AI score determination was tested by 10 runs of independent leukocyte PCR product showing a SD of the quotient of the allele peak areas (a1/a2) of 7% as described previously (14) . The dc was calculated based on the prior determined SD (dc = 2 × √2 × SD). The cutoff values for AI were calculated by 1-dc for the loss of the longer and 1/(1-dc) of the shorter allele resulting in cutoff values of 0.79 (loss of longer allele) and 1.27 (loss of shorter allele). Because of the low SD of the method, AI could be detected sensitively if the peak area for one allele in relation to the other decreased in the tumor DNA to 79% of the normal. Therefore, a microsatellite was scored positive for AI if the AI score was <0.79 or >1.27.
RT-PCR Analysis.
Quick-frozen tumor tissues from 23 breast cancer patients with AI at the egfr region were submitted to extraction of total RNA using the single-step isolation described by Chomczynski and Sacchi (15) using RNAzol (Diagnostic International).
RT-PCR was performed with the Titan One Step RT-PCR kit purchased from Roche Molecular Diagnostics according to the manufacturer’s instructions. In brief, total RNA was reverse-transcribed using avian myeloblastosis virus reverse transcriptase. Amplification was performed using Taq DNA polymerase and Pwo DNA polymerase, and the following oligonucleotides: K1:5′-(Fl)-GC TCT ACA ACC CCA CCA CG-3′, V3A:5′-(Fl)-GT ATT GAT CGG GAG AGC CG-3′, and V3B:5′-GTG GAG ATC GCC ACT GAT G-3′.
Cycling conditions for primer pairs V3A/V3B and V3B/K1 were as follows: reverse transcription at 50°C for 30 min, initial denaturation at 94°C for 2 min, 35 cycles of denaturation (94°C for 1 min), annealing (59°C for 1 min), and extension (68°C for 1 min), and a final extension step at 68°C for 7 min. Amplified fragments were separated on an agarose gel and visualized by ethidium bromide staining. Gel-excised fragments were purified using the QIAExII system (Qiagen) and submitted to direct DNA sequencing using Big Dye Terminator Cycle Sequencing reagents (Applera) and the ABI PRISM 3700 DNA Analyser. The primer pairs and amplification conditions were optimized using both normal lymphocyte DNA and plasmid DHΔE801P containing an exon2/exon7 deletion fragment of the egfr. The primer pair V3A/V3B results in a regular fragment of 1153 bp spanning exon 1 to exon 8, and a 352-bp fragment is amplified when exons 2–7 are deleted. Primer pair V3B/K1 generates a 307-bp fragment when the mRNA is intact and a 251-bp fragment when exons 2–7 are deleted.
Quantitative Real-Time PCR (5′ Nuclease Assay).
Primers specific for sequences flanking the first CA repeat in the first intron of the egfr gene (CAIfor: 5′-tgaagaatttgagccaaccaaa-3′ and CAIrev: 5′-cacttgaaccagggacagca-3′) were designed using Primer Express software (Applera) and a universal probe consisting of 15 CA repeats, and a 5′ fluorescent label (CA-Fam) was designed. The primers represented specific sequences for egfr, which were checked by BLAST search. 4 Primers and probes were also designed for two different single-copy genes, SOD2 (chromosome 6q25 GenBank accession no. 65965, forward primer: 5′-GGAGAAGCTGACGGCTGC-3′, reverse primer: 5′-CCTTATTGAAACCAAGCCAACC-3′, VIC-labeled probe: 5′-CAACCTGAGCCTTGGACACCAACAGA-3′) and HBB (chromosome 11p, GenBank accession No. V00499, forward primer: 5′-GTGAAGGCTCATGGCAAGAAAG-3′, reverse primer: 5′-CAGCTCACTCAGTGTGGCAAAG-3′, VIC-labeled probe: ATGGCCTGGCTCACCTGGACAACC). For all three of the genes, the amplicon length was minimized (68–97 bp) to allow for the most efficient PCR amplification. PCR analysis was performed using TaqMan Universal Mix (Applera) and an ABI Prism 7900HT Sequence Detection System (Applera). A PCR cycle consisted of denaturation at 95°C for 15 s and primer annealing and extension at 60°C for 1 min, and was repeated 40 times. PCR reactions were performed at least in triplicates, and serial dilutions of DNA ensured accuracy of gene dosage quantification.
Amplification of the egfr gene was measured in the breast cancer-derived cell line MDA-MB-468 in comparison to normal leukocytes. MDA-MB-468 DNA reportedly displays a 30–50-fold amplification of the egfr gene (16) and was used as a positive control. DNA concentrations were normalized to both SOD2 and HBB, two different single-copy genes (17) .
Induction and Detection of Fragile Sites in B Lymphocytes.
Heparinized whole peripheral blood (1 ml) was cultured in 10 ml Ham’s F-10 medium supplemented with 20% FCS, 100 μmol l-glutamine, 144 μg phytohemagglutinin, and 10,000 units of streptomycin/penicillin at 37°C for 72 h. At 24, 24, and 6 h before harvest, aphidicolin (0.4 μm final concentration), ethanol (0.5% final concentration), and caffeine (2.2 mm final concentration), respectively, were added to the lymphocyte culture for fragile site induction.
Dividing cells were blocked in metaphase by addition of colcemide 1.5 h before harvest. Standard cytogenetic techniques were used for harvesting and slide preparation. The probe for egfr detection was derived from homo sapiens PAC clone containing the whole egfr gene (GenBank accession no. AC006977). DNA was labeled with digoxigenin-11-dUTP by nick translation following standard protocols (18) . The probe was denatured for 5 min at 70°C in 70% formamid-0.6× SSC. Hybridization to the metaphase spreads was carried out overnight at 37°C in a 50% formamid-1× SSC-10% dextran sulfate solution in the presence of Cot-1-DNA (Life Technologies, Inc.) and HPL-DNA (Sigma). Posthybridization washes were performed at 45°C in 50% formamide-2× SSC followed by blocking with 3% BSA in 4× SSC at 37°C. Probe detection was performed using mouse-anti-digoxigenin (Sigma) and Cy3-labeled goat-antimouse antibodies. The chromosomes were counterstained with 4′,6-diamidino-2-phenylindole, and images were captured with an Olympus BX61 microscope connected with a digital camera system DP50 (Hamburg, Germany). Giemsa-trypsin banding was performed according to the standard protocol of Seabright (19) . Briefly, slides were incubated 10–40 s in trypsin (1 mg/ml in NaCl) at 37°C, quickly rinsed in water, stained 4 min in Giemsa, and then washed in water. Complete metaphase spreads (n > 100) were assessed at the 350–400-band level by conventional light microscopy (Olympus BX61).
Microdissection of Tissue from Normal, Premalignant Lesions and Tumor Tissues.
Tissues from 16 patients, all known to harbor AI at CA-SSR I in their invasive carcinoma, were laser microdissected (Palm, Oberlensheim, Germany) from serial 10-μm sections of paraffin-embedded tissue after hematoxylin staining. The sections were evaluated by two pathologists (W. B., H. B.). DNA was isolated from both nontumorous lymphocytes and invasive carcinoma from paraffin-embedded tissue. In these 16 cases, nontumorous, morphologically normal breast lobules (n = 69) could be isolated. Breast lobules were pooled to maximize the DNA yield. Only breast lobules with a distinct myoepithelial layer and a single layer of luminal epithelial cells with small, uniform nuclei with evenly distribute chromatin were selected for AI analysis. Twenty-three samples consisted of benign, metaplastic, or hyperplastic proliferative breast lesions such as adenosis, ductal hyperplasia, or apocrine adenosis. In 3 cases ductal hyperplasias with nuclear atypias were present. Also, 3 samples of DCIS and 9 samples of associated lymph node metastasis could be analyzed. Benign and malignant breast lesions were defined according to Elston and Ellis (20) .
For isolation of DNA from deparaffinated, microdissected tissue the material was suspended in 10–200 μl TE buffer [10 mm Tris/HCl and 1 mm EDTA (pH 7.5)] and incubated with 1–20 μl proteinase K (600 mAu/ml) at 56°C for 10 h. The tube was boiled (10 min), chilled (10 min at 4°C), and enclosed centrifuged for 1 min at 14,000 × g. The supernatant was transferred into a fresh tube before DNA was pelleted in 70% ethanol and centrifuged for 20 min. After two washing steps (70% ethanol; 20 min 14,000 × g) the pellet was air dried and rehydrated by LTE or distilled water and stored frozen at −20°C. Resulting DNA amounts were determined before CA-SSR I microsatellite analysis by quantitative real-time PCR. Both previously mentioned single-copy genes, SOD2 and HBB (primer and probe sequences as shown above), were amplified and quantified via the 5′ nuclease assay. PCR analysis was performed using TaqMan Universal Mix (Applera), 400 nmol of forward and reverse primer, 200 nmol probe, 2.5 μl DNA solution, and an ABI Prism 7900HT Sequence Detection System (Applera). PCR conditions were the same as described above. As a control, DNA isolated from formalin-fixed, paraffin-embedded lymph node tissue was photometrically quantified. A standard curve was obtained by serial dilution of control DNA ranging from 120 pg to 10 ng per (each) PCR assay and single-copy gene. The test DNA was amplified in the same PCR run, and the concentration was determined using the standard curve. Each DNA was tested once for each single-copy gene, because the total DNA amount after microdissection is limited. The DNA was used for CA-SSR I microsatellite analysis if the obtained DNA concentrations for SOD2 and HBB were equal.
To ensure the specificity of microsatellite analysis in microdissected tissues, we determined the minimum DNA template concentration by performing serial dilutions of DNA starting at 30 ng. Reproducibility of results was approved by duplicate experiments using a minimum of 1 ng of DNA. Therefore, microsatellite analysis of microdissected tissues was performed with 1.2 ng of DNA.
The PCR protocol for the determination of AI at microdissected tissues was adopted for the use of low cell numbers as follows. PCR assays of egfr using primers for CA-SSR I (Table 1) ⇓ were performed in 10-μl reactions containing 1× PCR Buffer II (Applera), 2 mm MgCl2, 50 μm of each GeneAmp deoxynucleotide triphosphate (Applera), 1 μm of forward and reverse primer, 1.2 ng of template DNA, and 0.5 units of Ampli Taq Gold (Applera). The PCR reactions were overlaid with mineral oil and carried out in a 96-well thermocycler (GeneAmp PCR System 9700; Applera). A denaturation step at 95°C for 10 min was followed by 40 cycles of denaturation at 95°C for 30 s, annealing at 55°C for 30 s, primer extension at 72°C for 30 s, and one final extension at 72°C for 7 min. Separation of PCR fragments was performed with a four-color laser-induced fluorescence capillary electrophoresis system as described above.
Statistical Methods.
Actual survival probabilities were calculated by Kaplan-Meier method, and the differences between the groups were estimated by the log-rank or Wilcoxon test. It should be noted that the number of patients in the tables and figures does not add up to 163 because of missing data, e.g., not performed experiments, and patients who were not at least 5 years in or lost in follow-up. Multivariate analysis was performed using Cox proportional hazards regression model.
Results
AI at CA-SSR I as Determined by PCR-based Microsatellite Analysis Occurred in High Frequencies in Invasive Breast Cancer.
The egfr locus on chromosome 7p was tested for AI using a standardized semiautomated method for microsatellite analysis. DNA was isolated from normal lymphocytes and tumor tissue of 163 breast cancer patients, and assessed for AI using primers specific for CA-SSR I in the first intron of the egfr gene. We identified AI in 55 of 163 (34%) primary cases. CA-SSR I centers the known regulatory sequences of the egfr gene, which is located within the 5′-flanking region and intron 1. In the cancerous tissue AI occurred preferentially with an increase of the peak representing the shorter allele of the CA repeat.
To estimate the size of the mutated sequence we used 17 microsatellite markers on chromosome 7p to test 64 breast cancer patients (Fig. 1) ⇓ . The more distant microsatellite loci, D7S519, D7S665, and D7S510, showed no AI, and D7S674, D7S670, D7S478, and CA-SSR V were not informative (homozygous). Ten markers around and in the egfr gene revealed AI on 7p in 23 cases. No AI could be detected in 41 cases. Interestingly, 84% (16 of 19) of informative cases with AI showed AI at CA-SSR I located between enhancers 1 and 2 in the first intron of the egfr (Table 2) ⇓ . In addition to the AI at CA-SSR I, AI could be shown at different microsatellite markers on chromosome 7p. The affected sequence reaches up to 8 megabases with the shortest affected region being up to 2 megabases around CA-SSR I (e.g., see patients 78/414 and 7/425). However, all of the AIs occurred in the egfr gene, and/or the close vicinity upstream and/or downstream of the gene and at no other regions of chromosome 7p.
Mapping of AIs on chromosome 7p. In a PCR-based microsatellite analysis we used 17 different markers to map the region around the egfr gene on chromosome 7p. Ten microsatellite markers including six CA-SSR markers in the egfr gene showed AI. No AI could be detected using D7S510, D7S665, and D7S519, which are located more distal on chromosome 7p and markers D7S670, D7S674, D7S478, and CA-SSR V were not informative. As an example results of CA-SSR I microsatellite analysis are given in the right panel.
Frequency of AI in breast cancer samples
Genetic alterations of the egfr locus on chromosome 7p were analyzed in DNA from breast cancer patients (n = 64). DNA was isolated from both normal lymphocytes and tumor tissue of each patient. Ten informative microsatellite markers in or near the egfr gene were tested by microsatellite PCR and subsequent four-color laser-induced fluorescence capillary electrophoresis as described in “Materials and Methods.” Only heterozygous alleles were informative for AI and an AI score <0.79 and >1.27 was considered positive. Table 2 <$REFLINK> shows all of the patients with AI (n = 23). The frequency of AI in percentage of informative cases (heterozygous alleles) with AI in the egfr region is shown in the last lane. •, AI; ◑, no AI; ○, homozygous.
AI at CA-SSR I Was Not Associated with the vIII Deletion Mutant of EGFR.
To investigate whether the AI at CA-SSR I is caused by the well-defined exon2/exon7 deletion resulting in EGFR isoform vIII (5) , we performed RT-PCR analysis using specific primer pairs as described in “Material and Methods.” RT-PCR analysis of tumor tissues from 23 patients with egfr AI displayed only regular transcripts (Fig. 2) ⇓ . These results were confirmed by Western blotting where only full-length EGFR proteins could be detected (data not shown). Thus, exon2/exon7 deletions could not be detected in these tissues and, therefore, are not related to the occurrence of AI in the egfr gene.
RT-PCR analysis of egfr transcripts. RT-PCR analysis on egfr mRNA was performed using different primer sets, which were designed to identify a possible exon 2/exon 7 deletion (egfr-vIII) as described in “Material and Methods.” Exons 1–8 of egfr are shown, and the primer pairs are indicated (top panel). Amplified RT-PCR fragments were separated on agarose gels, and lanes 1–6 represent samples of patients with AI at CA-SSR I (bottom middle and right panel). As a positive control plasmid DHΔE801P with exon 2/exon 7 deletion was used, which generates a 352-bp fragment using primer pair V3A/V3B (bottom left panel). (Size standards VIII from Roche: 124, 147, 190, 242, 320, 404, 489 + 501, 692, 900, and 1114 bp, and 100 bp ladder from MBI: 100, 200, 300, 400, 500, 600, 700, 800, 900, 1031, 1200, 1500, 2000, and 3000 bp).
AI at CA-SSR I Represents an Amplification as Determined by Quantitative Real-Time PCR.
To additionally study whether the AI represented a deletion or an amplification of one allele, we developed a quantitative 5′ nuclease assay. Egfr-specific primers that flank the CA-SSR I in the first intron and a FAM-labeled (CA)15 probe were used in this assay. PCR amplification was performed in serial dilutions of leukocyte and tumor DNA normalized to two different housekeeping genes as described in “Materials and Methods.”
The sensitivity of the method allowed the detection of an amplification if the AI score exceeds <0.56 or >1.3 (Fig. 3) ⇓ . As a control, DNA isolated from the MDA-MB-468 cell line was tested and displayed 30–60-fold amplification of egfr CA-SSR I. Up to 16-fold amplification of the egfr CA-SSR I region could be detected in patients with an AI score of 0.23. In cases with an AI score between 0.56 and 0.79 or 1.27 and 1.29, no significant differences in the gene dosage between tumor DNA and normal DNA could be assessed, but no case with a decreased gene dosage was found for the egfr CA SSR I by this method.
Amplifications of the egfr gene in breast cancer patients with AI. Real-time PCR was used to quantitate egfr gene dosages in tumor tissue of breast cancer patients. Isolated DNA from normal lymphocytes (Lymphoc.), tumor tissue (tumor), and MDA-MB-468 cell line (MDA) was adjusted to 1 ng/μl, and serial dilutions (1:1) were performed. Duplicates of DNA dilutions were analyzed using egfr CA-SSR I-specific primers and probe in a 5′ nuclease assay. Specific primers and probes for the SOD2 gene were used to normalize DNA concentrations. Ct values (threshold cycle) allow quantification of DNA concentrations and represent the number of cycles necessary for a given DNA concentration to exceed the threshold. Here, we give an example of a patient (No. 78/414) with 2-fold amplification of CA-SSR I. Normal lymphocyte and tumor tissue DNA display identical Ct values using SOD2 primers and probe, whereas egfr primers and probe reveal a difference of Ct = 1. Rn, fluorescence in a semilogarithmic plot normalized to a passive reference.
The Induction of a Common Fragile Site in the egfr Region Gives First Hinds for a Higher Probability for Amplifications at That Locus.
Because of the fact that the sequence downstream from the CA-SSR I contains sequences that suggest a higher probability for DNA rearrangements (MER 22 and AluI), we treated lymphocytes for the induction of common fragile sites by the method of Yunis and Soreng (21) . Single-strand breaks at common fragile sites are well-known early events to trigger amplification by breakage-fusion-bridge cycles (22) . As shown in Fig. 4 ⇓ , a single-strand break could be observed by Giemsa stain and FISH analysis, which occurred near the egfr gene region as shown by the FISH analysis.
Single-strand break in chromosome 7p induced by drug treatment according to Yunis and Soreng (21) . Giemsa-trypsin banding (a and b) of chromosome 7 was carried out according to the standard protocol of Seabright (19) , and FISH analysis (c and d) was performed using an egfr-specific probe derived from PAC clone AC006977 by nick translation, mouse-anti-DIG (Sigma), and Cy3-labeled goat-antimouse antibodies counterstained with 4′,6-diamidino-2-phenylindole. Arrows indicate the induction of a common fragile site at chromosome 7p after treatment with aphidicolin, ethanol, and caffeine (a and c). Untreated reference chromosomes are shown in b and d.
AI at CA-SSR I Occurred in Normal Mammary Gland Tissue and Benign Lesions, as well as in Invasive and Metastatic Cancer Tissues.
In total, 127 microdissected samples of breast tissue from 16 patients with AI at the egfr locus were studied. Between 100 and 500 cells were microdissected from formalin-fixed, paraffin-embedded normal, noncancerous, and cancer tissues. In addition, invasive breast cancer tissue and metastatic tissue was evaluated as a control. AI analysis was performed using primers specific for the CA-SSR I of the egfr gene in microsatellite PCR. All of the 30 samples from metastatic tissue, invasive breast cancer tissue, and DCIS displayed AI at CA-SSR I (Table 3) ⇓ . In 5 of 23 and 4 of 5 samples, classified as benign proliferative breast lesions or intraductal proliferations with nuclear atypia, respectively, AI was detected. Furthermore, 30% of the microdissected samples of normal, noncancerous breast tissue showed an AI at the CA-SSR I of egfr. Twelve of 16 patients (75%) revealed AI at normal breast lobules, and at least two lobules were affected.
Frequency of AI in microdissected lesions of 16 breast cancer patients with AI at the matched invasive tumor
Between 100 and 500 cells were microdissected from the indicated tissues. A total of 127 samples from 16 patients with known AI at CA-SSR I was tested for AI by microsatellite PCR using CA-SSR I-specific primers.
AI at CA-SSR I Indicated a Subtype of Breast Cancer with Poor Clinical Outcome.
Patients with AI at CA-SSR I showed significantly higher risk of recurrence than the group with tumors diploid for egfr alleles (Fig. 5 ⇓ ; P < 0.001). The odds ratio for the 5-year probability of recurrence of the 82 patients with a follow-up >5 years reached 4.9 (95% confidence interval, 1.4–17.6). In the subgroup of 35 young early stage patients (<55 years) the odds ratio increased to 7.9 (95% confidence interval, 1.4–45.8).
Life table analysis of egfr AI. Metastasis-free survival of breast cancer patients dichotomized by the occurrence of AI in the egfr CA SSR I. The differences between the life table curves were tested by log-rank test.
Discussion
In a multistep tumor progression model of human breast cancer the first events at the molecular level are still unknown. Genomic instability leading to a higher mutation rate accounts for only a small proportion of breast cancers. Natural selection acting to increase the frequency of advantageous alleles in breast lobular cells can be the driving force behind tumor growth. Here, we give evidence that the segregation of alleles associated with a high transcriptional activity of the egfr gene may represent early events that promote progression from normal breast lobular cells to invasive and metastatic breast cancer cells.
Amplifications of the entire egfr gene are a frequent event in high malignant tumors of the central nervous system, and EGFR overexpression has been shown on the cytogenetic level in renal carcinoma (23, 24, 25) . However, gene dosage studies show very little evidence for egfr gene rearrangements or amplifications (26) . Against this background, the overexpression of EGFR was regarded mainly as a result of disturbed transcriptional mechanisms. We have shown previously that low-level gene dosage deviations at the boundary of exon 1 and intron 1 of the egfr gene occur frequently and identify patients with poor clinical outcome (14 , 17) . The deviations were not detectable on a cytogenetic level, or at most they were represented by low-level chromosomal gains in comparative genomic hybridization (CGH) (12) .
In a recent study, we gave evidence for the first time that AIs affecting regulating sequences of egfr transcription in the intron 1 of egfr were correlated with EGFR expression in the tumor (12) .
The data presented in this work clearly show that these genetic alterations involve the first 1200 bases of intron 1 of egfr. Although the frequency of AI on 7p differed substantially, a distinct peak in the CA-SSR I region as shown by microsatellite analysis using 16 additional markers was observed.
To exclude that this AI represents only a surrogate marker for the well-defined exon 2/exon 7 deletion mutation resulting in the constitutive active EGFR isoform vIII (6) , we performed RT-PCR analysis. Using a nested approach by applying primers for the cDNA of exon 1 to exon 8, as well as from exon 7 to exon 8, mutant egfr mRNAs could not be detected in tissues harboring a CA-SSR I AI. The integrity of mRNA was approved by β-actin cDNA amplification. From a subgroup of tumors, Western blotting was also performed using a COOH-terminal binding antibody and an exon 2 to exon 7 sequence-specific antibody as a control. In agreement with the RT-PCR results, no truncated receptor proteins were detected.
From a comparison study with microsatellite PCR and CGH analysis we know that AI at distinct polymorphic loci can also represent amplifications (27) . Therefore, we developed a quantitative real-time PCR assay for gene dosage measurements within the six CA-SSRs in intron 1 of egfr. Interestingly, an increase in CA-SSR I gene dosages could be detected in cases presenting with an AI score <0.56 and >1.3. The sensitivity of the method did not allow us to assess a gene amplification to all of the cases presenting with an AI score significantly different from diploids at the egfr locus, but in no case could a decreased gene dosage be measured. We interpret these data as a first suggestion that distinct amplifications of egfr occur in a substantial subset of invasive breast cancer cases. A basal gene dosage-dependent regulation might be assumed also for egfr as already established for c-erbB-2 (3) .
Searching for published data that could explain an increase of probability for the formation of an amplification at that locus, we found that fragile sites were suggested at 7p11.2 and 7p14.2 (21) , which have not been characterized in detail, thus far. Therefore, based on the knowledge that the induction of fragile sites triggers the formation of amplification we investigated chromosome 7p in normal lymphocyte metaphase spreads after drug treatment by cytogenetics and FISH analysis using an egfr-specific probe. In fact, a single-strand break could be detected in 30% of the metaphase spreads, and the FISH analysis confirmed the location near the egfr gene region (Fig. 4) ⇓ . The high frequency of this event suggests that this is a common fragile site. We are investigating currently the exact structure of the strand break at this fragile site to substantiate our working hypothesis that it is the initial event in the formation of egfr amplifications e.g., by breakage-fusion-bridge cycles (22) . Of note, AI at CA-SSR I could be detected in morphologically normal breast tissue in three quarters of the patients harboring this AI at the corresponding invasive carcinoma. This was not unexpected because of the fact that AIs at normal breast lobules in women with or without synchronous breast cancer (28) have already been described. Therefore, we conclude that this genetic alteration might be a primary event in breast cancer carcinogenesis.
Furthermore, 20% of nontumorous hyperplasias and metaplasias, and 80% of the atypical ductal hyperplasias presented with an AI of the CA-SSR I. To our knowledge no other genomic alteration leading to gene deregulation has thus far been reported with such a high frequency in nontumorous breast tissue. The segregation of the AI was additionally detected in all of the cases of associated DCIS and also in all of the lymph node metastases.
For squamous cell carcinoma of the lower jaw, we could already demonstrate that egfr and c-erbB-2 alterations in morphologically normal mouth tissue have predictive value concerning the recurrence rate (29) . Recently, a high-risk marker of local recurrence in early stage breast cancer patients has been narrowed to a region at chromosome 3p24.3 (30) . Therefore, it could be hypothesized that the AI at CA-SSR I of egfr is a genetic marker, which is also important for breast cancer prognosis. Our retrospective cohort study using the CA-SSR I AI at the primary tumor as a prognostic marker demonstrated an increased risk of recurrence for patients harboring the AI. This leads to the assumption that tumors with a CA-SSR I AI present with a distinct phenotype, which is at least determined genetically in the early steps of carcinogenesis.
However, an overexpression of EGFR was not observed in all cases of CA-SSR I AI, thereby indicating only basal impact of such a mutation in an intronic regulatory sequence of the gene. Factors with repressor function may counteract the enhancement of egfr transcription activity by the amplifications.
In consequence a better understanding of this phenotype might lead to the discovery of a target for new therapeutic or even preventive approaches especially counteracting egfr expression and signaling.
Acknowledgments
We thank Katrin Blaschei and Ute Hinz for outstanding technical assistance. We also thank Dr. Rita Exeler, Institute for Human Genetics, Muenster, Germany, for helpful suggestions and experimental advices.
Footnotes
-
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
-
↵1 Supported by grants from the Deutsche Krebshilfe Bü-1068-1, 10-1660-Br 4, the Federal Ministry of Education and Research (Fö01KS9604/0), and the Interdisciplinary Center of Clinical Research (IZKF Project No. F3-H1).
-
↵2 To whom requests for reprints should be addressed, at Institute of Clinical Chemistry and Laboratory Medicine, Westfälische Wilhelmsuniversität Münster, Albert-Schweitzer-Strasse 33, 48149 Münster, Germany. Phone: 49-251-8347226; Fax: 49-251-8347226; E-mail: brandt{at}uni-muenster.de
-
↵3 The abbreviations used are: egfr, epidermal growth factor receptor; TGF, transforming growth factor; CA-SSR, CA simple sequence repeat; FAM, carboxyfluorescein; AI, allelic imbalance; dc, critical difference; RT-PCR, reverse transcription-PCR; SOD2, manganese superoxide dismutase; HBB, β-globin; DCIS, ductal carcinoma in situ; FISH, fluorescence in situ hybridization.
-
↵4 Internet address: http://www.ncbi.nlm.nih.gov/Sitemap/index.html#BLAST.
- Received July 18, 2002.
- Accepted January 27, 2003.
- ©2003 American Association for Cancer Research.