Abstract
PLAG1, a novel developmentally regulated C2H2 zinc finger gene, is consistently rearranged and overexpressed in pleomorphic adenomas of the salivary glands with 8q12 translocations. In this report, we show that PLAG1 is a nuclear protein that binds DNA in a specific manner. The consensus PLAG1 binding site is a bipartite element containing a core sequence, GRGGC, and a G-cluster, RGGK, separated by seven random nucleotides. DNA binding is mediated mainly via three of the seven zinc fingers, with fingers 6 and 7 interacting with the core and finger 3 with the G-cluster. In transient transactivation assays, PLAG1 specifically activates transcription from its consensus DNA binding site, indicating that PLAG1 is a genuine transcription factor. Potential PLAG1 binding sites were found in the promoter 3 of the human insulin-like growth factor II (IGF-II) gene. We show that PLAG1 binds IGF-II promoter 3 and stimulates its activity. Moreover, IGF-II transcripts derived from the P3 promoter are highly expressed in salivary gland adenomas overexpressing PLAG1. In contrast, they are not detectable in adenomas without abnormal PLAG1 expression nor in normal salivary gland tissue. This indicates a perfect correlation between PLAG1 and IGF-II expression. All of these results strongly suggest that IGF-II is one of the PLAG1 target genes, providing us with the first clue for understanding the role of PLAG1 in salivary gland tumor development.
INTRODUCTION
Activation of the PLAG1 3 gene on chromosome 8q12 is the most frequent gain-of-function mutation found in pleomorphic adenomas of the salivary glands (1 , 2) . This mainly results from recurrent chromosomal translocations that lead to promoter substitution between PLAG1, a gene mainly expressed in fetal tissue, and more broadly expressed genes. The three translocation partners characterized thus far are the β-catenin gene on 3p21 found in the most common translocation, the t(3;8)(p21;q12), the leukemia inhibitory factor receptor gene on 5p13 found in a recurrent (5;8)(p13;q12) translocation (2) , and the elongation factor SII gene (3) . Breakpoints invariably occur in the 5′ noncoding part of the PLAG1 gene, leading to an exchange of the regulatory control elements while preserving PLAG1 coding sequence. The replacement of the PLAG1 promoter, inactive in adult salivary glands, by a strong promoter derived from the translocation partner, leads to ectopic expression of PLAG1 in the tumoral cells. This abnormal PLAG1 expression presumably results in a deregulation of PLAG1 target genes, causing salivary gland tumorigenesis.
The PLAG1 protein contains seven canonical C2H2 zinc finger domains and a serine-rich COOH terminus that exhibits transactivation capacities when fused to the Gal4 DNA binding domain (4) , suggesting that it may act as a transcriptional regulator.
To extend our knowledge on the function of the PLAG1 gene and the mechanisms by which it causes salivary gland adenomas, we decided to further investigate functional characteristics of PLAG1 and in particular its potential transcriptional role. We determined in which subcellular compartment PLAG1 exerts its function by immunofluorescence studies; determined whether PLAG1 could bind DNA in a sequence-specific manner and identified its consensus DNA binding site by performing CASTing experiments. The zinc fingers required for sequence-specific DNA binding were determined by deletion/mutation analysis; and we used the PLAG1 consensus was used to screen the eukaryotic promoter databank. Possible target genes were studied regarding the capacity of PLAG1 to bind and activate their promoter. Finally, we analyzed the expression of such target genes in normal salivary gland tissue and in pleomorphic adenomas with or without PLAG1 overexpression to determine whether these genes could be PLAG1 targets in salivary gland adenomas.
MATERIALS AND METHODS
Construction and Production of GST-PLAG1 Fusion Proteins.
The PLAG1 NH2-terminal region (N2-C244) as well as parts of it (N84-C244; N101-C244; N159-C244) were fused in-frame to GST by inserting in pGEX-5X-2 (Pharmacia) the DNA fragments obtained by the PCR with full-length PLAG1 cDNA as template. The NH2-terminal oligonucleotides used to generate the various constructs were: G8N2, 5′-CCCGAATTCTGGCCACTGTCATTCCTGGT-3′; G8N84, 5′-CCCGAATTCTGGCTACTCATTCTCCTGAGA-3′; G8N101, 5′-CCCGAATTCTGTTTCACCGGAAAGATCATC-3′; G8N159, 5′-CCCGAATTCCTTTTGAAAGCACGGGAGTG-3′; and as COOH-terminal oligonucleotide G8C244, 5′-GGGCTCGAGCTATTTGACCTTCAGAAGCTCTTGA-3′. Pfu-amplified fragments were gel purified, digested with EcoRI and XhoI, and ligated in the EcoRI-XhoI-digested pGEX-5X-2 vector. The construct GST-PLAG1 (N2-C203) was obtained by digesting GST-PLAG1 (N2-C244) by DraIII and XhoI, blunt ending, and recircularization. All of the fusion proteins were expressed in Escherichia coli BL-21 cells and purified on Glutathione Sepharose 4B (Pharmacia) according to the manufacturer’s protocol. The protein sizes were estimated by SDS-PAGE, followed by Coomassie blue staining; concentrations were determined by comparison to a well-defined concentration marker.
CASTing.
To prepare a pool of random double-strand oligomers for the first round of CASTing, 400 pmol of CAST25 (5′-CTGTCGGAATTCGCTGACGT-(N)25-CGTCTTATCGGATCCTACGT-3′) were incubated in 100 μl of polymerase reaction buffer containing 1200 pmol of CAST-LOW (5′-ACGTAGGATCCGATAAGACG-3′), 200 μm of each deoxynucleoside triphosphate, 2.5 μl [α-32P]dCTP (DuPont NEN), and 10 units of Amplitaq (Perkin-Elmer-Cetus) and treated as follows: 5 min at 94°C, 20 min at 65°C, and 20 min at 72°C. Fifty μl were incubated with 500 ng of GST-PLAG1 (N2-C244) bound to Glutathione Sepharose beads, 50 μg of poly polydeoscyinosinic-deoscycytidylic acid (Sigma), and 50 μg of BSA in 500 μl of binding buffer [10 mm Tris (pH 7.5), 200 mm NaCl, 50 μm ZnCl2, 10% glycerol, 1 mm MgCl2, and 1 mm DTT]. After a 30-min incubation on a rotator at room temperature the beads were washed four times with cold binding buffer, and the radioactivity still present on the beads was counted to monitor the level of enrichment in each step. The oligonucleotides were eluted from the beads by resuspending in 100 μl of water, followed by phenol extraction and ethanol precipitation. An aliquot was used for the subsequent amplification reaction in 100 μl of polymerase reaction buffer containing 200 pmol of each amplimer CAST-UP (5′-CTGTCGGAATTCGCTGACG-3′) and CAST-LOW, 200 μm deoxynucleotide triphosphates, and 2.5 units of Amplitaq (Perkin-Elmer Cetus) with 1 μl of [α-32P]-dCTP (DuPont NEN) with 25 cycles of 1 min at 94°C, 1 min at 65°C, and 1 min at 72°C. The amplified products were subsequently used for a second round of selection performed as described above. After four rounds of selection, the subsequent three steps of selection were performed by EMSA with 100 ng of eluted GST-PLAG1 (N2-C244). After X-ray exposure of the dried gel, the shifted bands were cut out of the gel, and the double-stranded DNA was eluted 3 h at 50°C in 200 μl of polymerase reaction buffer. An aliquot of the eluate was used for amplification. After a total of seven amplification cycles, the oligonucleotides were cloned into the pGEM-T Easy vector according to the manufacturer’s protocol (Promega), and 23 independent clones were sequenced.
EMSA.
The different probes were synthesized as complementary oligonucleotides with 4-bp sticky ends, annealed, subsequently end-labeled with[α -32P]dCTP and Klenow enzyme, and finally purified with the QIAquick Nucleotide removal kit (Qiagen). DNA-protein binding reactions were carried out for 10 min at room temperature in 30 μl of EMSA binding buffer [10 mm Tris (pH 7.5), 100 mm NaCl, 50 μm ZnCl2, 10% glycerol, 1 mm MgCl2, 1 mm DTT, 1μ g polydeoxyadenylic acid-polythymidylic acid (Sigma), 1 μg of salmon sperm DNA, and 300 ng of BSA] with 10,000 dpm of probes (about 0.1 ng of oligonucleotides) and an equimolar amount of proteins. DNA-protein complexes were analyzed on nondenaturing polyacrylamide gels [6% acryl-bisacrylamide (19:1), 0.5× TBE (1× TBE = 89 mm Tris base, 89 mm boric acid, 2 mm EDTA pH 8.0) and 5% glycerol]. Electrophoresis was performed at 4°C at 14 V/cm.
Plasmid Constructions.
The PLAG1 expression vector pCDNA3-PLAG1 was constructed by inserting into EcoRI-XhoI-digested pCDNA3 (Invitrogen) the complete open reading frame of PLAG1 preceded by its own Kozak consensus translation start site. This fragment was generated by PCR using Pfu polymerase (Stratagene) with the 5′ primer G8N-3 (5′-CCCGAATTCTAGGCTGCGATGGCCACTGT-3′) and the 3′ primer G8C500 (5′-GGGCTCGAGCTACTGAAAAGCTTGATGGAAAC-3′). The same blunt-ended fragment was also cloned in the blunt-ended EcoRI site of the pCAGGS vector (5) to get a second expression construct for PLAG1 (pCAGGS-PLAG1) with higher levels of expression in transfected cells. The three mutant PLAG1 proteins (PLAG1-F2mut, PLAG1-F3mut, and PLAG1-F7mut) were produced by replacing in pCDNA3-PLAG1 the first histidine in the C2H2 motif of the corresponding zinc finger (His81, His110, and His231, respectively) with an alanine. For this, we applied the QuickChange Site-directed Mutagenesis kit (Stratagene) according to the instructions of the supplier. All constructs were sequenced to confirm the fidelity of the PCR and the site-specific mutagenesis. The full-length PLAG1 protein as well as the three mutants were expressed by in vitro transcription and translation using the TnT kit (Promega). Quality of translation was monitored by SDS gel analysis of[ 35S]Met-labeled proteins. For the EMSA experiments, 3μ l of translation reaction products were used per lane.
The PDGF-B (−389/+22) luciferase reporter construct has been made by digesting the vector pA0166 (6) by SstI and HindIII and inserting this fragment in SstI/HindIII-digested pGL2basic (Promega). The GOS24 (−384/+27) luciferase reporter construct has been obtained by cloning in PGL2basic (Promega) the fragment obtained by PCR amplification on genomic DNA using the primers 5′-GGCGAGCTCTCCCCGCCCCCATCCGTCT-3′ and 5′-CCGCTCGAGAGTGGGAGCGCTGAAGTC-3′ derived from the sequence retrieved from Genbank (accession number M92844). The IGF-II-P3 (−1229/+140) luciferase reporter construct [called Hup3 (7)] is a generous gift of Dr. P. E. Holthuizen (1. Universiteit, Utrecht, the Netherlands) and has been obtained by cloning the IGF-II promoter 3 into the pSLA3 luciferase vector (8) . The c-Ha-Ras (−325 to +58 in the first intron) luciferase reporter construct was obtained by inserting the 384-bp fragment XmaIII blunt-ended/SacI deriving from the pbc-N1 vector ( Ref. 9 ; ATCC) into the pGL2basic with a splice acceptor site added. The prohormone convertase 2 (−789/+137) luciferase reporter construct (10) and the somatostatin (−192/+50) luciferase reporter construct [called pSRIF-Luc (11)] are generous gifts of Dr. E. Jansen (University of Leuven and Flanders Interuniversity Institute for Biotechnology, Belgium) and Dr. B. Peers (University of Liege, Belgium), respectively. (WT)6-TK-luc, (mCO)6-TK-luc, (mCLU)6-TK-luc, and mCLUmCO)6-TK-luc have been obtained by inserting into pTK81luc (12) six copies of the corresponding ds oligonucleotides: WT, 5′-CTAGAAGGGGCTCTAGAAAGGGTAA-3′; mCO, 5′-CTAGAATGCACTCTAGAAAGGGTAA-3′; mCLU, 5′-CTAGAAGGGGCTCTAGAAA-TACTAA-3′; and mCOmCLU, 5′-CTAGAATGCACTCTAGAAATACTAA-3′.
Transfections and Luciferase Assay.
The human fetal kidney epithelial cell line 293 (ATCC; CRL 1573) was cultured according to the suppliers’ protocols. Cells (6-well plates) were transiently cotransfected in triplicate with 200 ng of the expression vector DNA, 200 ng of reporter plasmid, and 200 ng of internal control Rous sarcoma virus β-galactosidase DNA using 3 μl of FuGENE 6 Transfection Reagent (Boehringer Mannheim) according to the manufacturer’s protocol. Cells were harvested 40 h after the transfection, and luciferase reporter enzyme activity was measured using a Monolight 2010 luminometer (Analytical Luminiscence Laboratory) and performing end point assays.
Preparation of RNA and Northern Blot Analysis.
Total RNA was extracted from primary tumors and analyzed by Northern blot analysis as described previously (2) . The human IGF-II exon 9 probe, common to the four different transcripts P1, P2, P3, and P4, was generated by PCR and contained nucleotides 7970 to 8774 of the published gene sequence ( Ref. 13 ; GenBank/EMBL, accession number X03562). A human IGF-II exon 5 probe specific for the P3 transcript was a kind gift of Dr. P. E. Holthuizen. The human GOS24 probe was generated by PCR and contained nucleotides 1828 to 2345 of the sequence submitted to GenBank (accession number M92844). The human c-Ha-Ras probe was generated by isolation of 1.2-kb HindIII coding fragments from RSV-ras(leu61) vector (14) . The human PDGF-B probe was generated by purification of the 1.7-kb BamHI fragment isolated from the pAO73 plasmid (15) .
Immunofluorescence Analyses.
COS-1 kidney fibroblast cells (ATCC, CRL1650) were grown on glass chamber slides (Nunc) and transfected with 1 μg of pCAGGS-PLAG1 and 1μ g of PM3 plasmid, which encodes the Gal4 DNA binding domain (amino acids 1–147; Ref. 16 ). Twenty-four h after transfection, the cells were washed twice with PBS and fixed with 4% paraformaldehyde in PBS for 30 min at room temperature, followed by two washes with PBS. Slides were then incubated in PBS-BT [PBS/0.5% blocking reagent (Boehringer Mannheim)/0.2% Triton X-100] for 30 min at room temperature. To simultaneously detect the PLAG1 and Gal4 proteins, cotransfected cells were incubated at room temperature for 1 h in PBS-BT containing the rabbit polyclonal anti-PLAG1 together with the mouse monoclonal anti-GAL4 (SC-510; Santa-Cruz). The polyclonal anti-PLAG1 was obtained by immunizing rabbits with the peptide FSSTSYAISIPEKEQPL (amino acids 336–352 in PLAG1) and the specificity of the antibody was verified by Western blot analysis (data not shown). After three washes with PBS-T (PBS/0.2% Triton X-100), the slides were incubated in PBS-BT with FITC-labeled swine anti-rabbit (DAKO, F0205) and Texas red-labeled sheep antimouse (Amersham, N2031). This allowed simultaneous visualization of the PLAG1 (FITC, green) and GAL4 (Texas red) proteins. After three washes in PBS-T, slides were mounted in Citifluor containing 0.5 μg/μl of 4′,6-diamidino-2-phenylindole (DAPI) and analyzed with a Zeiss Axiophot microscope equipped with UV optics. Images were recorded with a CE200A CCD camera (Photometrics), using Smart capture (Digital Scientific) and Iplab Spectrum (Signal Analytics) software.
RESULTS
PLAG1 Is Localized in the Nucleus.
The presence of seven canonical C2H2 zinc fingers and the transactivation capacities of the COOH-terminal domain suggest that PLAG1 may act as a transcriptional regulator. A prerequisite for such a role is that the protein be localized, at least in some conditions, in the nucleus. To determine the subcellular localization of PLAG1, we have made an eukaryotic expression construct (pCAGGS-PLAG1) directing the synthesis of a full-length PLAG1 protein in transfected cells. Immunofluorescence staining of transfected COS-1 cells indicates that PLAG1 protein is confined to the nucleus as demonstrated by the green staining observed with the PLAG1-specific antibody (Fig. 1A) ⇓ , which perfectly coincides with the DAPI nuclear DNA staining (Fig. 1B) ⇓ . The PLAG1 protein also colocalizes with the well-characterized nuclear protein Gal4 (Fig. 1, C and D) ⇓ . Nuclear localization of the endogenous PLAG1 protein was also established by immunofluorescence analysis of the fetal kidney 293 cell line and independently confirmed by Western blot analysis of 293 nuclear extracts (data not shown).
Nuclear localization of the PLAG1 protein. Immunofluorescence of Cos-1 cells cotransfected with the PLAG1 expression vector construct, pCAGGS-PLAG1 and with the pM3 expression vector expressing the DNA binding domain of Gal4 (16). The cells were costained with the PLAG1 antibody, 4′,6-diamidino-2-phenylindole and the GAL4 antibody (see “Materials and Methods”). This allows the visualization in the same cells of PLAG1 (A), nuclear DNA (B), GAL4 (C), or all three (D).
The PLAG1 Binding Site Is Composed of Two Essential Parts, a GRGGC Core and a G-Cluster.
The presence of a zinc finger domain in PLAG1 as well as its nuclear localization suggests that PLAG1 is a DNA binding protein. To test this hypothesis and to identify the putative PLAG1 binding consensus sequence, CASTing was performed as described in “Materials and Methods.” The protein used for this study was a chimeric protein containing the complete zinc finger domain (amino acids 2–244) fused in-frame to the GST. The fusion protein GST-PLAG1 (N2-C244) was immobilized on Glutathione-Sepharose beads and incubated with a pool of oligonucleotides containing a central region of 25 random nucleotides. The pool of oligonucleotides selected by seven rounds of PLAG1 binding was cloned, and 23 independent clones were sequenced. The alignment of all the sequences with the program Macaw (National Center for Biotechnology Information) clearly revealed a consensus sequence composed of a core GRGGC followed 6–8 nucleotides further by a cluster of at least three guanidines (Fig. 2) ⇓ .
Determination of the PLAG1 binding site. Alignment of the 23 oligonucleotides selected by seven cycles of CASTing using GST-PLAG1 (N2-C244). The frequency of each of the bases at each position is shown at the bottom of the figure, and N represents the number of oligonucleotides that carried a base at that position.
To assess the importance of the two motifs in the consensus, we performed EMSAs on a double-strand probe containing these two motifs and on four probes presenting mutations in the consensus (Fig. 3A) ⇓ . We found that the fusion protein GST-PLAG1 (N2-C244) binds strongly to the consensus (Fig. 3B ⇓ , Lane 1), whereas mutations in the G-cluster (Fig. 3B ⇓ , Lane 2) reduce drastically the binding (∼8-fold) and mutations in the Core nearly completely abolish it (∼37-fold reduction; Fig. 3B ⇓ , Lane 3) as well as mutations in both (Fig. 3B ⇓ , Lane 4). The distance between the G-cluster and the core is also important because PLAG1 binds weakly to the probe containing the G-cluster separated by 2 bp instead of 7 bp from the core (WT2ml; Fig. 3B ⇓ , Lane 5). These results indicate that both motifs in the consensus are important for the binding of PLAG1, with an importance more pronounced for the core compared with the G-cluster.
The G-cluster is recognized by finger 3 and the core by fingers 6 and 7 of PLAG1. A, nucleotide sequences of the different oligonucleotides used in EMSA analysis. Mutations are underlined. B, EMSAs performed with equimolar amount (0.6 pmol) of bacterially expressed GST-PLAG1 proteins. GST-PLAG1 (N2-C244) (Lanes 1–5), (N84-C244) (Lanes 6–10), (N101-C244) (Lanes 11–15), (N159-C244) (Lanes 16–20) and (N2-C203) (Lanes 21–25) were incubated with the probes WT2 (Lanes 1, 6, 11, 16, and 21), mCLU2 (Lanes 2, 7, 12, 17, and 22), mCO2 (Lanes 3, 8, 13, 18, and 23), mCLUmCO2 (Lanes 4, 9, 14, 19, and 24), and WT2ml (Lanes 5, 10, 15, 20, and 25) as described in“ Materials and Methods”. The percentage of binding of all these mutants on the different probes were compared with the binding of F1–F7 to the probe WT2 and is the mean of at least six experiments. C, competition experiments performed on the probe WT2 in presence of increasing amounts of unlabeled double-strand oligonucleotides (10, 30, 100, 300, and 1000 ng) using recombinant full-length PLAG1 expressed in vitro in reticulocytes lysates; D, EMSAs performed with recombinant PLAG1 proteins produced in vitro in reticulocytes lysates. Wild-type PLAG1 (Lanes 1–5), F2mut (Lanes 6–10), F3mut (Lanes 11–15), and F7mut (Lanes 16–20) were incubated with the probes WT2 (Lanes 1, 6, 11, and 16), mCLU2 (Lanes 2, 7, 12, and 17), mCO2 (Lanes 3, 8, 13, and 18), mCLUmCO2 (Lanes 4, 9, 14, and 19), and WT2ml (Lanes 5, 10, 15, and 20) as described in “Materials and Methods”. Equal efficiency of protein expression was obtained for the different constructs as demonstrated by SDS-PAGE of proteins labeled with [35S]methionine (data not shown). The percentages of binding of all of these mutants on the different probes were compared to the binding of the wild-type PLAG1 to the probe WT2 and is the mean of at least three experiments.
The relative importance of the two motifs was also tested in a series of competition EMSAs using WT2 as probe. As shown in Fig. 3C ⇓ , WT2 competes much more efficiently than mCLU2 or mCO2, confirming the importance of the two motifs in the consensus. mCLU2 competes poorly but nevertheless better than mCO2, confirming by this way that the destruction of the G-cluster is not so deleterious for the binding as the destruction of the core. As expected, no competition could be observed, with mCOmClu2 presenting a mutation in both motifs.
Fingers 6 and 7 of PLAG1 Bind to the Core, Whereas Finger 3 Interacts with the G-Cluster.
To determine which of the zinc fingers contributes to the binding to the consensus sequence, we examined the binding of bacterially expressed GST fusion proteins containing different combinations of zinc fingers (Fig. 3B ⇓ , Lanes 6–25). A similar binding pattern is observed with the protein containing fingers 3 to 7 compared with the protein F1–F7, which contains the complete zinc finger region (compare Lanes 6–10 with Lanes 1–5), suggesting that fingers 1 and 2 are not directly involved in the binding. In contrast, the protein containing fingers 4–7 binds 3-fold less to the WT2 probe (Fig. 3B ⇓ , Lane 11). More importantly, protein F4-F7 binds nearly as well to the probe, presenting a mutation in the G-cluster (mCLU2; Fig. 3B ⇓ , Lane 12) as to the WT2 probe, indicating that the G-cluster is not important for the binding of F4–F7. This is a good indication for the requirement of finger 3 for the interaction with this cluster. The F6–F7 protein binds nearly with the same affinity as F4–F7 to all of the different probes (Fig. 3B ⇓ , compare Lanes 16–20 to Lanes 11–15), suggesting that fingers 4 and 5 are not directly involved in the binding. This F6–F7 protein still binds clearly to all of the probes containing an intact core motif (Fig. 3B ⇓ , Lanes 16, 17, and 20), indicating that fingers 6 and 7 are sufficient for the interaction with the core. Finally, the protein F1–F5 does not show any clear binding (Fig. 3B ⇓ , see Lanes 21–25), indicating the absolute requirement of the two last fingers F6 and F7. All of these results suggest that finger 3 of PLAG1 interacts with the G-cluster, whereas the core is recognized by fingers 6 and 7.
To confirm this model, we performed additional EMSAs using full-length PLAG1 protein translated in vitro in reticulocyte lysates instead of the GST fusion proteins expressed in bacteria. Three mutant PLAG1 proteins (F2mut, F3mut, and F7mut) were also produced by replacing with an alanine the first histidine of the corresponding C2H2 motif. This mutation hinders the coordination of the zinc and has been shown to prevent the formation of a functional zinc finger (17) . The full-length PLAG1 protein expressed in reticulocyte lysate binds with the same specificity as the bacterial F1–F7 protein (compare Fig. 3D ⇓ with 3B, Lanes 1–5). The destruction of the zinc finger 2 (PLAG1/F2mut) does not affect the binding specificity of this protein but decreases its affinity ∼3-fold (Lanes 6–10). In contrast, the finger 3 destruction decreases drastically the affinity of this protein as F3mut binds 17-fold less to the WT2 probe than the natural PLAG1 protein (compare Lane 11 with Lane 1). The specificity was also completely modified since F3mut binds equally well to WT2 (Lane 11), mCLU2 (Lane 12), and WT2ml (Lane 15). Thus, the presence or absence of a G-cluster does not affect the binding of F3mut, indicating that finger 3 is actually the finger required for the interaction with this motif. This conclusion is in agreement with the conclusions drawn from the EMSA experiments performed with the bacterial protein. Finally, destruction of finger 7 completely prevents any binding (Lanes 16–20), confirming the absolute requirement of this finger.
PLAG1 Can Stimulate Transcription Through Its Consensus Binding Site.
To investigate whether PLAG1 binding sites could mediate a transcriptional activation by PLAG1, six copies of the minimal consensus (WT) were cloned upstream the herpes simplex virus thymidine kinase promoter, followed by the luciferase reporter gene. This reporter construct was then transfected into the fetal kidney 293 cell line in the presence or absence of the expression vector pCAGGS-PLAG1.
Table 1 ⇓ shows that PLAG1 stimulates expression of this reporter construct∼ 19-fold. This stimulation is completely abolished by mutations in the G-cluster or in the core since no stimulation was detectable with six copies of mCLU or mCO. This demonstrates that the activation is completely dependent of the presence of the two motifs.
PLAG1 can stimulate transcription through its consensus binding site
Two hundred ng of the indicated reporter luciferase (luc) construct were cotransfected into the fetal kidney cell line 293 together with 200 ng of the expression vector pCAGGS-PLAG1 or the empty vector pCAGGS. PLAG1 induction levels are expressed as the ratio of luciferase activity obtained in the cell transfected with pCAGGS-PLAG1 expression vector versus the activity obtained in cells transfected with the empty vector pCAGGS. The data are means ± SE of at least two independent transfection experiments, each performed in triplicate.
PLAG1 Binds IGF-II Promoter 3 and Up-Regulates Its Promoter Activity.
A computer search in the EPD (18) with the PLAG1 binding consensus GRGGC(N)7 RGGK revealed potential PLAG1 binding sites in the promoter region of many genes (about 176 of the 1280 screened). Table 2 ⇓ presents the list of all of the human promoters present in the EPD containing at least two potential binding sites for PLAG1. This list includes several proto-oncogenes and growth factors. Four promoters, i.e., IGF-II promoter 3, GOS24, c-Ha-ras, and PDGF-B, have been selected for further investigation. By transient transfection experiments, we next investigated whether these promoters are responsive to PLAG1. To that end, we generated four reporter plasmids where each promoter has been cloned independently in front of the luciferase gene. Table 2 ⇓ shows that the activity of the promoter 3 of IGF-II was highly stimulated by cotransfection of a PLAG1 expression vector (8-fold). In contrast, a weaker stimulation was obtained on the GOS24 promoter, and the promoters c-Ha-Ras and PDGF-B did not respond significantly to PLAG1. As expected, no stimulation was observed of promoters that do not contain any PLAG1 binding sites. We thus focused our attention on IGF-II and investigated whether we could really observe a binding of PLAG1 to the putative PLAG1 binding sites in the promoter 3 of IGF-II. For that, EMSAs were performed on a probe located from −192 to −172 in the P3 promoter, which contains one putative PLAG1 binding sites. This element called P3-4 has been shown to be essential for P3 activity (19) . As shown in Fig. 4A ⇓ , a clear binding of PLAG1 is seen on the probe P3-4. The affinity of PLAG1 for this site is comparable with the one for WT2 because approximately the same level of competition is obtained with P3-4 and WT2 in EMSA competition experiments (Fig. 4B) ⇓ .
PLAG1 binds effectively to the P3 promoter of IGF-II. A, EMSAs performed with wild-type PLAG1 protein produced in vitro in reticulocyte lysates and incubated with the probes WT2 and P3-4. B, competition experiments performed on the probe WT2 in the presence of increasing amounts of unlabeled double-strand oligonucleotides (10, 30, 100, 300, and 1000 ng) using wild-type PLAG1 expressed in reticulocyte lysates.
Identification of human promoters with potential PLAG1 binding sites and determination of their capacity to be induced by PLAG1
The EPD (18) has been screened for the presence of the PLAG1 consensus GRGGC(N)7 RGGK with the pattern-matching algorithm implemented in the program findpatterns of the GCG software package (32). The list includes all of the human promoters present in the EPD containing at least two PLAG1 DNA binding consensus in their promoter. The ability of each promoter to be induced by PLAG1 has been estimated by cotransfection of the fetal kidney 293 cell line with pCAGGS-PLAG1 or pCAGGS expression vectors, together with reporter constructs in which each promoter has been cloned in front of a luciferase gene (see“ Materials and Methods”). PLAG1 induction levels are expressed as the ratio of luciferase activity obtained in the cell transfected with pCAGGS-PLAG1 expression vector versus the activity obtained in cells transfected with the empty vector pCAGGS. The data are means ± SE of at least two independent transfection experiments, each performed in triplicate.
IGF-II Transcript Is Up-Regulated in Salivary Gland Tumors with PLAG1 Overexpression.
The fact that PLAG1 is able to bind and activate the promoter 3 of IGF-II suggested that IGF-II could be a target for PLAG1 in salivary gland tumors. To test this hypothesis, we analyzed the expression of these two genes in specimens of primary salivary gland tumors and of normal glands. As described previously (1) , PLAG1 expression could not be detected by Northern blot analysis in normal salivary gland tissue (Fig. 5 ⇓ ,Lanes 1, 5, and 8). In contrast, PLAG1 expression is readily detectable in tumors carrying a chromosomal translocation affecting the PLAG1 gene (Fig. 5 ⇓ , Lanes 2, 3, 6, and 7). This overexpression is the result of promoter substitution between PLAG1 and ubiquitously expressed genes such as those coding for β-catenin (cg644 and cg682) or the leukemia inhibitory factor receptor (c895 and c904; Refs. 1 and 2 ). The two tumors carrying chromosome aberrations outside the PLAG1 locus (cg650 and cg601; Refs. 20 and 21 ) do not show any PLAG1 expression (Fig. 5 ⇓ , Lane 4 and Fig. 5B ⇓ ). We next investigated IGF-II expression in these tumors using a probe specific for exon 9 of IGF-II, which is a common part of all the IGF-II transcripts. No IGF-II expression could be detected in normal salivary gland tissues (Fig. 5 ⇓ , Lanes 1, 5, and 8) and in tumors without PLAG1 overexpression (lane 4 and cg601 in Fig. 5B ⇓ ). In contrast, a drastic up-regulation of IGF-II was observed in tumors with PLAG1 overexpression (Fig. 5 ⇓ , Lanes 2, 3, 6, and 7). The detected 6-kb IGF-II transcript corresponds to the transcript deriving from the P3 promoter, because hybridization with a probe specific for the P3 transcript (exon 5 probe) detects exactly the same band (data not shown). Thus, we have a perfect correlation between PLAG1 and IGF-II gene expression, suggesting that IGF-II is one of the PLAG1 targets. In contrast, no significant expression of c-Ha-Ras and PDGF-B could be detected in tumors and in normal salivary gland tissue, whereas no alteration in GOS24 expression was observed (Fig. 5B) ⇓ .
IGF-II P3 transcript is up-regulated in tumors with PLAG1 overexpression. A, Northern blot analysis of normal salivary gland (n.s.g.) tissues and pleomorphic adenomas hybridized sequentially with a 3.7-kb PLAG1 cDNA probe, an IGF-II exon 9 probe, and a 2-kb β-actin probe. RNAs tested included samples from three different normal salivary gland tissue specimens (Lanes 1, 5, and 8) and from adenomas c895 (Lane 2), c904 (Lane 3), cg650 (Lane 4), cg644 (Lane 6), and cg580 (Lane 7). B, recapitulation of the Northern blot analysis of normal salivary gland tissues and pleomorphic adenomas hybridized with probes specific for the genes encoding PLAG1, IGF-II, PDGF-B, GOS24, or c-Ha-Ras. The karyotype of the tumors has been described elsewhere (1, 2), and in tumors cg650 and cg601, the breakpoint occurs outside the PLAG1 region (20, 21).
DISCUSSION
In this report, we show that PLAG1, the major translocation target gene in pleomorphic adenoma of the salivary glands, codes for a nuclear protein that binds DNA in a sequence-specific manner. The minimal PLAG1 binding site is composed of two essential parts, a GRGGC core separated by seven random nucleotides from a RGGK cluster. This bipartite binding site is quite unusual and can be explained by the particular way PLAG1 binds DNA. Indeed, we show that two noncontiguous regions in PLAG1 are essential for DNA recognition, finger 3 interacting with the G-cluster and fingers 6 and 7 recognizing the core. This model of interaction we found by deletion/mutation analysis (see Fig. 3 ⇓ ) is corroborated by the stereochemical rules governing the DNA contacts of individual zinc fingers (22) . These rules are based on different types of studies: (a) structural studies have shown that each finger module folds to form a compact ββα structure with the α-helix fitting into the major groove (reviewed in Ref. 23 ). Residues −1, 2, 3, and 6 (numbering with respect to the start of the α-helix) typically make key base contacts that are responsible for defining sequence specificity; (b) by phage-display selections and site-directed mutagenesis, correlations were established between the amino acids present in key positions −1, 2, 3, and 6 and the nucleotide sequences of their optimal binding sites. This led to a recognition code governing zinc-finger/DNA interactions. As depicted in Fig. 6A ⇓ , the consensus we found by CASTing shows good agreement with the binding site predicted by such a recognition code. It is interesting to note that the fingers that are not involved in DNA recognition (e.g., fingers 1, 4, and 5) do not present in key position amino acids known to interact with DNA. The distance between the G-cluster and the core is also in good agreement with the presence of two noninteracting zinc fingers that are predicted to cover six nucleotides. However, alignment of all the sequences selected by seven rounds of CASTing indicates that the distance between the G-cluster and the core seems to vary from six to eight nucleotides (see Fig. 2 ⇓ ). One explanation to this variability could be the presence of a longer linker region between fingers 5 and 6 (14 amino acids instead of the usual 7 amino acids). This longer linker may generate sufficient flexibility to allow an interaction with a cluster at different positions.
Prediction and schematic representation of PLAG1 binding consensus site. A, amino acids at position −1, 2, 3, and 6 (numbering with respect to the start of the α-helix) within the PLAG1 zinc fingers are shown in the first column. Bases predicted to be preferred for binding by these amino acids are shown in the second column (22, 23). The consensus found by CASTing is shown in the third column, and thick lines indicate that the predicted base matches with the selected one. B, comparison between the PLAG1 binding site and other reported consensus binding sites like the Zac1 consensus (27), one of the characterized WT-1 binding sites (33), the consensus sequence for Sp1 binding described as the decanucleotide 5′-(G/T)GGGCGG(G/A)(G/A)(C/T)-3′ (34) and the Egr-1/Zif268 consensus binding sequence (35).
Recently, we and others identified two PLAG1-related proteins, PLAGL1[ also called Lot1 (24 , 25) or Zac1 (26)] and PLAGL2 (4) . PLAGL1 and PLAGL2 are highly homologous to PLAG1 in their NH2-terminal zinc finger domain (73 and 79% identity, respectively), whereas the COOH-terminal region is much more divergent. Strikingly, the best homology is found in fingers 6 and 7, suggesting that PLAGL1 and PLAGL2 should also recognize a core motif analogous to the PLAG1 core. In contrast, fingers 2–5 are much less conserved but still present conservation for the amino acids present in key positions (−1, +2, +3 and +6). This suggests that the three PLAG proteins would interact with similar consensus sequences. The consensus binding site for Zac1/PLAGL1 has been defined recently as GGGGGGCCCC through a CASTing assay (27) . Actually, the PLAG1 core is present in this large consensus sequence (Fig. 6B) ⇓ . However, no G-cluster was identified, suggesting that Zac1/PLAGL1 does not interact in the same way as PLAG1. Both PLAG1 and Zac1/PLAGL1 are possibly involved in tumorigenesis, PLAG1 as a putative oncogene that contributes to pleomorphic adenomas whereas Zac1/PLAGL1 as a tumor suppressor candidate that regulates apoptosis and cell cycle arrest (26) . It is thus interesting to further investigate the differences of specificities between these two related proteins to determine whether the opposite functions are the result of different target genes or different actions on the same set of genes.
The PLAG1 DNA binding consensus is highly GC-rich, which is a hallmark of most promoters of genes controlling cell growth. It is thus tempting to speculate that PLAG1 will exert its oncogenic effect via the activation of growth factors. The IGF-II is an excellent candidate because IGF-II is a peptide growth factor that plays an important role in embryonic development and also in carcinogenesis (28) . The human IGF-II is a complex transcription unit that is regulated by activation of multiple promoters designated P1 to P4. Promoter 1 activity has been demonstrated only in adult liver, whereas promoters 2, 3, and 4 are coexpressed in a variety of fetal tissues, notably fetal liver, and at a lower level, in many adult tissues with the exception of adult liver. Promoters 3 and 4 are also highly active in numerous tumor tissues, suggesting that transcriptional up-regulation of P3 and P4 activities may be importantly involved in tumorigenesis. Our study is the first demonstration of IGF-II up-regulation in tumors of the salivary glands. This up-regulation is the result of a drastic up-regulation of promoter 3 activity, as demonstrated by the hybridization performed with a P3-specific probe. Several lines of evidence strongly suggest that IGF-II up-regulation in salivary gland tumors results from transcriptional activation by PLAG1: (a) five potential binding sites were found in promoter 3 of IGF-II; (b) PLAG1 binding was effectively demonstrated on the site P3-4; (c) IGF-II promoter 3 activity is up-regulated by PLAG1; (d) IGF-II is highly expressed in tumor cells that overexpress PLAG1 but could not be detected in tumors without PLAG1 up-regulation or in normal salivary gland tissues. Our study suggests that the oncogenic activity of PLAG1 results from its positive regulation of IGF-II expression, known to potently stimulate cell proliferation in human tumors through autocrine or paracrine mechanisms (28, 29, 30) .
The PLAG1 consensus sequence is also reminiscent of G-rich sequences recognized by an important group of zinc finger proteins that include Sp1, Zif268/Egr1, and WT-1. As shown in Fig. 6B ⇓ , the PLAG1 consensus binding motif overlaps with the consensus binding sequence for the transcription factor Sp1 and the tumor suppressor WT-1, suggesting that these proteins could at least partly regulate the same set of genes. In fact, the growth factor gene IGF-II has been shown to be also one of the targets of the tumor suppressor WT-1. High levels of IGF-II in Wilms’ tumor were attributed to the loss of function of WT-1, which normally represses IGF-II transcription (31) . The WT-1 protein binds within the IGF-II P3 promoter to multiple sites with some overlapping with potential PLAG1 binding sites. This suggests that one control for IGF-II expression during development could be provided by a balance between activators (like PLAG1) and repressors (like WT-1) acting on overlapping sequences. Deregulation of one of these factors is probably an important step in tumor formation.
Acknowledgments
We gratefully acknowledge G. Stenman for the specimens of primary salivary gland tumors and P. E. Holthuizen for her kind gift of the IGF-II exon 5 probe and for (IGF-II-P3)luc plasmid. We thank J. Remacle, S. Tejpar, and B. Peers for critical review of the manuscript.
Footnotes
-
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
-
↵1 This work was supported by the “Geconcerteerde Onderzoekacties 1997–2001” and the “Fonds voor Wetenschappelijk Onderzoek Vlaanderen” (FWO). M. V. is a post-doc of the Flanders Interuniversity Institute for Biotechnology. K. K. is a post-doc of the FWO.
-
↵2 To whom requests for reprints should be addressed, at Laboratory for Molecular Oncology, Center for Human Genetics, University of Leuven and Flanders Interuniversity Institute for Biotechnology, Herestraat 49, B-3000 Leuven, Belgium. Phone: 32-16-346041; Fax: 32-16-346073; E-mail: marianne.voz{at}med.kuleuven.ac.be
-
↵3 The abbreviations used are: PLAG1, pleomorphic adenoma gene 1; CASTing, cyclic amplification and selection of target sequences; GST, glutathione S-transferase; EMSA, electrophoretic mobility shift assay; ATCC, American Type Culture Collection; PDGF, platelet-derived growth factor; EPD, Eukaryotic Promoter Databank; IGF, insulin-like growth factor.
- Received July 20, 1999.
- Accepted October 28, 1999.
- ©2000 American Association for Cancer Research.