Skip to main content
  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

AACR logo

  • Register
  • Log in
  • Log out
  • My Cart
Advertisement

Main menu

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • RSS Feeds
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Webinars
  • Search More

    Advanced Search

  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

User menu

  • Register
  • Log in
  • Log out
  • My Cart

Search

  • Advanced search
Cancer Research
Cancer Research
  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • RSS Feeds
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Webinars
  • Search More

    Advanced Search

Molecular Biology & Genetics

The 630-kb Lung Cancer Homozygous Deletion Region on Human Chromosome 3p21.3: Identification and Evaluation of the Resident Candidate Tumor Suppressor Genes

Michael I. Lerman and John D. Minna
Michael I. Lerman
Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, Dallas, Texas 75390-8593 [J. D. M.], and Laboratory of Immunobiology, National Cancer Institute, Frederick Cancer Research and Development Center, Frederick, Maryland 21702 [M. I. L.]
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John D. Minna
Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, Dallas, Texas 75390-8593 [J. D. M.], and Laboratory of Immunobiology, National Cancer Institute, Frederick Cancer Research and Development Center, Frederick, Maryland 21702 [M. I. L.]
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, Dallas, Texas 75390-8593 [J. D. M.], and Laboratory of Immunobiology, National Cancer Institute, Frederick Cancer Research and Development Center, Frederick, Maryland 21702 [M. I. L.]
DOI:  Published November 2000
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

We used overlapping and nested homozygous deletions, contig building, genomic sequencing, and physical and transcript mapping to further define a ∼630-kb lung cancer homozygous deletion region harboring one or more tumor suppressor genes (TSGs) on chromosome 3p21.3. This location was identified through somatic genetic mapping in tumors, cancer cell lines, and premalignant lesions of the lung and breast, including the discovery of several homozygous deletions. The combination of molecular manual methods and computational predictions permitted us to detect, isolate, characterize, and annotate a set of 25 genes that likely constitute the complete set of protein-coding genes residing in this ∼630-kb sequence. A subset of 19 of these genes was found within the deleted overlap region of ∼370-kb. This region was further subdivided by a nesting 200-kb breast cancer homozygous deletion into two gene sets: 8 genes lying in the proximal ∼120-kb segment and 11 genes lying in the distal ∼250-kb segment. These 19 genes were analyzed extensively by computational methods and were tested by manual methods for loss of expression and mutations in lung cancers to identify candidate TSGs from within this group. Four genes showed loss-of-expression or reduced mRNA levels in non-small cell lung cancer (CACNA2D2/α2δ-2, SEMA3B [formerly SEMA(V), BLU, and HYAL1] or small cell lung cancer (SEMA3B, BLU, and HYAL1) cell lines. We found six of the genes to have two or more amino acid sequence-altering mutations including BLU, NPRL2/Gene21, FUS1, HYAL1, FUS2, and SEMA3B. However, none of the 19 genes tested for mutation showed a frequent (>10%) mutation rate in lung cancer samples. This led us to exclude several of the genes in the region as classical tumor suppressors for sporadic lung cancer. On the other hand, the putative lung cancer TSG in this location may either be inactivated by tumor-acquired promoter hypermethylation or belong to the novel class of haploinsufficient genes that predispose to cancer in a hemizygous (+/−) state but do not show a second mutation in the remaining wild-type allele in the tumor. We discuss the data in the context of novel and classic cancer gene models as applied to lung carcinogenesis. Further functional testing of the critical genes by gene transfer and gene disruption strategies should permit the identification of the putative lung cancer TSG(s), LUCA. Analysis of the ∼630-kb sequence also provides an opportunity to probe and understand the genomic structure, evolution, and functional organization of this relatively gene-rich region.

INTRODUCTION

Lung cancer kills >150,000 patients each year in the United States and many more around the world. These are more deaths than attributable to colon, prostate, and breast cancer combined (1, 2, 3, 4) . The scale of this epidemic has heightened efforts to understand the molecular pathogenesis of lung cancer, including genes frequently undergoing somatic mutation. The isolation of such“ lung cancer genes” should guide the development of new therapeutic interventions and new early detection and prevention strategies. Although tobacco smoking is a well-established environmental etiology in lung carcinogenesis (5) , our understanding of the acquired genetic changes leading to lung cancer is still rudimentary and incomplete (6) . The lack of (and difficulty of performing) genetic linkage studies of lung cancer in families (4 , 7) that have the potential of identifying initiating cancer causing genes has directed the search toward allele loss mapping in tumors, cell lines, and premalignant lesions of the lung and breast (8, 9, 10, 11, 12, 13) . A convergence of evidence from these allele loss mapping studies, including identification of overlapping homozygous deletions, strongly suggests the presence of a TSG 4 in the chromosome 3p21.3 band (6 , 9 , 10 , 14) . Allele loss in the 3p21.3 area is the earliest premalignant change thus far detected in lung cancer development (12 , 15 , 16) . Biallelic or monoallelic inactivation of this putative TSG gene(s) likely represents a critical (rate-limiting) step in the development of sporadic lung cancer. As part of our efforts to identify a lung cancer TSG (which we will provisionally call LUCA) on 3p21.3, we physically mapped and cloned the genomic DNA surrounding this locus as defined by homozygous deletions (6 , 9 , 10 , 14 , 17, 18, 19, 20) . Subsequently, the ∼630-kb clone contig was sequenced jointly by The Washington University 5 and The Sanger 6 Human Genome Sequencing Centers. In more recent work, we placed the putative 3p21.3 TSG(s) in a ∼120-kb segment that was defined by a homozygous deletion in a breast cancer specimen that was nested within the three small cell lung cancer homozygous deletions (10) . In parallel with these genetic and physical studies, we have been constructing a map of transcript sequences with the aim to identify a complete set of all transcripts encoded in the region and to define/annotate the respective genes. Here we report the catalogue of genes we have discovered to be residing in the 630-kb sequence and their experimental and informatics characterization. Of these, only two“ G protein” genes, i.e., GNAI2 (21) and GNAT1 (22) , had been cloned and characterized previously, and from this catalog we positioned these two genes within the contig DNA sequence. The set of 19 genes found in the overlapping homozygous deletions in SCLCs NCI-H740, NCI-H1450, and GLC20 (18) , including eight in the smaller critical 120-kb sequence defined by a breast homozygous deletion (10) , were analyzed extensively. We used both manual experimental methods to study expression and search for mutations and web-based computational servers to predict possible protein functions. Four of the genes by Northern analysis showed frequent reduced or absent mRNA levels in NSCLC (CACNA2D2, SEMA3B, BLU, and HYAL1) and SCLC (BLU, HYAL1, and SEMA3B) cell lines. We found that six of the genes had mutations, but none of the 19 genes showed a high frequency of mutations (>10%) in the analyzed lung tumor samples. This raises the possibility that the putative TSG, LUCA, may be one of the genes with frequent loss of expression that occurs through acquired tumor promoter hypermethylation (23) . Alternatively, it could belong to the class of haploinsufficient TSGs. This novel class of TSGs is predicted to predispose to cancer in a hemizygous (+/−) state but does not show a second hit in the remaining wild-type allele in tumors. Further functional experimental analysis such as growth suppression studies and gene knockout strategies will be required to reveal the identity of the putative 3p21.3 TSG(s). In addition, our study shows that genomic DNA sequencing in combination with high-quality gene annotation is an effective method of gene discovery.

MATERIALS AND METHODS

Cell Lines and DNA Samples

Lung cancer cell lines were started and maintained by us using published methods, and information on the lines are summarized in the NCI-Navy collection (American Type Culture Collection; Ref. 24 ). DNA from the SCLC line, GLC20 (17) , was a gift of C. H. C. Buys (University of Groningen, Groningen, the Netherlands); the SCLC cell line, U2020 (25) , was a gift of P. Rabbitts (MRC, Cambridge United Kingdom). Lung cancer normal/tumor paired DNA samples were from the NCI-Navy collection established by B. Johnson (26) .

Commercial Reagents

The following materials were purchased from the vendors indicated: PCR tool kits, Perkin-Elmer Cetus; DNA sequencing tool kits, Applied Biosystems (Foster City, CA); fluorescent in situ hybridization reagents, Boehringer Mannheim (Mannheim, Germany); cDNA and cosmid libraries, ClonTech Laboratories (Palo Alto CA) and Stratagene (La Jolla, CA); EST clones from public databases, the I.M.A.G.E consortium 7 or Research Genetics (Rockville, MD); SSCP tool kits and blotting nylon membranes, Amersham (Arlington Heights, IL); oligonucleotide primers, Life Technologies, Inc. (Rockville, MD); restriction enzymes, Life Technologies, Inc., New England Biolabs (Beverly, MA), and Amersham (Arlington Heights, IL); MTN poly(A)+ RNA blots, ClonTech Laboratories (Palo Alto, CA); buffers, blotting solutions, and RNase free water, Quality Biologicals (Gaithersburg, MD); chemicals, Sigma Chemical Co. (St. Louis, MO); and cell culture media, Life Technologies, Inc. (Rockville, MD).

Informatics Tools

The software package, GENSCAN (27) , was licensed from Christopher Burge (MIT, Cambridge, MA) and installed at the NCI Advanced Biomedical Computing Center at the Frederick Cancer Research and Development Center. The integrated informatics package, PANORAMA, incorporating BLAST, GENSCAN, GRAIL, and other gene interpretation features, was developed at University of Texas Southwestern Medical Center at Dallas (TX) by H. Garner and run on a Hewlett Packard Exemplar supercomputer. PANORAMA is available for internet use. 8 For this analysis, GenBank was downloaded December 1999.

Manual Molecular Procedures

All molecular manipulations (DNA and RNA isolations, screening genomic and cDNA libraries, Northern and Southern blot analyses, and PCR) were performed using standard methods according to Sambrook et al. (28) . For DNA sequencing, cDNA clones were sequenced on an Applied Biosystems 373 or 377 DNA sequencer (Stretch) using Taq Didroxy Terminator Cycle Sequence kits (Applied Biosystems, Foster City, CA) with either vector or clone-specific walking primers. Cosmid and P1 phage DNAs were sequenced by the Washington University and Sanger Human Genome Sequencing Centers using the shotgun procedure as described (21 , 22) . FISH and two-color FISH were used to locate and orient the cosmid contig on chromosome 3p. Normal metaphase chromosomes were hybridized simultaneously with digoxygenin-labeled NotI linking clone NL1–210, part of cosmid LUCA1, (green) and biotin-labeled cosmid LUCA20 (red) (29) . 4′,6-Diamidino-2-phenylindole was used as a counterstain. Both the metaphase spreads and interphase nucleus staining confirmed the single-site location of each probe on 3p21.31, establishing the following order: centromere-cosmids LUCA1-LUCA20-telomere. Pulsed-field gel electrophoresis analysis was performed as follows. High molecular weight DNA was prepared in agarose plugs as described (30) . Slices containing∼ 10 6 cells were digested for 16 h with 50 units of enzyme (NotI, Nru1, and Mlu1; Boehringer Mannheim) and resolved on 1% agarose gels using a Bio-Rad CHEF Mapper (Hercules, CA) and electrophoresis profiles, allowing separation in the range 50–1000 kb. For expression analyses, Northern blot hybridization was performed with cDNA probes using commercial MTN poly(A)+ RNA blots ClonTech (Palo Alto, CA) from a variety of adult human tissues and tumor cell lines and in-house blots with total or poly(A)+ RNA were prepared from lung cancer cell lines. Radioactive DNA probes were prepared by random priming Rediprime II (Amersham, Arlington Heights, IL). Hybridization was performed in ExpressHyb hybridization solution according to manufacturer’s instructions (ClonTech Laboratories, Palo Alto, CA). In addition, the presence of gene transcripts was monitored in silico by BLAST homology searches (31) in public EST databases. Mutational analyses were performed by RT-PCR-SSCP or exon-PCR-SSCP, followed by sequencing of shifted bands as described previously (32 , 33) . Experimental gene discovery by using conserved and transcribed genomic fragments was performed as detailed previously (18) .

Computational and Bioinformatics Procedures

World Wide Web-based Servers and Databases.

World Wide Web-based servers and databases (34) were used to analyze genomic, cDNA, and predicted protein sequences. In addition, the Wisconsin Genetics Computer Group, package 10 (35) , and the GENSCAN (27) programs were run at the Advanced Biomedical Computing Center (Frederick Cancer Research and Development Center), whereas the University of Texas Southwestern Medical Center integrated gene analysis software PANORAMA was run at the University of Texas Southwestern Medical Center.

DNA and Protein Sequence Analyses.

Global sequence alignments were done using BLAST (31) and Advanced BLAST 9 ,(36) programs as provided by National Center for Biotechnology Institute,10and BLAST2/WU-BLAST.11Multiple sequence alignments, global and local, were done using the CLUSTAL version W program12as provided by EMBL, Baylor Computing Center,13and the Wisconsin Genetics Computer Group, package 10 program (35) . Protein structural features were delineated using the EXPASY proteomics tools.14Protein domains were discovered using Pfam (37) and SMART (38) programs as provided by EMBL.15Protein subcellular localization was predicted using PSORT16 (39) . Signal peptides, transmembrane helices, and membrane topologies were predicted by SPLIT17 (40) , TMHMM18 (41) , and PSORT (39) programs. Protein motifs were found by visually inspecting local alignments or using the protein motifs (42) , ProfileScan and Prosite (43) programs. In addition, we used the INTERPRO server (44) .19

Discovery of Orthologous Genes in Model Organisms.

Stringent criteria for identification of candidate orthologous genes were applied as suggested (45 , 46) . In the mouse, orthologous pairs were >90% identical on the protein level with> 90% alignment of their entire lengths (47) . In the fly, worm, and yeast, candidates were identified with 20–50% identity over at least 80% of their lengths. The TBLASTN program was used to search nonredundant nucleotides, Unigene, and EST databases of the model organisms. EST clusters were then built by the EST assembly machine server20or the EST Assembler21at Max Delbrück Center.22The advanced BLAST2 and Orthologue program (48) at EMBL was used to confirm the putative orthologous relationships and obtain and ascertain phylogenetic trees.

In Silico Gene Discovery Was Performed following Two Different Protocols.

Genome-wide repeats and low complexity regions in the genomic DNA sequences were identified and masked using the program RepeatMasker.23They were then used in BLASTN searches against EST, Unigene, and nonredundant nucleotide databases to identify potential transcripts (ESTs and cDNAs) and build EST clusters. Next, genomic sequences assembled from the individual cosmid sequences were subjected to gene prediction programs, i.e., GENSCAN (27) and XGRAIL (49) , with default settings to identify coding DNA sequences and corresponding protein sequences. These were then used in BLASTN and TBLASTN searches, respectively, against nonredundant nucleotide and EST databases to identify ESTs and cDNAs. The ESTs were then assembled into clusters (see above). Genomic information (repetitive elements, coding exons, ESTs, and known and predicted genes) was also obtained and analyzed by first-pass automatic genome annotation programs, PANORAMA (18) , for individual cosmid sequences and by the Rummage package24for the whole assembled ∼630-kb contig sequence. The Rummage analysis was kindly performed by Drs. A. Rosenthal and R. Schattevoy, both at the Genome Sequencing Center (Jena, Austria). Recently, The Genome Annotation Channel25made available their first-pass annotations for most of the contig sequences.

Gene Annotations.

Annotations for the proteins for all of the genes discovered in the contig sequence were compiled from computational predictions, experimental observations, and by transfer of information from the yeast, worm, and fly orthologue pairs. Functional conservation between human proteins and their orthologous counterparts was repeatedly demonstrated experimentally (46 , 50) .

RESULTS

Overlapping and Nested Homozygous Deletions Define 120-kb and 250-kb Regions for a TSG Search and Identification of 25 Resident Genes in the Overall ∼630-kb Region.

LOH (allele loss) is an important sign of somatically acquired genetic events in the natural history of many tumors and is a useful tool to discover the location of new TSGs. However, the regions of allelic losses involving 3p in lung cancer (6 , 9) and premalignant lesions (12 , 14, 15, 16) are multiple and often large, making it difficult to define a small consensus region that would facilitate a realistic positional cloning effort. Fortunately, three overlapping homozygous deletions in lung cancer cell lines (NCI-H740, NCI-H1450, and GLC20; Refs. 9 ,, 17, 18, 19 ,, 51 ) and one in a breast tumor specimen (HCC1500; Ref. 10 ), in conjunction with functional evidence of suppressor activity (9 , 52, 53, 54) , strongly support the existence of a putative TSG(s) in this location and successively narrowed the critical gene region, first to 370 kb (18) , and more recently, to ∼120 kb (Ref. 10 ; Fig. 1 ⇓ ). This latest reduction of the critical segment by a nesting deletion in a breast carcinoma assumes that the same gene(s) was targeted in both lung and breast malignancies (10) . However, if the targeted gene(s) were different, the lung cancer gene(s) could lie in either the ∼120-kb region or in the more telomeric ∼250-kb segment of the 370-kb region.

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

Genetic and physical characterization of the human tumor nested homozygous deletion region on 3p21.3 showing the steps leading to the identification of the cloned resident candidate tumor suppressor genes. Top, ideogram of the banding pattern of chromosome 3p with some genetic framework markers found in and flanking the deleted region. Next is shown a physical pulsed-field gel electrophoresis map of the homozygously deleted region and its flanking sites. The positions of the rare cutter restriction sites are indicated above the line and the sizes (in kb) of the corresponding NotI restriction fragments are given below the line. The physical map is followed by a diagrammatic representation of the overlapping homozygous deletions discovered in SCLC cell lines (NCI H1450 in blue, NCI H740 in magenta, and GLC20 in green) and a nested deletion in a breast cancer tumor sample and corresponding cell line (H1500 in brick). The sizes of the homozygous deletions, deletion overlaps, and the precisely mapped breakpoints are indicated. The minimum tiling cosmid contig covering the deleted region is shown below the deletions. The position of the framework genetic markers is shown by downward arrows; the GenBank accession nos. for cosmid sequences are given under the lines, and the abbreviated cosmid/P1 clone numbering system given to the Genome Centers (e.g., Luca01, Luca02… Luca22, and P1 clone p3938) is given above the lines. The sequencing was done jointly by The Washington University (Luca12 to P3938) and The Sanger (Luca01 to Luca11) Genome Sequencing Centers using the high-redundancy shotgun procedure and is available from GenBank or the centers ftp sites (see: ftp.sanger.ac.uk/pub/human/sequences/Chr_3/ and ftp://genome.wustl.edu/pub/gsc1/sequence/st.louis/human/). Finally, a line representing the ∼630-kb assembled genomic sequence with the positioned resident gene and breakpoints defining critical regions is shown at the bottom. The genes are represented by pointed rectangles, indicating the orientations of transcription. The gene names are given above and the GenBank accession nos. below the rectangles. The ∗ with downward arrows indicates the position of two genes (temporary names LUCA1.2 and LUCA2.3) on cosmids Luca01 and Luca02, respectively, whose sequencing is not yet completed. The 6 genes outside of the critical region in the centromeric portion are in magenta, the 8 genes in the combined breast and lung 120-kb critical region are in blue, whereas the remaining 11 genes still within the rest of the lung cancer nested homozygous deletions (and additional 250 kb) are all in green. The 6 genes exhibiting at least some point or small mutations have an “M” in their box. Probable pseudogenes and other in silico predicted genes for which we currently cannot confirm their status as a bona fide gene are not shown.

The genomic DNA cosmid/P1 clone contig (Fig. 1) ⇓ covering the overlapping homozygous deletions in 3p21.3 is now sequenced almost to completion and is ∼630 kb long.26Fig. 1 ⇓ summarizes the genetic and physical map of the 3p21.3 region on which we have searched for a lung cancer TSG, including contig structure, and successive steps leading to the definition of the critical regions of ∼370 kb, ∼250 kb, and ∼120 kb. We have systematically proceeded to identify and test all of the genes in the region for their candidacy as a TSG (Fig. 1 ⇓ ; Table 1 ⇓ ). In addition to the genes we detected by manual methods (such as screening cDNA libraries with cosmid DNAs and using the cosmids for exon capture), the genomic sequence allowed us to detect other gene candidates in silico. We used BLAST searches for ESTs, and gene prediction programs such as GRAIL, GENQUEST, and GENSCAN (27 , 49) including our own integrated informatics tool, PANORAMA.27The total number of verified resident genes in the 630-kb region currently is 25, 6 of which were in the centromeric portion of the 630-kb region and outside of the overlap of SCLC or breast cancer homozygous deletions (Fig. 1 ⇓ ; Table 1 ⇓ ). For these 25 genes, we have reported mutational and expression analysis on 3pK, IFRD2/SM15, and SEMA3B and laid out the rough outline of some of the gene locations in the contig (18 , 55, 56, 57) . The gene prediction programs, particularly GENSCAN, identified all of the 25 verified genes and fairly accurately (>90%) predicted their ORFs and intron/exon structure (although in some cases distinct genes were combined into one gene).

View this table:
  • View inline
  • View popup
Table 1

Candidate and verified genes identified in the ∼630-kb 3p21.3 homozygous deletion region and status of their mutation analysis

In addition to the 25 genes, by an intensive informatics effort using the genomic DNA sequence, we found several potential genes (Table 1) ⇓ . These included two gene candidates (arbitrary names Gene29, cosmid LUCA2, and Gene19, cosmid LUCA10; Table 1 ⇓ ), identified by one or two EST hits that did not have intron/exon structure, were not detected by the GENSCAN prediction program, did not have a significant ORF, and did not have a detectable mRNA signal on MTN blots. We believe these may represent genomic DNA contamination of the EST database. Another set of gene candidates were detected by GENSCAN (29) and GRAIL-EXP (49) gene prediction programs analysis of genomic DNA (arbitrary names Gene22, Gene23, Gene24, Gene25, and Gene27; Table 1 ⇓ ). Although programs provide a predicted cDNA sequence, ORF, and intron/exon structure, thus far we have found no EST hits in GenBank, no mRNA signal on MTN blots, and no homology of the predicted protein to proteins in GenBank. Thus, at the present time we cannot verify these GENSCAN predicted candidates as actual genes, and they are noted for future reference. These observations suggest that we have found all, or nearly all, resident genes in the ∼630-kb sequence. This contention is likely true for the ∼120-kb sequence (Fig. 1) ⇓ that contains all of seven complete genes and a large part of the CACNA2D2 gene. In this critical region, the genes reside in the following order with the indicated sizes and intergenic distances: CACNA2D2 (80 kb), 5-kb intron; PL6 (4.5 kb), 1.5-kb intron; 101F6 (2.3 kb), 161-bp intron; NPRL2/g21 (3.3 kb), 1.8-kb intron; BLU (4.7 kb), 4.4-kb intron; RASSF1/123F2 (7.6 kb), 1.6-kb intron; FUS1 (3.3 kb), 4.5-kb intron; and HYAL2 (2.7 kb). With a total of ∼18 kb of intergenic and ∼17 kb of intronic sequence, this region is extremely gene dense and contains 20–25% of coding sequence.

The location of the contig on 3p21.3 and its position relative to other 3p genes were determined by the presence in the ∼630-kb sequence of several framework genetic markers mapped to this location (e.g., D3S1621 and D3S1568; Fig. 1 ⇓ ), multiple radiation hybrid mapped sequence tagged sites (SHGC-11855 shown), and other linked genetic markers (see detailed marker information on genomic sequence contigs NT_002322, NT_000067, and NT-000069).28In addition, in situ hybridization [FISH and two-color FISH with DNA from cosmids LUCA1 (Z74618), LUCA8 (Z84495), and LUCA20 (AC004693)] located the contig to 3p21.3 and proved the orientation as shown on Fig. 1 ⇓ (data not shown). We also performed radiation hybrid mapping with the TNG panel with selected markers for finer resolution of the location (not shown). In addition, using the contig sequence, we have determined intergenic distances, exon-intron structures, the intron sizes of the resident genes, and ascertained the direction of transcription. These are all features that are immediately available by performing a BLAST analysis with our deposited cDNA sequences (Table 1) ⇓ against the individual cosmid or assembled genomic sequences.29

In total, the 25 genes occupy ∼630-kb of genomic DNA, resulting in an average size of ∼25 kb/gene, which agrees well with the size of an average gene (∼30 kb) estimated for the whole human genome. However, distribution of these genes along the sequence is rather uneven, with the highest gene density (genes in green in Fig. 1 ⇓ ) in the∼ 120-kb region (average gene size, ∼15-kb). Gene sizes varied dramatically, with the smallest gene size of 2.3 kb (g101F6) and the largest of ∼140 kb (CACNA2D2). Actually, this large gene is interrupted by the centromeric breakpoints of both the HCC1500 breast cancer homozygous deletion (in cosmid LUCA6) and in the SCLC GLC20 homozygous deletion in LUCA10 (Fig. 1) ⇓ . The intergenic distances also differed enormously, from 161 bp (between g101F6 and NPRL2/g21) to ∼60 kb (between genes g20 and CACNA2D2). The intron sizes also varied dramatically, the smaller ones ranging from 50–1000 bp and the largest being 40–50 kb (in CACNA2D2).

It is worthwhile to examine what type of genes (if any) would have been missed by our manual and informatics prediction analyses. These omissions could include genes that lack GenBank matches, genes whose sequences would not be recognized by the available gene prediction programs, such as non-protein coding genes, and genes whose mRNAs have a very restricted pattern of tissue expression. Another approach to this gene saturation problem will be to compare the human sequence with the corresponding mouse genomic sequence. Because functional sequences such as transcriptional enhancers and mRNA-like noncoding RNAs are highly conserved in mammalian genomes, they might be readily detected in this comparison (58 , 59) . The effectiveness of this approach has been demonstrated recently (60 , 61) . The mouse BAC clone (#AC025353) covering this region was used for this alignment (data not shown).

The three lung cancer homozygous deletions identified an ∼370-kb region extending from cosmid LUCA10 (Z75742; defined by the centromeric end of the GLC20 homozygous deletion) to P3938 (AC004814; defined by the telomeric end of the NCI-H740 homozygous deletion (Fig. 1) ⇓ . The nested homozygous deletion in breast cancer HCC1500 (10) , which covered part of cosmid LUCA06 (Z84493) to part of cosmid LUCA13 (AC002455; Fig. 1 ⇓ ), divided the 19 genes in the 370-kb region into two critical gene sets: 8 genes in a 120-kb segment extending from part of cosmid LUCA10 to part of cosmid LUCA13, and 11 genes in the telomeric portion (∼250 kb, extending from cosmid LUCA13 to P1 clone P3938, AC004814). These deletions eliminated the 3pk/MAPKAP3 (U09578), CISH (AF132297, temporarily called Gene18 in our GenBank deposit), HEMK isoforms I and II (AF131220 and AF172244), Gene20 (AF188706), and two partially characterized genes[ Gene28(Luca1.2) and Gene30(Luca2.3)], all lying in the centromeric portion of the contig (∼260 kb of genomic DNA located in cosmids LUCA1, LUCA2, LUCA3, and LUCA4), from further consideration. In addition, expression and mutation analysis by us of 3pk (U09578) and by Sithanandam et al. (55) and Uchida et al. (62) of CISH (AF132297) provided other evidence excluding these genes as well.

Expression Analysis of the Candidate Tumor Suppressor Genes.

All of the remaining19 genes were analyzed extensively by manual and computational methods. Northern analyses with cDNA probes for each gene using commercial poly(A)+ RNA MTN blots (Clontech) and a panel of lung cancer cell line RNAs revealed the sizes of the respective transcripts and patterns of expression in normal human tissues and lung cancer samples (Figs. 2 ⇓ and 3 ⇓ ; Table 2 ⇓ ). Expression in Northern blots prepared from total or poly(A)+ RNA of 20–30 RNA samples from lung cancer cell lines, representing both SCLC and NSCLC, revealed, for many of the genes, levels of expression similar to normal lung. No abnormal transcript sizes suggestive of mutations were found. However, several genes showed reduced expression in the lung cancer lines, i.e., the CACNA2D2 gene not expressed in 50% of the lines, the BLU gene expressed only in 30%, the HYAL1 gene expressed in <30% (Fig. 3) ⇓ , and SEMA3B and SEMA3F (19 , 57 , 63) expressed in <50%. Thus, expression analysis in lung cancers identified these five genes as potential TSG candidates based on loss of expression in a sizable number (but not all lung cancers). Because a possible mutation mechanism is tumor-acquired promoter hypermethylation (23) , the methylation status of the CpG islands associated with the genes showing reduced or absent expression is currently under investigation.

Fig. 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 2.

mRNA expression in normal human tissues using MTN blots of 15 of the 630-kb 3p21.3 homozygous deletion region resident genes. The RNA filters (#7759, #7760 from Clontech, Palo Alto, CA) contain 2μ g of poly(A)+ mRNA per lane for each tissue indicated: Lane 1, heart; Lane 2, brain; Lane 3, placenta; Lane 4, lung; Lane 5, liver; Lane 6, skeletal muscle; Lane 7, kidney; and Lane 8, pancreas. cDNA probes for the genes were labeled and hybridized as described in “Materials and Methods.” Published examples of MTN expression for some of the other genes are given for 3pk/MAPKAP3 (55) , CACNA2D2 (68) , SEMA3B and SEMA3F (57) , IFRD2/SM15 (56) , and HYAL3 (79) .

Fig. 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 3.

mRNA expression in lung cancer cell lines of 14 of the genes resident in the 630-kb 3p21.3 homozygous deletion region. Replicate Northern blots were made using 20 μg of total RNA per lane for each of the samples; probes were labeled and hybridized as described in “Materials and Methods.” Each of the blots was used for two to three different probes with stripping of label between hybridization. The lung cancer cell lines are shown above the panel as well as one B lymphoblastoid cell line (BL5). As positive controls, the various replicate blots showed approximately equivalent loading by rRNA amounts and for various positive control probes (not shown). Note that for genes PL6 and G15/RBM5, there is an approximately equal expression in most of the samples for these two genes. As a negative control, note that NCI-H740 RNA homozygously deleted for the entire region provides a negative background signal. The size of the mRNA species for each gene is given on the right of the panel. For published examples of expression of several of the other genes in lung cancer cell lines, see 3pk/MAPKAP3 (55) , CACNA2D2 (68) , SEMA3B and SEMA3F (57) , and IFRD2/SM15 (56) . The tumor histologies of the lung cancer lines are: SCLC (H82, H146, H249, H524, H740, H1514, H1618, H2141, H2171, and H2227); adenocarcinoma (H358 [bronchioloalveolar] H838, H1742, and H2077); large cell (H460, H1155, and H1299); and mesothelioma (H290 and H2052; Ref. 24 ).

View this table:
  • View inline
  • View popup
Table 2

mRNA expression of 3p21.3 genes in Multiple Tissue Northern (MTN) blotsa

Mutation Analysis of the Candidate Tumor Suppressor Genes.

Initially, we performed Southern blot analysis on ∼100 genomic DNAs from our large panel of lung cancer cell lines representing both SCLC and NSCLCs (24) using cDNA or genomic probes representing each of the 25 genes in our search for other homozygous deletions or genomic DNA rearrangements (data not shown). We found only the homozygous deletions listed in Fig. 1 ⇓ and ∼30-kb homozygous deletion in SCLC NCI-H524 involving most of the genomic sequence in cosmid LUCA13-interrupting gene FUS1 to gene HYAL1 (data not shown, and see discussion below). Mutational analyses of the resident genes (summarized in Tables 1 ⇓ and 3 ⇓ ) were performed on lung cancer cDNAs or genomic DNAs by RT-PCR-SSCP or PCR-SSCP, respectively, followed by DNA sequencing of any altered bands detected. A large number of both SCLC and NSCLC cell lines and for some genes, paired normal and tumor DNA samples, were used. In addition, the coding sequences of the FUS1 and the BLU genes were sequenced completely in a large number of lung cancer samples, including paired tumor and normal tissue samples. The results for the eight-gene set in the 120-kb region show that the genes either had no mutations at all (CACNA2D2, PL6, 101F6, 123F2, and HYAL2), or the mutation rate was in the range of 5% (NPRL2/g21, BLU, and FUS1; Table 1 ⇓ ). The same absence or low frequency of mutations was detected in the 11 genes lying in the more telomeric ∼250-kb portion of the 630-kb contig with HYAL1, FUS2, and SEMA3B, each exhibiting a few mutations (Table 1) ⇓ . Many examples of the mutations detected in lung cancer cell lines are given in Table 3 ⇓ . Thus, this extensive mutational analysis (involving 1102 separate tumor sample/gene mutation tests; Table 1 ⇓ ) did not pinpoint a strong candidate gene with a high frequency of mutation among either of the critical gene sets. This finding was unexpected and disappointing. Because it is possible that genes not involved in tumorigenesis may show a low frequency of mutations in common tumors related to a “mutator phenotype” expressed by tumors (64 , 65) , the finding of a low mutation rate in a candidate TSG(s) must be regarded with caution. Accordingly, other lines of evidence besides finding frequent mutations are needed to rule out positionally defined candidate TSGs. These include analysis for tumor-acquired promoter hypermethylation, gene transfer into tumor cells with tests for suppression of the malignant phenotype, and disruption of the candidate genes in “knock-out” mice. In addition, we now also have to consider the newly recognized class of haploinsufficient TSGs, where the presence of only one wild-type allele facilitates tumorigenesis (see “Discussion”).

View this table:
  • View inline
  • View popup
Table 3

Examples of tumor cell lines bearing homozygous amino acid sequence altering mutations of candidate 3p21.3 TSGs

Informatics Analysis for Predicted Protein Functions and Orthologue Identification in Model Organisms.

Computational predictions of biochemical functions could also provide clues as to whether a particular gene could serve a tumor suppressor function by affecting cell growth and/or survival. Therefore, we next studied the biochemical functions and subcellular localization of the proteins encoded by the genes using a variety of computational tools. Information about protein function is a continuum that begins with the finding of homology between proteins within and between species, domain composition, functional motifs, and subcellular localization signals, extending ultimately to demonstration of function by biochemical analysis. Finding orthologous genes in model organisms (mouse, fly, worm, and yeast) permits the transfer of any functional annotation to the human gene in question; therefore, we performed an extensive search for orthologues of the resident candidate TSGs in these model organisms. The results of these computations are summarized and discussed in the annotations for each of the genes (see below). As expected, we found that all 19 genes have true murine orthologues discovered in mouse EST databases. These genes showed nearly 90–100% identity/similarity on the protein sequence level and about 80–90% on the mRNA sequence level (GenBank accession nos. are provided in the annotations). In the worm, highly likely orthologous pairs were identified for 14 of the 19 genes using stringent criteria of orthology, i.e., 30–50% amino acid sequence identity or similarity with >80% alignment of their entire amino acid lengths (45 , 46) . In the fly, highly likely orthologues were found for 10 of the 19 genes among the complete set (∼14,000) of fly genes30 (66) . We noticed that orthologous pairs fall into three categories: common for both worm and fly, only present in the worm, and only present in the fly. Yeast genes sharing ∼50% similarity in common domains/ features were found for two of the genes, i.e., PL6 and NPRL2/Gene21, probably only the Schizosaccharomyces pombe counterpart (NPRL2) of NPRL2/g21 should be considered a candidate orthologous gene. The availability of the complete DNA sequences of yeast (50) , worm31, and fly (66) genomes makes it unlikely that we have missed any of the orthologous gene pairs for our TSG candidates in these three model genomes. We now provide annotations for each of the 19 genes found in the ∼370-kb segment starting with the genes in the smaller ∼120-kb sequence (Fig. 1) ⇓ .

The α2δ-2 calcium channel subunit gene, CACNA2D2, was discovered in silico by both finding EST matches with fragments of genomic sequence and by exons predicted by GENSCAN. The gene occupies ∼140 kb of genomic space and is composed of at least 40 exons. It is expressed as a 5.5–5.7 kb mRNA. Three mRNA splice forms have been detected that code for two protein isoforms in several normal tissues. GenBank deposits AF040709 (mRNA isoform 3) and AF042792 (mRNA isoform 1) differ in the 5′ untranslated region and encode the same amino acid sequence (protein isoform I), whereas AF042793 (mRNA isoform 2) differs in the 5′ translated region and has a slightly different amino terminal amino acid sequence (protein isoform II). The expression of CACNA2D2 is reduced or absent in >50% of lung cancer cell lines, particularly NSCLCs. However, no mutations were detected in analysis of 60 lung cancer cell lines and 40 paired normal/SCLC tumor samples. The nucleotide sequence suggests that the gene encodes an auxiliary regulatory α2-δ subunit of calcium channels and joins theα 2-δ-1 (previously A subunit) gene (67) as a new and second member of the α2-δ gene family. Three putative transmembrane helices predicted previously in the α2δ-1 protein (67) were also predicted in both protein isoforms of the CACNA2D2 gene with the SPLIT 35 program (40) . In addition, protein isoform I of the CACNA2D2 gene has another membrane helix at the very amino terminus. Using the TMHMM program (41) , all three α2δ proteins were predicted to span the membrane only once at the amino (α2δ-2 isoform I) or at the carboxy termini (α2δ-1 and α2δ-2 isoform II), favoring the single-transmembrane model for the α2δ subunit proteins, which was verified experimentally for theα 2δ-1 protein (67) . A protein binding a VWA-like domain was discovered by the PFAM (37) program in the extracellular part at similar positions in all three α2δ subunit proteins (amino acid residues: 291–469 and 222–400 for α2δ-2 protein isoforms I and II, respectively, and residues 253–430 for theα 2δ-1 protein). The VWA-like domain may facilitate the binding of the α2δ complex with the calcium channel α-1 pore forming subunit protein (67) . The almost identical membrane topologies, similar domain structures, and posttranslational modifications of all three α2δ subunit proteins strongly support the identity of the new α2δ-2 gene as a member of theα 2δ gene family. To provide experimental confirmation of this predicted function, through injection of CACNA2D2 cRNA into Xenopus oocytes, we have confirmed recently that CACNA2D2 acts as a regulatory subunit of voltage-gated calcium channels able to augment the function of all three pore-forming units (68) . BLAST (36) searches in the mouse EST database detected two different nonoverlapping EST clones (accession nos. AA000341 and AA008996), which showed 91 and 85% cDNA sequence identity with CACNA2D2 (residues 2925–3421 and 4989–5391), respectively. These EST sequences showed only limited homologies to the murine α2δ-1 gene splice forms, indicating that they represent true orthologous sequences of the human CACNA2D2 gene. This was further corroborated by protein alignment of the 86-amino acid ORF encoded by mouse EST AA000341, which was 96% identical to the CACNA2D2 isoform I protein (amino acid residues 922-1005). The worm genome also contains two α2δ genes; by stringent criteria of orthology (i.e., ∼50% identity/similarity with >80% alignment of their entire amino acid sequence) the worm gene, T24F1.6, appears to be the orthologue of CACNA2D2, whereas the second worm α2δ gene, UNC-36 (accession no. P34374), is the orthologue of the α2δ-1 gene (67) . The UNC-36 phenotypes do not affect growth, and no phenotypes were yet reported for the T24F1.6 locus. The fly proteome (∼14,000 proteins; Ref. 66 ) contains three α2δ proteins (accession nos. AAF53505, AAF53476, and AAF58335) of which the first contains a likely orthologue of α2δ-1 (44) , the second is a likely orthologue of the α2δ-2 gene, and the third of a still-not-cloned human gene; all three have the VWA_DOMAIN. The yeast proteome (50) contains only one ion channel gene and appears to have no orthologues for either of the α2δ genes. Despite the lack of mutations, the absence of CACNA2D2 expression in many but not all NSCLCs with high CACNA2D2 expression in normal lung makes CACNA2D2 an excellent candidate TSG with the need for testing of function in tumor cells and study of acquired promoter hypermethylation as a method of inactivation of gene expression.

The PL6 gene was discovered manually by probing Northern blots with genomic fragments. The gene occupies 4.5 kb of genomic space, is composed of two exons, and expressed as a 2.2-kb mRNA in many normal human tissues including lung. The expression of PL6 is slightly reduced in some SCLC lines and abundantly represented in the human and mouse EST databases. No mutations were detected in 38 cell lines and 40 paired normal/SCLC tumor samples. By sequence analysis, PL6 encodes an integral plasma membrane protein[ PSORT program (39)] with six [SPLIT program (40)] or seven to eight [TMHMM program (41)] transmembrane helices. The predicted cytoplasmic portion of the protein (the last 103 amino acids, residues 249–351) contains an OMPdecase domain [residues 274–298; PFAM, (39)] that may involve PL6 in protein-protein interactions and a bipartite NLS (residues 282–299; Ref. 39 ) that may guide it to the nucleus. The mouse orthologue was discovered in an EST (accession no. W96860), sequenced (our accession no. AF134238), and shown to be 92% identical on protein and 87% on cDNA levels. The worm F11A10.3 gene encoding a multidomain protein that aligns with PL6 and the aligned region contains the NLS and the OMPdecase domains, suggesting that it is the orthologue of PL6. The fly gene CG9536 product (450 residues; accession no. AAF52388) is a likely orthologue of PL6 (48% similarity over the first 306 residues of PL6) and is also an integral membrane protein with seven to eight transmembrane helices. It has two HMW kininogen domains (residues 325–349 and 353-3760) but no NLS and OMPdecase domains. The yeast gene Yol107w product has substantial homology with PL6 but does not have the NLS and the OMPdecase domain. The absence of mutations and robust expression of PL6 in most lung cancers suggest that PL6 is an unlikely candidate TSG.

The 101F6 gene was discovered manually by screening arrayed cDNA libraries with cosmid LUCA12 DNA. The gene space of 3.2 kb contains four exons encoding a 1.5-kb mRNA. The gene is expressed in many normal tissues including lung, is highly expressed in SCLC and NSCLC cell lines, and is abundantly represented in the EST data bases. No mutations were detected in 40 cell lines and 40 paired normal/SCLC tumor samples. By sequence analysis, 101F6 encodes an integral plasma membrane protein [PSORT program (39)] with six [SPLIT program (40) and TMHMM program (41)] transmembrane helices with both termini in the cytoplasm. No other known domains or significant motifs were detected. The mouse orthologue was discovered in the mouse EST database (accession nos. AA285935, AA198541, and AA198960), sequenced (our accession no. AF131206), and shown to be 95% identical on the protein and 85% on the cDNA sequence level. No orthologous pairs were detected in the fly (66) , worm, and yeast proteomes (50) . The absence of mutations and robust expression of 101F6 in most lung cancers suggest that 101F6 is an unlikely candidate TSG.

The NPRL2/Gene21 gene was discovered in silico by finding both ESTs matches and GENSCAN predicted exons. The gene space of 3.3 kb contains 11 exons coding for a 1.5-kb mRNA with multiple splice isoforms that are expressed in many normal tissues including lung and testis and is abundantly represented in the EST databases. NPRL2/Gene21 is well expressed in SCLC and NSCLC lines except for the SCLC line NCI H1514. A frameshift mutation producing a stop codon was detected in 1 of 40 lung cancer cell lines. Sequence analysis shows NPRL2/Gene21 encodes a soluble protein that has a bipartite NLS (residues 62–79) and a protein binding domain, granulin (residues 86–98), predicted by PFAM ( 35 ,, 37 . The mouse orthologue was discovered in mouse EST databases (accession nos. AI037102, AA764527, AA709972, and W64225), sequenced (our accession no. AF131206), and shown to be 97% identical on protein and 90% on cDNA sequence levels. True orthologues were identified: in yeast (the NPR2 gene in Saccharomyces cerevisiae, GenBank accession no. P39923, and the hypothetical Mr 47,000 protein in S. pombe, accession no. Z99163); in the fly (66) , the CG9104 gene product (accession no. AAF48677) with 65% similarity over the whole length of the NPRL2/g21; and in the worm (accession no. U61949) proteome databases. However, only the mouse orthologue contains the bipartite NLS (residues 62–79) and the granulin domain (residues 86–98). NPRL2/Gene21 mRNA is expressed in most lung cancers. The mutations in NPRL2/Gene21, particularly the stop mutations, indicate the need for further study of this gene as a candidate TSG.

The BLU gene was discovered manually (and serendipitously) using PCR primers (kindly provided by B. Vogelstein, Johns Hopkins, Baltimore, MD) to screen for the presence of the β-catenin gene (at the time recently assigned to chromosome region 3p21) in our cosmid contig. Although a PCR product was identified, DNA sequence analysis showed no sequence relationship of the product to β-catenin. This PCR product was used as a probe that identified a mRNA on Northern blot analysis, which then led to the subsequent isolation of the full BLU cDNA by library screening. The gene space of ∼4.5 kb contains 11 (testis version) or 12 (lung version) exons coding for a 2-kb, alternatively spliced mRNA, well expressed in lung and testis but not expressed in all other tested human tissues. The EST databases contain a moderate number of hits, mostly from lung and testis cDNA libraries. The testis isoform contains 11 exons because of a complex selection of an alternative acceptor site. The testis-specific protein isoform contains a different amino acid sequence between residues 199 and 234 as compared with the lung-specific isoform; this change results in the loss of one of three PKC phosphorylation sites (residues 229–231). The expression in SCLC and NSCLC cell lines is reduced or virtually undetectable in 70% of tested lines. Three missense mutations were discovered in a sample of 61 lung cancer cell lines. The BLU protein is likely a soluble cytoplasmic protein and shares 30–32% identity over a stretch of 100–112 amino acids (residues 334–437 or 318–430) with proteins of the MTG/ETO family of transcription factors (69) and the suppressins (70) that may regulate entry into the cell cycle and suppress growth of colon carcinoma cells. The “Zn knuckle” motif involved in specific protein-protein interactions is part of this domain and is present in many proteins.32No orthologous pairs were found in the worm and yeast proteomes (50) . However, the fly genome (66) contains a true orthologue of BLU: the CG11253 gene product (accession no. AAF49850) is of similar size (451 residues), has 49% amino acid sequence similarity over the whole length of BLU, and also has a MYND finger domain (residues 412–448). Several other fly, worm, and S. pombe proteins share 35% identity with the MTG/ETO domain. The mouse orthologue was discovered in mouse EST databases (accession nos. AI595515 and AI429164), sequenced (our accession no. AF123386), and shown to be 89% identical on protein and 87% on cDNA levels. The loss of expression in most lung cancers and the occurrence of a few mutations make BLU an attractive TSG candidate requiring further functional and promoter methylation status studies.

The RASSF1/123F2 gene was discovered manually by screening gridded cDNA libraries with cosmid LUCA12 DNA. The gene space of 7.6 kb contains 5 exons coding for 2-kb, alternatively spliced mRNAs (“short” and “long” forms, 123F2SF and 123F2LF, that should now be referred to by Human Genome Organization-approved nomenclature as RASSF1C and RASSF1A, respectively) that are well expressed in all analyzed human tissues including lung. The RASSF1C/123FSF but not the RASSF1A/123F2LF mRNAs are well expressed in most lung cancer cell lines. The mRNA is well represented in EST databases from normal and tumor tissues. Using GENSCAN prediction programs, the RASSF1A/123F2LF splicing form was discovered using RT-PCR on mRNA with a difference in amino acid sequence in the NH2 terminus, giving a total amino acid sequence of 340 amino acids compared with 270 amino acids for the RASSF1C/123F2SF. The amino acid sequence of RASSF1A/123F2LF contains a predicted DAG binding domain also found in the related gene NORE1 but not found in the RASSF1C/123F2SF cDNA sequence. RASSF1A/123F2LF mRNAs also come in multiple tissue-related splicing forms, with slight differences in amino acid sequence, including forms for lung (RASSF1A, AF102770), heart (RASSF1D, AF102771, and pancreas (RASSF1E, AF102772). No mutations were detected in 40 paired normal tumor (SCLC/NSCLC) DNA samples (studied for RASSF1C/123F2SF and RASSF1/123F2 common region) and in 38 lung cancer cell lines (RASSF1C/123F2SF and RASSF1A/123F2LF, all regions). The RASSF1/123F2 protein is a soluble cytoplasmic protein that contains a Ras association domain (residues 124–218) discovered by the SMART (38) and PFAM (37) programs. Although not all Ras association domains bind RasGTP, the Ras association domain in the mouse paralogue of RASSF1/123F2, NORE1 was found to bind RasGTP (71) . The NORE1 protein also contains the PKC-C1 and DAG/PE domains, which are found in the RASSF1A/123F2LF predicted protein but not in the RASSF1C/123F2SF protein. Recently, the Kastan group has identified RASSF1/123F2 amino acid sequence (common to both the RASSF1C/123F2SF and RASSF1A/123F2LF proteins) as a potential phosphorylation target for ataxia telangiectasia mutated (72) . The mouse orthologue of RASSF1/123F2 was discovered in mouse EST databases (accession nos. AA543890, AA161846, and AA466998), sequenced (our accession no. AF132851), and shown to be 97% identical on protein and 88% on cDNA sequence levels. In contrast, the human orthologue of the mouse NORE1 and the rat MAXP1 (accession no. AF002251) genes is present in a single human EST (accession no. AA362184). Thus, RASSF1/123F2 is part of the same gene family as (but not the orthologue of) NORE1 and the rat gene MAXP1. The worm gene, T24F1.3 (accession no. Z49912), encodes a 615-amino acid hypothetical protein that shares 33% identity and 53% similarity over 95% of the length of the RASSF1/123F2 protein. The T24F1.3 protein contains in the shared portion with RASSF1/123F2 the Ras association domain (residues 396–496), and in addition a PH domain (residues 1–53), and PKC-C1, DAG/PE binding domains (residues:164–214), which are found in the RASSF1A/123F2LF predicted protein. The fly (66) and yeast (50) proteomes do not contain a gene with substantial homology to 123F2. The absence of expression of RASSF1A/123F2LF in many lung cancers makes this isoform an attractive candidate for further promoter hypermethylation and tumor-suppressing functional studies. In fact, recent studies by us have shown that RASSF1A/123F2LF promoter region CpG islands undergo tumor-acquired hypermethylation associated with loss of expression, and that forced re-expression of RASSF1A/123F2LF leads to suppression of the malignant phenotype.33

The FUS1 gene was discovered manually by screening cDNA libraries with a genomic fragment from the area of cosmids LUCA12 and LUCA13 showing sequence conservation by Southern blot hybridization and isolated as the fusion (FUS = “fusion”) junction of the ends part of a ∼30-kb homozygous deletion in SCLC NCI-H524 linking LUCA12 with LUCA13 sequences. The gene space of 3.3 kb contains three exons coding for a 1.8-kb mRNA that is well expressed in all analyzed human tissues including lung and in 20 lung cancer cell lines. The mRNA is well represented in EST databases from normal and tumor cells. Three mutations were discovered in 79 lung cancer cell line DNAs leading to truncated products. The FUS1 protein (110 amino acids) is probably a soluble cytoplasmic protein with a high pI of 9.69; no domains or known motifs were detected by SMART (38) or PFAM (37) programs. The mouse orthologue was discovered in mouse EST databases (AA867009, AA473614, and AA672013), sequenced (our accession no. AF123387), and shown to be 93% identical on the protein level and 87% on the cDNA level. The fly proteome (66) does not contain a gene with substantial homology to FUS1. However, the worm gene, C09E9.1, shows 41% identity on global alignment and 43% identity over 83% of the FUS1 protein length and should be considered a candidate orthologue of FUS1. This small worm protein (123 amino acids) is predicted to have a bipartite NLS (residues 84–101) and weak similarity with DNA-directed RNA polymerase subunit A′ (accession no. P31813). The mutations found in FUS1 make it an attractive candidate for further functional TSG studies.

The HYAL2 gene along with HYAL1 was discovered manually by screening cDNA libraries with a genomic fragment from LUCA13 conserved across species in Southern blotting. Because these were the first two genes we isolated in our positional cloning effort, they were initially given the working names LUCA1 and LUCA2 (see GenBank deposits). With the discovery of their function as hyaluronidases, they should now be referred to as HYAL1 and HYAL2 and the LUCA1, 2 names reserved for future discovery of a lung cancer functional TSG. The gene space of 2.8 kb contains three exons that encode a 2-kb mRNA well expressed in all analyzed human tissues including lung, well expressed in lung cancer cell lines except SCLC line NCI-H524 because of a small (∼30 kb) homozygous deletion/rearrangement.34HYAL2 is abundantly represented in EST databases from normal and tumor tissues. No mutations were detected in 40 lung cancer cell lines tested. The HYAL2 protein is a member of a large family of hyaluronidases (EC3.2.1.35), and in fact the expressed recombinant protein was shown to have enzymatic activity (73) . PSORT predicts a signal peptide and cell surface and lysosomal sublocalizations (37 , 39) . Similarly, SMART predicts a signal peptide and a Ca2+ binding epidermal growth factor-like domain (residues 365–440; Ref. 38 ). Recently, the mouse orthologue (AJ000059) was cloned and mapped to the syntenic region of mouse chromosome 9 between the microsatellite markers D9Mit183 and D9Mit17 (74) . The worm gene, T22C8.2, encodes a similar size protein of 458 amino acids and shows 32% global identity and 50% similarity with the HYAL2 protein and is predicted to be an orthologous protein by the Orthologue program (48) . The fly (66) and yeast (50) proteome databases do not have any members with homology to the hyaluronidase family of proteins.

HYAL1 was discovered along with HYAL2, manually by screening cDNA libraries with a conserved genomic fragment from cosmid LUCA13. It is another hyaluronidase with amino acid sequence homology to HYAL2 and HYAL3 (see below). The gene space of ∼3.5 kb contains three exons coding for a 2.6-kb mRNA well expressed in all analyzed human tissues, including lung, and is abundantly represented in EST databases from normal and tumor tissues. However, it is not expressed in 18 of 20 lung cancer cell lines. Two missense mutations were detected in 40 lung cancer cell lines. The HYAL1 protein is a member of a large family of hyaluronidases (EC3.2.1.35) and in fact was shown to have enzymatic activity (accession nos. U03056 and U96078.1). Triggs-Raine et al. (75) identified two mutations in the HYAL1 alleles of a patient with newly described lysosomal disorder, mucopolysaccharidosis IX, a mutation that introduces a nonconservative amino acid substitution (Glu268Lys) in a putative active site residue and a complex intragenic rearrangement, 1361del37ins14, which results in a premature termination codon. They reasoned that the mild phenotype engendered by these mutations was the result of redundancy resulting from the three tandemly located hyaluronidases HYAL1, HYAL2, and HYAL3 (discussed below). Thus far, no increased incidence of cancer has been reported in these kindreds. PSORT predicts a signal peptide and a cell surface and lysosomal sublocalizations (39) . Similarly, SMART predicts a signal peptide and a visible Ca2+ binding epidermal growth factor-like domain (residues 357–430; Ref. 38 ). Recently, the mouse orthologue was cloned and shown to map to the syntenic mouse chromosome 9 region (accession no. AF011567; Ref. 76 ). The worm gene, T22C8.2, encodes a similar size protein of 458 amino acids and shows 31% global identity and 46% similarity with the HYAL1 protein and contains the same domains. It is predicted to be an orthologous protein by the Orthologue program (48) . The fly (66) and yeast (50) proteome databases do not have a member homologous to the hyaluronidase family of proteins. The absent expression and occurrence of mutations make HYAL1 an attractive candidate for future promoter methylation and TSG functional studies.

The FUS2 gene was discovered manually by screening cDNA libraries with a genomic fragment occurring in cosmid LUCA14 that showed conservation in Southern blot cross species hybridizations. FUS2 also was present in the fusion junction genomic DNA clone isolated from the NCI-H524 30-kb homozygous deletion but was not involved or rearranged in this deletion but was given the “FUS” working name at the time of its isolation. The gene space of ∼3.5 kb contains an intronless, single-copy gene (accession no. AF040705) coding for a 1.9-kb mRNA expressed in normal human tissues including lung. However, an alternatively spliced form (accession no. AF040706) with one intron exists that results in the same predicted amino acid sequence. The alternatively spliced form contains an intron in the 5′ untranslated region, whereas the other form is intronless. The mRNA is well represented in EST databases from normal and tumor tissues. Four FUS2 missense mutations were detected in 78 lung cancer cell lines. The FUS2 protein was predicted to be a soluble nuclear protein[ predicted by PSORT (39)] with interesting domains and motifs. SMART (38) and PFAM (37) programs predicted an acetyltransferase (GNAT) domain (residues 66–189) and a proline-rich domain (residues 239–262) that overlaps (residues 234–249) with the Wilms’ tumor protein signature. A Src homology 2 domain (residues 240–250) was detected by the BLOKS (77) program, whereas the EMOTIF (42) program detected a ZP motif (residues 25–32) and an eukaryotic thiol (cysteine) protease signature (residues 180–188), which may explain the suggested weak similarity to furin-like proteases (accession no. AAC02732). The presence of these domains raises the intriguing possibility that FUS2 may be directly involved in nuclear activities. However, Zegerman et al. (78) demonstrated recently that FUS2 functions as an N-acetyltransferase using a ping-pong mechanism with a specificity for substrates and is a soluble cytoplasmic protein. The worm protein, C56G2.15, shows 32% identity and 65% similarity on global alignment and should be considered a true orthologue of the FUS2 gene. As expected, it also contains all but the ZP predicted protein domains and is predicted to be a nuclear protein. Interestingly, the worm gene contains three small introns in contrast to the one or no intron forms of the human FUS2 gene. The mouse orthologue was discovered in mouse EST databases (accession nos. AA051756, AA051686, AI425576, and AA833145), sequenced and shown to be 69% identical on protein level and 87% on cDNA level ( accession no. AF172275). The mouse mFUS2 protein contains an additional 28-amino acid stretch, and similar to the human protein, is predicted to be a nuclear protein by the PSORT program (39) . PFAM (37) and ProfileScan (43) both predict an acetyltransferase (GNAT) domain (residues 92–217) and a proline-rich domain (residues 267–291). The Flybase (66) contains several ESTs (accession nos. AI064351, AI109425, and AI404849), which could be assembled into a partial cDNA coding for a 168-residue protein that is 39% identical and 60% similar to the human FUS2 protein and is predicted to have an acetyltransferase (GNAT) domain (residues 13–138). However, the fly proteome (66) does not have a true orthologue of FUS2, only several proteins with a GNAT domain. The occurrence of mutations and the demonstration of its biochemical activity make FUS2 and attractive candidate for future TSG functional studies.

The HYAL3 gene was discovered in silico by finding EST matches and sequence relationship to the HYAL1 and HYAL2 genes. It occupies 5.5 kb of genomic space and codes for a ∼2.0-kb mRNA composed of two or three coding exons, expressed in several human tissues including lung and testis and well represented in EST databases. The protein belongs to the hyaluronidase family of enzymes (EC3.2.1.35) and thus represents the third member of this family in the region. Phylogenetically, it is closer to the worm hyaluronidase gene, T22C8.2, than the other members of the human family. No mutations were found in 40 lung cancer cell lines. No mouse orthologous sequences were found in the databases as of April 2000. HYAL3 RNA was not expressed in any lung cancer cell lines (data not shown); however, it has a very restricted pattern of expression in normal tissues and was not found by Csoka et al. to be expressed in normal lung (79) . The lack of mutations and absent expression in normal lung make HYAL3 a less attractive TSG candidate.

The IFRD2/SKMC15/SM15 gene was discovered experimentally by screening cDNA libraries with conserved genomic fragments (56) . GenBank refers to SKMC15/SM15 as IFRD2, and thus we will use the terminology IFRD2/SM15. The gene space is ∼6 kb and codes for a∼ 4-kb mRNA composed of 12 exons, expressed in several human tissues including lung (56) . It is well represented in EST databases from normal and tumor cells. The IFRD2/SM15 protein is a soluble nuclear protein as predicted by PSORT (39) and contains a bipartite NLS at residues 115–132 predicted by ProfileScan (43) . PFAM (37) discovered one Armadillo-β catenin-like repeat (residues 249–288), which suggests the possibility of involvement in APC signaling. No mutations were found in 63 lung cancer cell lines (56) . The mouse orthologue was discovered in ESTs (accession no. W65790), sequenced and shown to be 93% identical on the protein level and 87% on the cDNA level. This true mouse orthologue of IFRD2/SM15 is different from the mouse gene mIFRD1/PC4, which has its own human orthologue located on chromosome 7q22–31 (80) . Interestingly, mIFRD1/PC4 and its human orthologue, IFRD1, are not localized in the nucleus and probably are membrane proteins. IFRD2/SM15 and IFRD1/PC4 have different patterns of expression during mouse development (47 , 80) . The relation of IFRD2/SM15 and probably other members of the family (PC4 and IFRD1) to the IFNs is not really supported. The slightly shorter worm protein, F58B3.6 (accession no. Z73427) , shows on global alignment 36% identity and 52% similarity to the SM15 protein and should be considered a potential orthologue. The fly gene CG3098 product (accession no. AAF51186) is shorter (324 residues) and has 43% similarity to 277 residues of SM15, has a NLS signal (residues 93–110), and should be considered a potential orthologue. The expression of IFRD2/SM15 and lack of mutations make it a less attractive TSG candidate.

The SEMA3B/SEMA A(V) gene was discovered experimentally by using DNA fragments from cosmid LUCA14 to screen cDNA libraries and for capture of exons (57) . The correct nomenclature for this member of the semaphorin family is SEMA3B [previously referred to as SEMA-A(V)]. It is composed of 17 exons spread over 8–10 kb of genomic space coding for a 3.4-kb mRNA expressed in several normal tissues including lung and testis and not expressed at all in 12 SCLC lines (Fig. 2 ⇓ ; Ref. 57 ). It is well represented in EST databases from normal and tumor tissues. Three missense mutations were found in 39 lung cancer cell lines; all mutations were in NSCLCs. The mouse semaphorin A gene (accession no. X85990) is most likely the mouse orthologue of the SEMA3B gene (86% identity and 89% similarity on the protein level on global alignment; Ref. 48 ). Several mouse EST clones (accession nos. AI553114, AA518074, and AA466386) when translated show 80–94% identity. The worm genome contains three semaphorin genes, of which the CeSema gene (accession no. U15667) shows 33% identity and 49% similarity over the whole length of the CeSema protein and could be considered an orthologous gene (48) . The fly proteome (66) contains seven semaphorin proteins, of which the product of the Sema-2a gene (accession no. AAF57990) is of similar size, predicted to be secreted, has a similar domain structure, and should be considered a potential candidate orthologue of SEMA3B. The SEMA3B protein is predicted by PSORT (39) to be an extracellular secreted protein. SMART (38) and PFAM (37) programs identify a signal peptide (residues 1–25), a PFAM:SEMA domain (residues 55–497), and one IGc2 domain (residues 587–646). Interestingly, the PFAM:SEMA domain is also present in the extracellular part of the MET and RON oncoproteins belonging to the MET family of receptor tyrosine kinases, as discovered by the PFAM program (37) . Thus, it will be reasonable to test the hypothesis that interaction of SEMA3B and SEMA3F (see below) proteins with these oncogenes may disrupt the activation of MET and RON and therefore convey a negative growth signal. The lack of expression and mutation make SEMA3B an attractive candidate for methylation and TSG functional analysis.

The GNAI2 gene was discovered and cloned 12 years ago as part of studies on G proteins (21) . GNAI2 was mapped to 3p21.3 by us and others and located to the central part of the 370-kb region (Fig. 1 ⇓ ; Refs. 9 ,, 17 , 18, and 20 ). It is composed of 8 exons spread over ∼22 kb of genomic space. The∼ 2.5-kb mRNA is well expressed in normal tissues and lung cancer cell lines and is well represented in EST databases. No mutations were found in 34 lung cancer cell lines. The product is a G protein localized to the endoplasmic reticulum. PFAM (37) predicts a G-α domain (residues 6–354) and an arf domain (residues 157–307). The α GTPase function was established experimentally. The mouse orthologue (accession nos. RGMSI2 and P08752) is 98% identical and 99% similar on protein and 96% identical on cDNA levels. The worm orthologue ( accession no. P51875) is 67% identical, and the fly orthologue (accession no. P20353) is 76% identical on the protein level. The newly predicted fly gene product G-oα47A (accession no. AAF58790) is identical in size, 72% similar in amino acid sequence, and should be considered a potential orthologue of GNAI2. The lack of mutations and continued expression of GNAI2 in lung cancers suggest it is an unlikely TSG candidate.

The G17 gene was discovered experimentally by cDNA selection onto cosmid LUCA17, which was then used to screen cDNA libraries. The gene space of 17 kb encodes a 3-kb mRNA composed of 18 exons. The mRNA is expressed in several human tissues including lung, well represented in EST databases from normal and tumor tissues. No mutations were found in 38 lung cancer cell lines, and Gene17 was expressed in many lung cancers. The product is predicted to function as a plasma membrane amino acid transporter by homology to ABC transporters. It contains 10–11 transmembrane helices [predicted by SPLIT (40) and TMHMM (41) programs] and an aromatic amino acid permease-2/xan_ur_permease domain (residues 67–455) predicted by ProfileScan and PFAM servers. The mouse orthologue was discovered in several ESTs clones (accession nos. AI098786, AI048261, and AI466351), sequenced (our accession no. AI098786), and shown to be 90% identical on cDNA and 97% on protein levels. The yeast (50) , worm, and fly (66) proteomes contain several amino acid transporter genes of similar size. The lack of mutations and continued expression make Gene17 an unlikely TSG candidate.

The GNAT1 gene was cloned 10 years ago (Ref. 22 ; accession no. X15088) and encodes the transducin protein isolated from the eye. We and others positioned the gene in 3p21.3, i.e., in the homozygous deletion overlap region close to the GNAI2 gene (Fig. 1 ⇓ ; Refs. 18 and 20 ). The gene space of ∼3.5 kb contains seven exons and encodes a 1.5-kb mRNA expressed abundantly in the retina and fetal heart tissues and T-cell lines. GNAT1 was not expressed in lung and lung cancer cell lines. No mutations were found in analysis of genomic DNA in 35 lung cancer cell lines. The mouse orthologue (48) was cloned (accession no. P20612) and is 100% identical on global protein alignment. The worm (accession no. P51875) and fly (accession no. P20353) proteomes contain similarly sized G-proteins with 50 and 60% identity, respectively, with yet unknown function. The newly predicted fly gene product G-oα65A (accession no. AAF50626) is identical in size, 86% similar, and should be considered a true orthologue of GNAT1. The transducin protein is anα 1 G-protein subunit localized in the endoplasmic reticulum. PFAM (37) predicts a G-α domain (residues 2–349) and an arf domain (residues 161–342). The restricted tissue distribution of expression of GNAT1 and lack of mutations make GNAT1 an unlikely TSG candidate.

The SEMA3F/SEMA-IV/SEM IIIF gene, the second semaphorin gene in the region, was identified experimentally and cloned independently by several groups (Refs. 19 ,, 57, and 63 ; accession nos. U38276 and U33920). The current correct nomenclature for this gene is SEMA3F (previously referred to as SEMAIV and SEM IIIF). The gene space of ∼28 kb encodes a 3- and/or 4-kb mRNA composed of 18 exons. The gene is well expressed in several normal tissues including lung (19 , 57 , 63) and is represented in EST databases from normal and tumor tissues and cell lines. Comparison of the cDNA sequences of Xiang et al. (63) and Roche et al. ( 63 ; U38276 and U33920) indicate that there are alternatively spliced forms of SEMA3F. SEMA3F was expressed in several but not all lung cancer cell lines (15 of 19; Ref. 57 ). No mutations were found in 30 lung cancer lines by Sekido et al. (57) testing 456 of the total 753 amino acids for mutations and in tests of the full ORF in 28 SCLC cell lines by Xiang et al. (63) . ProfileScan (43) predicts an extracellular (secreted) protein containing several recognized domains. SMART (38) and PFAM (37) predict a signal peptide (residues 1–18), a single immunoglobulin-like domain (residues 619–680), two PFAM:SEMA domains (residues 57–153 and 184–529), and a proline-rich region (residues 715–726). As discussed for SEMA3B, the SEMA domains raise the possibility that the SEMA3F protein interrupts signaling by the MET and RON receptor tyrosine kinase receptors. The mouse orthologue isoforms a and b were cloned (accession nos. AF80090 and AF080091) and also exist in several EST clones (accession nos. AA216900, W75532, and AA216899). The fly proteome (66) contains seven semaphorin proteins, of which the product of the CG4383 gene (accession no. AAF57999) is predicted to be secreted, has a similar domain structure, and could be considered a potential candidate orthologue of SEMA3F. This fly protein is similar to the worm gene CeSema (accession no. U15667), which is 29% identical and 48% similar to SEMA3F, and both are predicted to be secreted proteins. The loss of expression of SEMA3F and results from the Naylor laboratory (52) , showing suppression of tumorigenicity of mouse A9 fibrosarcoma cells by a P1 clone in the SEMA3F region means this gene has to be considered in functional and methylation analysis.

The G15/RBM5 gene was discovered experimentally by screening arrayed cDNA libraries with cosmid LUCA22 (U73168) DNA. The same gene (called RBM5) was cloned by others (Ref. 81 ; accession no. AAD04159). The gene space of 18.5 kb encodes 18 exons expressed as a main 4-kb mRNA in several human tissues (a minor 7.5-kb species, and in some tissues, 1.5-kb and 2-kb species are also expressed). The mRNA is well expressed in several lung cancer cell lines and is represented in human and mouse EST databases from normal and tumor tissues. No mutations were found in 18 lung cancer cell lines. According to ProfileScan (43) , PFAM (37) and SMART (38) , the G15/RBM5 protein is a nuclear RNA binding protein featuring two bipartite NLSs (residues 34–51 and 708–725), two RNA-binding domains (residues 98–178 and 231–315), two zinc finger motifs, i.e., ZF RANBP (residues 181–210), and ZINC FINGER 2H2 2 (residues 647–677), and a recently recognized D111/G-patch DOMAIN (residues 749–786). G15/RBM5 has amino acid sequence homology with its immediate telomeric neighbor, G16/RBM6/NY-LU-12/DEF-3 (see below), indicating that they are part of the same gene family and arose through gene duplication. Studies by Drabkin et al. (82) have shown that both G15/RBM5 and G16/RBM6/NY-LU-12/DEF-3 recombinant proteins can specifically bind poly(G) RNA homopolymers in vitro. The mouse orthologue was discovered in several EST clones (accession nos. AA574979, AA139814, and AI197332) that are 90% identical on cDNA and 97% on the protein level. The worm gene, T08B2.5 (accession no. AF000263), is 28% identical and 45% similar on global protein alignment and is probably a true orthologue (48) . The fly proteome (66) contains three similar proteins, of which the CG4887 gene product (accession no. AAF51398) shows 43% similarity over 90% of the G15 length, has a similar domain structure, and is a likely orthologue of G15. The rat RNA-binding protein S1–1 ( accession no. P70501) has a similar domain structure and is close in amino acid sequence to G15/RBM5; however, a rat EST clone (accession no. AI112056) on translation showed 97% identity to the last 37 amino acids of G15/RBM5 (outside the recognized domains) and probably represents the true rat orthologue of G15/RBM5. The lack of mutations and continued expression in most lung cancers means that G15/RBM5 is an unlikely TSG candidate.

The G16/RBM6/DEF-3/NYLU12 gene, the second gene encoding an RNA-binding protein in the homozygous deletion region, was discovered experimentally by screening cDNA libraries with a conserved breakpoint genomic fragment from P1 clone 3938. Others have also cloned this gene (referred to as RBM6, NY-LU-12, and DEF-3). The identification of the NY-LU-12 clone through expression cloning using antibodies to detect the protein from a patient with lung cancer by means of the SERAX technology, and the def-3 clone as a differentially expressed gene during myelopoiesis, are of particular interest (81, 82, 83) . The gene spans the distal breakpoint of the NCI-740 deletion (Ref. 18 ; Fig. 1 ⇓ ), occupies more than 60 kb of genomic DNA, and encodes more than 16 exons expressed as a 4-kb mRNA in several human and mouse tissues, including lung. Alternatively spliced forms have been described (83) . This mRNA is well expressed in several lung cancer cell lines and is represented in human and mouse EST databases from normal and tumor tissues. One mutation was found in 39 lung cancer cell lines. According to ProfileScan (43) , PFAM (37) , and SMART (38) , the G16/RBM6 protein is a nuclear RNA binding protein. The following motifs and domains were predicted: a bipartite NLS (residues 1016–1033), one RNA-biding domain (residues 456–536), two zinc fingers, ZF-RANBP (residues 535–565), and ZF-C2H2 (residues 955–980, and a D111 DOMAIN (residues 1057–1094). Recombinant proteins containing the RNA recognition motifs of DEF-3(g16/NY- LU-12) and LUCA15 specifically bound poly(G) RNA homopolymers in vitro (82) . The mouse orthologue was cloned (accession no. AJ006486) and also exists in several EST clones (accession nos. AA607276 and AA549397) with 90% identity on the cDNA level and 97% on the protein level. The fly proteome (66) contains three similar proteins, of which the CG4887 gene product (accession no. AAF51400) shows 43% similarity over 85% of the G16 length, has a similar domain structure, and is a potential orthologue of G16. The lack of mutations and continued expression in lung cancers make G16/RBM6 an unlikely TSG candidate.

DISCUSSION

We initiated these studies to gain insight into the molecular mechanisms of lung cancer pathogenesis with the goal of identifying and cloning a lung cancer TSG(s) residing in chromosome region 3p21.3. Our interest in this region began with our initial cytogenetic description of a chromosome 3p deletion abnormality in SCLC and subsequently in NSCLCs >17 years ago (84, 85, 86) . This was followed by our discovery of 3p allele loss in SCLC (87, 88, 89, 90) . During this period, many groups also described 3p allele loss in lung cancer and other common cancers and found 3p cytogenetic abnormalities (9) . Karyotyping, comparative genomic hybridization, interphase FISH (summarized in Ref. 91 ), and identification of homozygous deletions, in conjunction with numerous molecular allelotyping studies, have identified a minimum of four separate consensus regions of 3p allele loss, i.e., 3p25-p26, 3p21-p22, 3p14.2 (FHIT region), and 3p12 (U2020 homozygous deletion region; Refs. 6 ,, 8 , 9 , 11 , 17 , 19 , 51, and 92, 93, 94, 95 ). A more recent study with very high density allelotyping, as well as results from multiple groups, have identified at least eight different 3p LOH sites (14) . To aid in this TSG positional cloning effort, the homozygous deletions were precisely mapped at 3p12 (U2020 region; Refs. 9 ,, 11, and 96, 97, 98 ), 3p14.2 (FHIT region; Refs. 93 and 95 ), 3p21.3 (6 , 9 , 10 , 17, 18, 19 ,, 51) , and 3p21.3–3p22 (94 , 99) . Allelotyping of histological subtypes of lung cancer showed differences in allele loss patterns at several chromosomal sites in addition to 3p between SCLC versus NSCLC histological types (8 , 14) . In addition, both SCLCs and squamous cell lung cancers usually undergo allele loss of a large part of the 3p arm. In contrast, adenocarcinomas of the lung tend to have much smaller regions of 3p allele loss (6 , 8 , 9 , 14 , 16 , 91) .

The 3p21.3 location was chosen for the current effort based on allele loss mapping studies showing that this is perhaps the first chromosomal region with allele loss in preneoplastic lesions and even in the histologically normal bronchial epithelium of current and former smokers (12 , 14 , 16 , 100) . Thus, TSG(s) residing in this region are likely to play a causative role in the earliest steps of lung cancer pathogenesis. In addition, the presence of overlapping/nested homozygous deletions in lung and breast cancer and functional studies demonstrating the presence of a tumor phenotype-suppressing function gave us both positional and functional reasons to pursue this search (6 , 9 , 10 , 17 , 19 , 20 , 52, 53, 54) . This exact region has also been identified as a site frequently undergoing loss in peripheral blood lymphocytes of lung cancer patients treated in vitro with the tobacco smoke carcinogen benzo(a)pyrene diol epoxide (101) . We first obtained a physical map of the homozygously deleted area, which we then covered with a cosmid and P1 clone contig (Fig. 1 ⇓ and Ref. 18 ). Through the auspices of the NCI, we were able to have this ∼630-kb clone contig sequenced by the Sanger Genome Center (Hinxton, United Kingdom; cosmids LUCA1-LUCA11) and the St. Louis Genome Center (LUCA12-LUCA22, P3938). The cosmid DNA sequence accession nos. are given in Fig. 1 ⇓ , and the DNA sequences are available from GenBank. These DNA sequence have been condensed to several contigs (NT_002322, NT_000067, and NT_000069).29

We identified genes in the 630-kb region by experimental and informatics methods (discussed above), bringing the total number of verified resident genes to 25. In addition, several candidate“ genes” were predicted by BLAST EST hits, GENSCAN and XGRAIL gene prediction software that we could not yet confirm as being real genes as defined by either detection of a mRNA on Northern blots or the presence of a cDNA with exon/intron structure and a reasonable ORF. With the complete genomic DNA sequence, all of the genes could, in retrospect, be identified by informatics methods “in silico.” The 19 genes residing in the largest homozygous deletion overlap region (from cosmid LUCA6 to part of P1 clone P3938) were analyzed by experimental (expression and mutational) analyses and computational methods (for predicted function using web-based servers) for information supporting their candidacy as a TSG. The extensive search for mutations in the 19-gene set did not identify a single gene with a high frequency of mutation in the analyzed lung tumors but did identify NPRL2/Gene21, BLU, FUS1, HYAL1, FUS2, and SEMA3B as candidates for further study because of the occurrence of a few mutations. In addition, a significant frequency of absent mRNA expression in lung cancer cell lines was found for the CACNA2D2, RASSF1A, HYAL1, BLU, and SEMA3B genes, also indicating their possible involvement in lung carcinogenesis as candidate TSGs with homozygotic inactivation in tumors. The combination of mutations and absent expression in BLU, HYAL1, SEMA3B, and for the RASSF1A/123F2LF mRNA isoform suggests special attention be paid to these genes. With the recognition of tumor-acquired promoter hypermethylation as a frequent mechanism of gene inactivation, those genes with loss of expression need to be studied for such acquired promoter changes (23) . In addition, functional information documenting tumor suppressor or other relevant activity (e.g., DNA repair) will be required to provide a convincing argument for TSG identification. Computational analyses predicting function for the encoded proteins among the 19-gene set did not reveal obvious homology to a known TSG, nor did it exclude genes from having a possible TSG function.

There are several obvious features of this 3p21.3 region that provide information on the evolution of this part of the genome. There are four examples of gene duplication: the G proteins GNAI2 and GNAT1; the hyaluronidases HYAL1, HYAL2, and HYAL3; the semaphorins, SEMA3B and SEMA3F; and the RNA binding proteins encoded by Gene15/RBM5 and Gene16/RBM6/DEF-3/Ny-Lu-12 genes. Each of these duplications (or triplications in the case of the hyaluronidase genes) occurred within 50–100-kb regions. In the case of the G-protein and semaphorin gene regions, a complex or sequential duplication must have occurred to account for the orientation of these four genes. In the case of Gene16 and Gene15 and the hyaluronidase genes, it appears tandem duplications occurred. Other genes (e.g., FUS2 and Gene17) then arose between these gene clusters. These duplicated genes pairs (or triplets) all have about the same amino acid sequence homology (∼40–50%) to their duplicated partners, indicating that these duplications arose at approximately the same time. Also with the exception of the G-protein genes (showing 67% homology), the hyaluronidase, semaphorin, and RNA binding protein encoding genes all have ∼30% amino acid sequence homology with their orthologous/homologous worm or fly genes. In each case, the worm or fly genomes have only one copy of the gene and not two, as are present in the human and mouse genomes.

The absence of frequent mutations in the resident candidate TSGs suggests we have to consider several other possibilities:

(a) We need to consider the possibility that the homozygous deletions only represent random genetic damage from tobacco smoke carcinogen exposure or the presence of a previously unknown fragile site, and thus, this specific 3p21.3 region does not contain a TSG. There are no obvious ways to prove or disprove that the three overlapping homozygous deletions in the SCLCs are random deletions. However, the recurrent presence of allele loss in this very specific region in multiple lung cancers and respiratory preneoplastic lesions and the occurrence of a homozygous deletion in a breast cancer not exposed to tobacco smoke carcinogens, as well as the microcell hybrid data indicating a tumor suppressor function in this region, all argue against this possibility (10 , 14 , 16 , 19 , 52, 53, 54 ,, 100) . Homozygous deletions have also been crucial in the identification of several TSGs in other tumor types such as p16 (102) , SMAD4 (103) , BRCA2 (104) , and p53 (105) . Also, homozygous deletions have been found in lung cancer for these and other TSGs including RB, p53, p16, FHIT, and PTEN (33 , 93 , 95 , 106, 107, 108) . Finally, in recently completed genome-wide scans of lung cancers using >600 polymorphic markers, we have found only a few chromosomal sites with recurrent homozygous deletions that include 3p12 (U2020 region), 3p14.2 (FHIT region), 3p21.3 (the current region under study), 9p21 (p16/INK4 region), 10q (PTEN region), and a few other regions [Girard et al. (109) and Refs. 8 ,, 11 , 14 , 33, and 95] . These studies include over 40 markers distributed over chromosome arm 3p. Thus, in our experience, the occurrence of recurring homozygous deletions in the same chromosomal region are very uncommon and appear to target TSGs.

(b) It is possible that we have not identified all of the genes residing in the critical region. Although this remains a possibility, the combination of extensive computational and experimental approaches (screening multiple cDNA libraries with cosmid clones and using the cosmid clones for exon capture) would argue against this possibility. No matches in EST databases suggestive of other genes have been detected in regular searches using the intergenic and intronic sequences. In addition, the gene identification programs GENSCAN (27) and XGRAIL (49) predicted only five candidates that we could not confirm as real genes (i.e., that did not give positive signals on Northern blots or show several matches in searches of the EST databases). In contrast, GENSCAN accurately predicted the location and nearly all of the intron/exon structures of the genes we did identify. Related to the gene detection question is the possibility that there is a tissue-specific splicing isoform that we have not detected, and it is this isoform that is mutated or has altered expression. Because of this possibility, we have particularly scrutinized GENSCAN exon predictions, ESTs mapping to intragenic regions not covered by our current cDNA sequences, and poly(A) Northern blots containing normal lung RNA for other possible mRNA forms. In fact, this methodology has allowed us to detect many different mRNA splicing variants, including those for at least 8 of the 19 genes including CACNA2D2, NPRL2/Gene21, BLU, RASSF1/123F2, HYAL1, FUS2, SEMA3F, and Gene16/RBM6/DEF-3/Ny-Lu-12. Our detailed gene analysis, coupled with an ever-enlarging EST database including information from many different tissues, will facilitate further mRNA splicing form detection. However, our analysis was not able to rule out the possibility that this 3p21.3 region encodes one or more mRNA-like noncoding RNA type genes that may function as TSG(s). We do not know of any currently available informatics approach that can identify such genes. Alignment of the human genomic sequence with the syntenic mouse sequence may provide clues to the location of additional genes such as these by detecting conserved sequences in the intergenic/intronic regions where these genes could reside (59) .35Nevertheless, continued searches of the ever-growing EST database as well as comparing the sequence of the homologous region in the mouse (when the whole sequence is available) would provide additional support to the assumption that we have identified all of the genes in this region.

Another possibility is that we have identified the gene and that it is inactivated by a combination of mutation (uncommon) and loss of expression (frequent) mechanisms. Recently, methylation of the promoter region and allele loss leading to loss of expression of p16INK4A have been identified as a combination of mechanisms leading to the inactivation of this accepted TSG in lung cancer (108) . In addition, several other TSGs, such as RB, VHL, and BRCA1, as well as a series of genes related to tumorigenesis such as E-cadherin (E-CAD), death-associated protein kinase (DAPK), glutathione S-transferase P1 (GSTP1), O-6-methylguanine-DNA-methyltransferase (MGMT), and CACNA1G, a T-type calcium channel gene, also have been found to have their expression turned off by tumor-acquired promoter hypermethylation, some of which occur in lung cancers (6 , 23 , 108 , 110, 111, 112) . The definition of the precise structure for each of the 19 genes and their associated CpG islands in putative promoter regions identified by the current work will greatly aid in our current search for promoter hypermethylation as a mechanism of inactivation of expression of genes in this region. Recent evidence indicates this to be the mechanism for loss of expression of RASSF1A. In this regard, 5 of the 19 genes showed loss of expression in a large fraction of at least one histological type of lung cancer (CACNA2D2, RASSF1A, SEMA3B, BLU, and HYAL1). A more complex version of this possibility is that more than one gene in this 3p213 region can serve as a TSG. Because> 15 different mutations occurred independently of one another (in 6 genes, BLU, NPRL2/Gene21, FUS1, HYAL1, FUS2, and SEMA3B) and independently of the loss of expression (in 5 genes) in the 40–80 lung cancers tested, it is possible that nearly all of the lung tumor lines had either a mutation or loss of expression for one of these 8 genes. Thus, these 8 genes in particular need to undergo detailed functional tests to rule out their potential tumor suppressor function.

A final possibility is that the TSG in this 3p21.3 region is 1 (or more) of the 19 cloned genes we have identified but does not conform to the classical two-mutation model for TSGs requiring homozygous inactivation in the tumor (113 , 114) . Instead, the gene may represent an example of the emerging class of haploinsufficient TSGs inferred from mouse models of human cancer (115, 116, 117, 118) . Recently it has been discovered that mice hemizygous (+/−) for p27/Kip1 or TGFβ show a higher spontaneous tumor rate compared with mice +/+ at these loci and rapidly develop multiple tumors when challenged with carcinogens. Analysis of these tumors demonstrated that the remaining wild-type alleles for these genes remained intact and expressed in the tumors (115 , 116) . Mice hemizygous for the Ptch gene frequently develop the rare tumor medulloblastoma with continued expression of one wild-type Ptch allele. This provides evidence that haploinsufficiency of Ptch leads to medulloblastoma in mice, and a similar circumstance appears to occur in humans as well (117) . Similarly, haploid loss of the tumor suppressor Smad4/Dpc4 initiated gastric polyposis and cancer in mice, suggesting that haploinsufficiency of Smad4 is sufficient for tumor initiation (118) . In addition, the autosomal dominant syndrome of familial platelet disorder with predisposition to acute myelogenous leukemia (FPD/AML and MIM 601399) in humans was found to be related to haploinsufficiency of the CBFA2 gene (119) . Thus, we need to consider the possibility that the TSG in the 3p21.3 region may be an example of an acquired abnormality in such a haploinsufficient gene.

This brings us to the discussion of lung cancer gene model(s). Inherited (120) , initiating (114) , gate-keeper (121) , and cancer-causing (our term) genes follow the two-hit proposition for cancer causation (113 , 114 , 120, 121, 122, 123) . These genes (TSG or oncogenes) are usually discovered in an inherited family cancer syndrome through genetic mapping and positional cloning (113 , 114) . Consistent with the two-hit model, the corresponding tumors show loss of one allele and inactivating mutations of the remaining wild-type allele (in the case of a TSG). In the case of an inherited activated oncogene (such as MET), the tumors may show loss of the wild-type allele with a simultaneous duplication of the mutant allele or a second activating mutation in the remaining wild-type allele (124) . The net result of these rate-limiting genetic changes (122 , 123) is a homozygous status for these genes in tumors. In contrast, the haploinsufficient TSGs lose one copy in the germ-line or somatic tissues, predispose as a result of this loss to cancer but remain hemizygous (and express the wild-type allele) in tumors. Because the acquisition of other genetic alterations is still rate-limiting in developing cancer (122 , 123) , another mutation in a different gene would appear to be required in the haploinsufficient cancer model. It is tempting to assume that it could be a mutation in a second known TSG such as p53, RB, or p16, all of which are frequently mutated in many common cancers with 3p21.3 allele loss including lung cancers (6) . There is no reason to exclude the possibility that classical TSGs and the haploinsufficient cancer-causing genes both could be involved in the development of sporadic malignancies arising from a common stem cell. In addition, it is likely that 3p21.3 allele loss is involved in the development of all histological types of lung cancer. Premalignant lesions in the lung have 3p21.3 LOH that is also found in a large fraction of SCLCs and NSCLCs (12 , 15 , 16 , 100 , 125) . These include clones with 3p21.3 allele loss in histologically normal-appearing epithelium found in the lungs of current and former smokers (16 , 100) . Recently, we have been able to estimate the size of these clonal patches with 3p21.3 and other sites of LOH to be ∼90,000 cells, as well as determine that the smoking-damaged lung contains thousands of such patches (126) . We also know from studies of multiple markers that allele loss at classic TSG loci, such as at RB (13q14) and p53 (17p13), occurs after 3p allele loss at a later stage of carcinogenesis when histologically dysplastic or carcinoma in situ lesions are evident (12 , 15 , 16 , 100) . Lung embryology identifies lung stem cells as the primitive columnar epithelia derived from the primordial upper gut at days 40–45 of development (127) . These columnar cells first differentiate into multipotent neuroendocrine cells (probably, the cells of origin of SCLC) and a variety of multipotent bronchial epithelia cells (probable the cells of origin of NSCLC). In fact, some lung cancers, within the same tumor, exhibit histological features of several lung cancer types, indicating a possible common stem cell of origin (128 , 129) . The embryology of the lung would therefore suggest that the same genes could be involved in both SCLC and NSCLC carcinogenesis.

In the light of these observations, the following model of lung cancer pathogenesis is proposed. Consequent to smoking damage, 3p21.3 allele loss occurs in thousands of different sites throughout the respiratory epithelium, leaving the putative 3p21.3 TSG haploinsufficient. These initiated cell(s) grow to form clonal patches of ∼50–100,000 cells (∼15–17 doublings) throughout the lung. The next hit could occur in another cancer-causing gene (such as RB, p53, p16, or “gene X”), leading to the next stages of invasive cancer. The RB and p53 genes are mutated in nearly 100% of SCLC samples, whereas mutations of p53 occur in 50% of NSCLC and some form of p16 inactivation (also inactivating the “RB pathway”) occurs in a very large fraction of NSCLCs (6) . Alternatively, it is possible that the 3p21.3 TSG undergoes allele loss and then a second inactivating event (either an uncommon mutation or loss of expression through promoter hypermethylation), which is required to allow the clonal outgrowth of these initiated cells. In any event, the nearly universal presence of 3p21.3 allele loss in SCLC and squamous cell carcinomas of the lung and its occurrence in >50% of adenocarcinomas of the lung suggest that this alteration is an obligate rate-limiting step in the pathogenesis of many lung cancers (6 , 8 , 14 , 16) . In this regard, it will be very interesting to study SCLCs arising in the rare families segregating a mutant RB allele in the germ-line in which the majority of affected individuals surviving to adulthood have developed SCLC to see whether tumors arising under these conditions also have to undergo 3p21.3 allele loss (130) . It should also be noted that allele loss that includes 3p21.3 and immediately surrounding 3p21 regions would lead to the hemizygotic state for several other possible cancer-predisposing genes residing in the area, including, MLH1, TGFβRII, β-catenin, RON, and Wnt5, which also could contribute to malignant transformation. Functional testing by gene transfer into lung cancer cells and gene disruption strategies in mice are now necessary to test this model and identify the putative 3p21.3 TSG(s).

Studies of familial lung cancer risk, including data from lung cancer occurring in young nonsmoking individuals, are compatible with Mendelian codominant inheritance of a rare major autosomal gene that produces earlier age of onset of lung and probably other cancers (7 , 131 , 132) . A new national consortium (Genetic Epidemiology of Lung Cancer) has been formed that has accumulated enough families with informative cases with an apparently inherited predisposition to lung cancer to begin genetic linkage analysis. In due time, a classical TSG(s) (with homozygous inactivation in tumors) segregating in these lung cancer pedigrees could also be discovered. A causative role for this inherited lung cancer gene in the origin of sporadic lung cancers would also have to be ascertained. Linkage to markers in this 3p21.3 region can be readily tested, and if found, the genes we have reported in this study would need to be screened for germ-line mutations. In any event, the early involvement of chromosome 3p21.3 allele loss in premalignant lesions and sporadic lung cancers argues that one or more of the genes we have found should play a critical role in the development of common, sporadic lung tumors.

Acknowledgments

We thank Dr. Alfred Knudson for many discussions that shaped this project, Dr. Richard Klausner for arranging the sequencing of the 630-kb 3p21.3 region, and both Genome Sequencing Centers, The Sanger Genome Sequencing Center (Hinxton, United Kingdom), and The Washington University Genome Sequencing Center (Saint Louis, MO) for sequencing the 630-kb contig.

Footnotes

  • The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

  • ↵1 The University of Texas Southwestern Medical Center Group was supported by National Cancer Institute Grants CA71618, SPORE P50 CA70907, the G. Harold and Leila Y. Mathers Charitable Foundation, and the Leroy “Skip” Malouf Foundation. The University of Texas Southwestern group received technical support from Sherrie Cundiff, Yvonne Feller, Gina Mele, and Mina Viswanathan. The National Cancer Institute Group has been funded in whole with Federal funds from the National Cancer Institute, NIH, under Contract NO1-CO-56000. The United Kingdom Group was supported by grants from Cancer Research Campaign, The Royal Society, and Fundacao para a Cientia e a Technologia. The Karolinska Institute Group was funded by grants from the Swedish Cancer Society, Karolinska Institute, and the Royal Swedish Academy of Science (to E. Z. and to G. K.). The National Cancer Institute and University of Texas Southwestern groups contributed equally to this work.

  • 10 Internet address: http://www.ncbi.nlm.nih.gov/.

  • 11 Internet addresses: http://blast.wustl.edu/ and http://www2.ebi.ac.uk/blast2/.

  • 12 Internet address: http://www.bork.embl-heidelberg.de/Alignment/.

  • 13 Internet address: http://www.hgsc.bcm.tmc.edu/.

  • 14 Internet address: http://www.expasy.ch/tools/.

  • 15 Internet address: http://www.embl-heidelberg.de/.

  • 16 Internet address: http://www.imcb.osaka-u.ac.jp/nakai/psort.html.

  • 17 Internet address: http://drava.etfos.hr/∼zucic/split.html.

  • 18 Internet address: http://www.cbs.dtu.dk/services/TMHMM-1.0/.

  • 19 Internet address: http://www.ebi.ac.uk/interpro/.

  • ↵2 To whom requests for reprints should be addressed (M. I. L.) at Laboratory of Immunobiology, National Cancer Institute, FCRDC, Frederick, MD 21702. Phone: (301) 846-1288; Fax: (301) 846-6145; E-mail: lerman{at}ncifcrf.gov or (J. D. M.) at Hamon

  • 20 Internet address: http://gcg.tigem.it/cgi-bin/uniestass.pl.

  • 21 Internet address: http://gcg.hgmp.mrc.ac.uk/cgi-bin/Est_blast/Est_blast/ESTblast.pl.

  • 22 Internet address: http://www.bioinf.mdc-berlin.de/home.html.new.

  • 23 Internet address: http://ftp.genome.washington.edu/cgi-bin/RepeatMasker.

  • 24 Internet address: http://genome.imb-jena.de/∼schattev/rummage/index.html.

  • 25 Internet address: http://genome.ornl.gov/.

  • 26 Internet address: http://www.ncbi.nlm.nih.gov/genome/guide/HsChr3.shtml.

  • 27 Internet address: http://pompous.swmed.edu/panorama.htm.

  • 28 Internet address: http://www.ncbi.nlm.nih.gov/genome/guide/HsChr3.shtml.

  • 29 Internet address: http://www.ncbi.nlm.nih.gov/genome/guide/HsChr3.shtml.

  • ↵3 University of Texas Southwestern Medical Center at Dallas, United States of America Group (Yoshitaka Sekido, Scott Bader, David Burbee, Kwun Fong, Eva Forgacs, Boning Gao, Harold Garner, Adi F. Gazdar, Luc Girard, Craig Kamibayashi, Masashi Kondo, Ashwini Pande, Alex Persemlidis, Vivek Ramalingam, Dwight Randle, Yoshio Tomizawa, Arvind Virmani, Ivan Wistuba, Grace Zeng, and John D. Minna); University of Birmingham, United Kingdom Group (Donia Macartney, Sofia Honorio, Eamonn Maher, and Farida Latif); The Karolinska Institute, Sweden, Group (Vladimir Kashuba, Alexei Protopopov, Jingfeng Li, George Klein, and Eugene Zabarovsky); and National Cancer Institute, United States of America Group [Intramural Research Support Program, Science Applications International Corporation-Frederick, and Laboratory of Immunobiology, National Cancer Institute, Frederick Cancer Research and Development Center, Frederick, Maryland 21702] (Fuh-Mei Duh, Ming-Hui Wei, Laura Geil, Igor Kuzmin, and Sergei Ivanov; F-M. D. and M-H. W. contributed equally to this work), [Laboratory of Immunobiology, National Cancer Institute, Frederick Cancer Research and Development Center, Frederick, Maryland 21702] (Debora Angeloni, Alla Ivanova, Bert Zbar, and Michael I. Lerman), and [Medicine Branch at the National Naval Medical Center, National Cancer Institute, NIH, Bethesda, Maryland 20892] (Bruce E. Johnson).

  • 30 Internet address: http://morgan.harvard.edu/.

  • 31 Internet addresses: http://www.sanger.ac.uk/Projects/C_elegans/, http://genome.wustl.edu/gsc/C_elegans/elegans.shtml, and http://www.ncbi.nlm.nih.gov/.

  • 32 E. Koonin, personal communication.

  • 33 D. G. Burbee, E. Forgacs, S. Zöchbauer-Müller, L. Shivakumar, K. Fong, B. Gao, D. Randle, A. Virmani, S. Bader, Y. Sekido, F. Latif, S. Milchgrub, A. F. Gazdar, M. I. Lerman, E. Zabarovsky, M. White, and J. D. Minna. RASSF1A in the 3p21.3 homozygous deletion region: epigenetic inactivation in lung and breast cancer and suppression of the malignant phenotype, manuscript in preparation.

  • 34 S. Bader, F. Latif, Y. Sekido, M-H. Wei, F-M. Duh, J-Y. Chen, K. Tartoff, C-C. Lee, V. Kashuba, R. Gizatullin, E. Zabarovsky, G. Klein, B. Zbar, A. F. Gazdar, M. I. Lerman, and J. D. Minna, unpublished data.

  • 35 Internet address: http://bio.cse.psu.edu/pipmaker/.

  • ↵4 The abbreviations used are: TSG, tumor suppressor gene; SCLC, small cell lung cancer; NSCLC, non-small cell lung cancer; NCI, National Cancer Institute; LOH, loss of heterozygosity; EST, expressed sequence tag; MTN, multiple tissue Northern blots from ClonTech; SSCP, single strand conformation polymorphism; ORF, open reading frame; NLS, nuclear localization signal; FISH, fluorescence in situ hybridization; VWA, Von Willebrand factor type A; PKC, protein kinase C; RT-PCR, reverse transcription-PCR; DAG, diacylglycerol.

  • ↵5 Internet address: http://genome.wustl.edu/gsc/human/chrom3.shtml.

  • ↵6 Internet address: http://www.sanger.ac.uk/cgi-bin/hummap?map=3p21.3.

  • ↵7 Internet address: http://www-bio.llnl.gov/bbrp/image/image.html.

  • ↵8 Internet address: http://pompous.swmed.edu/panorama.htm.

  • ↵9 Internet address:; http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast.

  • Received May 4, 2000.
  • Accepted August 29, 2000.
  • ©2000 American Association for Cancer Research.

References

  1. ↵
    Devesa S. S., Grauman D. J., Blot W. J., Fraumeni J. F., Jr. Cancer surveillance series: changing geographic patterns of lung cancer mortality in the United States, 1950 through 1994. J. Natl. Cancer Inst., 91: 1040-1050, 1999.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    Landis S. H., Murray T., Bolden S., Wingo P. A. Cancer statistics, 1999. CA Cancer J. Clin., 49: 8-31, 1999.
    OpenUrlCrossRefPubMed
  3. ↵
    Parkin D. M., Pisani P., Ferlay J. Global cancer statistics. CA Cancer J. Clin., 49: 33-64, 1999.
    OpenUrlCrossRefPubMed
  4. ↵
    Blot W., Fraumeni J. Cancers of the lung and pleura Schottenfeld D. Fraumeni J. eds. . Cancer Epidemiology and Prevention, : 263-665, Oxford University Press New York 1996.
  5. ↵
    Baron J., Ronan T. Tobacco Schottenfeld D. Fraumeni J. eds. . Cancer Epidemiology and Prevention, : 269-289, Oxford University Press New York 1996.
  6. ↵
    Sekido Y., Fong K., Minna J. Progress in understanding the molecular pathogenesis of human lung cancer. Biochim. Biophys. Acta, 1378: 21-59, 1998.
    OpenUrl
  7. ↵
    Sellers T. A., Bailey-Wilson J. E., Elston R. C., Wilson A. F., Elston G. Z., Ooi W. L., Rothschild H. Evidence for Mendelian inheritance in the pathogenesis of lung cancer. J. Natl. Cancer Inst., 82: 1272-1279, 1990.
    OpenUrlAbstract/FREE Full Text
  8. ↵
    Virmani A. K., Fong K. M., Kodagoda D., McIntire D., Hung J., Tonk V., Minna J. D., Gazdar A. F. Allelotyping demonstrates common and distinct patterns of chromosomal loss in human lung cancer types. Genes Chromosomes Cancer, 21: 308-319, 1998.
    OpenUrlCrossRefPubMed
  9. ↵
    Kok K., Naylor S. L., Buys C. H. Deletions of the short arm of chromosome 3 in solid tumors and the search for suppressor genes. Adv. Cancer Res., 71: 27-92, 1997.
    OpenUrlPubMed
  10. ↵
    Sekido Y., Ahmadian M., Wistuba I., Latif F., Bader S., Wei M-H., Duh F-M., Gazdar A., Lerman M., Minna J. Cloning of a breast cancer homozygous deletion junction narrows the region of search for a 3p21.3 tumor suppressor gene. Oncogene, 16: 3151-3157, 1998.
    OpenUrlCrossRefPubMed
  11. ↵
    Sundaresan V., Chung G., Heppell-Parton A., Xiong J., Grundy C., Roberts I., James L., Cahn A., Bench A., Douglas J., Minna J., Sekido Y., Lerman M., Latif F., Bergh J., Li H., Lowe N., Ogilvie D., Rabbitts P. Homozygous deletions at 3p12 in breast and lung cancer. Oncogene, 17: 1723-1729, 1998.
    OpenUrlCrossRefPubMed
  12. ↵
    Hung J., Kishimoto Y., Sugio K., Virmani A., McIntire D. D., Minna J. D., Gazdar A. F. Allele-specific chromosome 3p deletions occur at an early stage in the pathogenesis of lung carcinoma. J. Am. Med. Assoc., 273: 558-563, 1995.
    OpenUrlCrossRefPubMed
  13. ↵
    O’Connell P., Pekkel V., Fuqua S. A., Osborne C. K., Clark G. M., Allred D. C. Analysis of loss of heterozygosity in 399 premalignant breast lesions at 15 genetic loci. J. Natl. Cancer Inst., 90: 697-703, 1998.
    OpenUrlAbstract/FREE Full Text
  14. ↵
    Wistuba I. I., Behrens C., Virmani A. K., Mele G., Milchgrub S., Girard L., Fondon J. W., III,, Garner H. R., McKay B., Latif F., Lerman M. I., Lam S., Gazdar A. F., Minna J. D. High resolution chromosome 3p allelotyping of human lung cancer and preneoplastic/preinvasive bronchial epithelium reveals multiple, discontinuous sites of 3p allele loss and three regions of frequent breakpoints. Cancer Res., 60: 1949-1960, 2000.
    OpenUrlAbstract/FREE Full Text
  15. ↵
    Sundaresan V., Ganly P., Hasleton P., Rudd R., Sinha G., Bleehen N. M., Rabbitts P. p53 and chromosome 3 abnormalities, characteristic of malignant lung tumours, are detectable in preinvasive lesions of the bronchus. Oncogene, 7: 1989-1997, 1992.
    OpenUrlPubMed
  16. ↵
    Wistuba I., Behrens C., Milchgrub S., Bryant D., Hung J., Minna J. D., Gazdar A. F. Sequential molecular abnormalities are involved in the multistage development of squamous cell lung carcinoma. Oncogene, 18: 643-650, 1999.
    OpenUrlCrossRefPubMed
  17. ↵
    Kok K., van den Berg A., Veldhuis P. M., van der Veen A. Y., Franke M., Schoenmakers E. F., Hulsbeek M. M., van der Hout A. H., de Leij L., van de Ven W., Buys C. H. C. M. A homozygous deletion in a small cell lung cancer cell line involving a 3p21 region with a marked instability in yeast artificial chromosomes. Cancer Res., 54: 4183-4187, 1994.
    OpenUrlAbstract/FREE Full Text
  18. ↵
    Wei M. H., Latif F., Bader S., Kashuba V., Chen J. Y., Duh F. M., Sekido Y., Lee C. C., Geil L., Kuzmin I., Zabarovsky E., Klein G., Zbar B., Minna J. D., Lerman M. I. Construction of a 600-kilobase cosmid clone contig and generation of a transcriptional map surrounding the lung cancer tumor suppressor gene (TSG) locus on human chromosome 3p21.3: progress toward the isolation of a lung cancer TSG. Cancer Res., 56: 1487-1492, 1996.
    OpenUrlAbstract/FREE Full Text
  19. ↵
    Roche J., Boldog F., Robinson M., Robinson L., Varella-Garcia M., Swanton M., Waggoner B., Fishel R., Franklin W., Gemmill R., Drabkin H. Distinct 3p21.3 deletions in lung cancer and identification of a new human semaphorin. Oncogene, 12: 1289-1297, 1996.
    OpenUrlPubMed
  20. ↵
    Daly M. C., Xiang R. H., Buchhagen D., Hensel C. H., Garcia D. K., Killary A. M., Minna J. D., Naylor S. L. A homozygous deletion on chromosome 3 in a small cell lung cancer cell line correlates with a region of tumor suppressor activity. Oncogene, 8: 1721-1729, 1993.
    OpenUrlPubMed
  21. ↵
    Didsbury J. R., Ho Y. S., Snyderman R. Human Gi protein α-subunit: deduction of amino acid structure from a cloned cDNA. FEBS Lett., 211: 160-164, 1987.
    OpenUrlCrossRefPubMed
  22. ↵
    Van Dop C., Medynski D. C., Apone L. M. Nucleotide sequence for a cDNA encoding the α subunit of retinal transducin (GNAT1) isolated from the human eye. Nucleic Acids Res., 17: 4887 1989.
    OpenUrlFREE Full Text
  23. ↵
    Baylin S. B., Herman J. G., Graff J. R., Vertino P. M., Issa J. P. Alterations in DNA methylation: a fundamental aspect of neoplasia. Adv. Cancer Res., 72: 141-196, 1998.
    OpenUrlPubMed
  24. ↵
    Phelps R. M., Johnson B. E., Ihde D. C., Gazdar A. F., Carbone D. P., McClintock P. R., Linnoila R. I., Matthews M. J., Bunn P. A., Jr.,, Carney D., Minna J. D., Mulshine J. L. NCI-Navy Medical Oncology Branch cell line data base. J. Cell. Biochem., 24(Suppl.): 32s-91s, 1996.
  25. ↵
    Heppell-Parton A. C., Nacheva E., Carter N. P., Rabbitts P. H. A combined approach of conventional and molecular cytogenetics for detailed karyotypic analysis of the small cell lung carcinoma cell line U2020. Cancer Genet. Cytogenet., 108: 110-119, 1999.
    OpenUrlCrossRefPubMed
  26. ↵
    Johnson B. E., Makuch R. W., Simmons A. D., Gazdar A. F., Burch D., Cashell A. W. myc family DNA amplification in small cell lung cancer patients’ tumors and corresponding cell lines. Cancer Res., 48: 5163-5166, 1988.
    OpenUrlAbstract/FREE Full Text
  27. ↵
    Burge C., Karlin S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol., 268: 78-94, 1997.
    OpenUrlCrossRefPubMed
  28. ↵
    Sambrook, J., Fritsch, E., and Maniatis, T. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory, 1989.
  29. ↵
    Protopopov A. I., Gizatullin R. Z., Vorobieva N. V., Protopopova M. V., Kiss C., Kashuba V. I., Klein G., Kisselev L. L., Graphodatsky A. S., Zabarovsky E. R. Human chromosome 3: high-resolution fluorescence in situ hybridization mapping of 40 unique NotI linking clones homologous to genes and cDNAs. Chromosome Res., 4: 443-447, 1996.
    OpenUrlCrossRefPubMed
  30. ↵
    Yao M., Latif F., Orcutt M. L., Kuzmin I., Stackhouse T., Zhou F. W., Tory K., Duh F. M., Richards F., Maher E., et al von Hippel-Lindau disease: identification of deletion mutations by pulsed-field gel electrophoresis. Hum. Genet., 92: 605-614, 1993.
    OpenUrlCrossRefPubMed
  31. ↵
    Altschul S. F., Gish W. Local alignment statistics. Methods Enzymol., 266: 460-480, 1996.
    OpenUrlCrossRefPubMed
  32. ↵
    Sekido Y., Bader S., Latif F., Gnarra J. R., Gazdar A. F., Linehan W. M., Zbar B., Lerman M. I., Minna J. D. Molecular analysis of the von Hippel-Lindau disease tumor suppressor gene in human lung cancer cell lines. Oncogene, 9: 1599-1604, 1994.
    OpenUrlPubMed
  33. ↵
    Forgacs E., Biesterveld E. J., Sekido Y., Fong K., Muneer S., Wistuba I. I., Milchgrub S., Brezinschek R., Virmani A., Gazdar A. F., Minna J. D. Mutation analysis of the PTEN/MMAC1 gene in lung cancer. Oncogene, 17: 1557-1565, 1998.
    OpenUrlCrossRefPubMed
  34. ↵
    Burks C. Molecular biology database list. Nucleic Acids Res., 27: 1-9, 1999.
    OpenUrlAbstract/FREE Full Text
  35. ↵
    Wisconsin Package Version 10.0, Genetics Computer Group (GCG) Madison, WI, 1999.
  36. ↵
    Altschul S. F., Madden T. L., Schaffer A. A., Zhang J., Zhang Z., Miller W., Lipman D. J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25: 3389-3402, 1997.
    OpenUrlAbstract/FREE Full Text
  37. ↵
    Bateman A., Birney E., Durbin R., Eddy S. R., Finn R. D., Sonnhammer E. L. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res., 27: 260-262, 1999.
    OpenUrlAbstract/FREE Full Text
  38. ↵
    Schultz J., Milpetz F., Bork P., Ponting C. P. SMART, a simple modular architecture research tool: identification of signaling domains. Proc. Natl. Acad. Sci. USA, 95: 5857-5864, 1998.
    OpenUrlAbstract/FREE Full Text
  39. ↵
    Horton P., Nakai K. Better prediction of protein cellular localization sites with the k nearest neighbors classifier. ISMB, 5: 147-152, 1997.
  40. ↵
    Juretic, D., Lucic, B., Zucic, D., and Trinajstic, N. Protein transmembrane structure: recognition and prediction by using hydrophobicity scales through preference functions. In: C. Parkanyi (ed.), Theoretical and Computational Chemistry. Theoretical Organic Chemistry, pp. 405–445. Amsterdam: Elsevier Science Publishers, 1998.
  41. ↵
    Sonnhammer E. L. L., von Heijne G., Krogh A. A hidden Markov model for predicting transmembrane helices in protein sequences Glasgow J. Littlejohn T. Major F. Lathrop R. Sankoff D. Sensen C. eds. . Proceedings of Sixth International Conference on Intelligent Systems for Molecular Biology, : 175-182, American Association for Artificial Intelligence Press Menlo Park, CA 1998.
  42. ↵
    Nevill-Manning C. G., Wu T. D., Brutlag D. L. Highly specific protein sequence motifs for genome analysis. Proc. Natl. Acad. Sci. USA, 95: 5865-5871, 1998.
    OpenUrlAbstract/FREE Full Text
  43. ↵
    Hofmann K., Bucher P., Falquet L., Bairoch A. The PROSITE database, its status in 1999. Nucleic Acids Res., 27: 215-219, 1999.
    OpenUrlAbstract/FREE Full Text
  44. ↵
    Apweiler, R., Attwood, T., Bairoch, A., Bateman, A., Birney, E., Bucher, P., Codani, J-J., Corpet, F., Croning, M., Durbin, R., Etzold, T., Fleischmann, W., Gouzy, J., Hermjakob, H., Jonassen, I., Kahn, D., Kanapin, A., Schneider, R., Servant, F., and Zdobnov, E. INTERPRO. CCP11 Newsletter, March 1, 2000.
  45. ↵
    Mushegian A. R., Bassett D. E., Jr.,, Boguski M. S., Bork P., Koonin E. V. Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs [see comments]. Proc. Natl. Acad. Sci. USA, 94: 5831-5836, 1997.
    OpenUrlAbstract/FREE Full Text
  46. ↵
    Mushegian A. R., Garey J. R., Martin J., Liu L. X. Large-scale taxonomic profiling of eukaryotic model organisms: a comparison of orthologous proteins encoded by the human, fly, nematode, and yeast genomes. Genome Res., 8: 590-598, 1998.
    OpenUrlAbstract/FREE Full Text
  47. ↵
    Makalowski W., Zhang J., Boguski M. S. Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences. Genome Res., 6: 846-857, 1996.
    OpenUrlAbstract/FREE Full Text
  48. ↵
    Yuan Y. P., Eulenstein O., Vingron M., Bork P. Towards detection of orthologues in sequence databases. Bioinformatics (Oxf.), 14: 285-289, 1998.
    OpenUrlAbstract/FREE Full Text
  49. ↵
    Shah, M. B., Guan, X., Einstein, J. R., Matis, S., Xu, Y., Mural, R. J., and Uberbacher, E. C. User’s Guide to GRAIL, and GENQUEST (Sequence Analysis, Gene Assembly And Sequence Comparison Systems) (July, 1994). E-mail Servers and XGRAIL (Version 1.2) and XGENQUEST (Version 1.1). 1994.
  50. ↵
    Hodges P. E., McKee A. H., Davis B. P., Payne W. E., Garrels J. I. The Yeast Proteome Database (YPD): a model for the organization and presentation of genome-wide functional data. Nucleic Acids Res., 27: 69-73, 1999.
    OpenUrlAbstract/FREE Full Text
  51. ↵
    Todd S., Franklin W. A., Varella-Garcia M., Kennedy T., Hilliker C. E., Jr.,, Hahner L., Anderson M., Wiest J. S., Drabkin H. A., Gemmill R. M. Homozygous deletions of human chromosome 3p in lung tumors. Cancer Res., 57: 1344-1352, 1997.
    OpenUrlAbstract/FREE Full Text
  52. ↵
    Todd M. C., Xiang R. H., Garcia D. K., Kerbacher K. E., Moore S. L., Hensel C. H., Liu P., Siciliano M. J., Kok K., van den Berg A., Veldhuis P., Buys C. H., Killary A. M., Naylor S. L. An 80 Kb P1 clone from chromosome 3p21.3 suppresses tumor growth in vivo. Oncogene, 13: 2387-2396, 1996.
    OpenUrlPubMed
  53. ↵
    Cheng Y., Poulos N. E., Lung M. L., Hampton G., Ou B., Lerman M. I., Stanbridge E. J. Functional evidence for a nasopharyngeal carcinoma tumor suppressor gene that maps at chromosome 3p21.3. . Proc. Natl. Acad. Sci. USA, 95: 3042-3047, 1998.
    OpenUrlAbstract/FREE Full Text
  54. ↵
    Killary A. M., Wolf M. E., Giambernardi T. A., Naylor S. L. Definition of a tumor suppressor locus within human chromosome 3p21–p22. Proc. Natl. Acad. Sci. USA, 89: 10877-10881, 1992.
    OpenUrlAbstract/FREE Full Text
  55. ↵
    Sithanandam G., Latif F., Smola U., Bernal R. A., Duh F-M., Li H., Kuzmin I., Wixler V., Geil L., Shresta S., Lloyd P. A., Bader S., Sekido Y., Tartof K. D., Kashuba V. I., Zabarovsky E. R., Dean M., Klein G., Zbar B., Lerman M. I., Minna J. D., Rapp U. R., Allikmets A. 3PK, a new map kinase activated protein kinase, located in the small cell lung cancer tumor suppressor gene region. Mol. Cell. Biol., 16: 868-876, 1996.
    OpenUrlAbstract/FREE Full Text
  56. ↵
    Latif F., Duh F. M., Bader S., Sekido Y., Li H., Geil L., Zbar B., Minna J. D., Lerman M. I. The human homolog of the rodent immediate early response genes, PC4 and TIS7, resides in the lung cancer tumor suppressor gene region on chromosome 3p21. Hum. Genet., 99: 334-341, 1997.
    OpenUrlCrossRefPubMed
  57. ↵
    Sekido Y., Bader S., Latif F., Chen J. Y., Duh F. M., Wei M. H., Albanesi J. P., Lee C. C., Lerman M. I., Minna J. D. Human semaphorins A(V) and IV reside in the 3p21.3 small cell lung cancer deletion region and demonstrate distinct expression patterns. Proc. Natl. Acad. Sci. USA, 93: 4120-4125, 1996.
    OpenUrlAbstract/FREE Full Text
  58. ↵
    Erdmann V. A., Szymanski M., Hochberg A., de Groot N., Barciszewski J. Collection of mRNA-like non-coding RNAs. Nucleic Acids Res., 27: 192-195, 1999.
    OpenUrlAbstract/FREE Full Text
  59. ↵
    Schwartz S., Zhang Z., Frazer K., Smit A., Riemer C., Bouck J., Gibbs R., Hardison R., Miller L. W. PipMakerA web server for aligning two genomic DNA sequences. Genome Res., 10: 577-586, 2000.
    OpenUrlAbstract/FREE Full Text
  60. ↵
    Blechschmidt K., Schweiger M., Wertz K., Poulson R., Christensen H. M., Rosenthal A., Lehrach H., Yaspo M. L. The mouse Aire gene: comparative genomic sequencing, gene organization, and expression. Genome Res., 9: 158-166, 1999.
    OpenUrlAbstract/FREE Full Text
  61. ↵
    Lee I. Y., Westaway D., Smit A. F., Wang K., Seto J., Chen L., Acharya C., Ankener M., Baskin D., Cooper C., Yao H., Prusiner S. B., Hood L. E. Complete genomic sequence and analysis of the prion protein gene region from three mammalian species. Genome Res., 8: 1022-1037, 1998.
    OpenUrlAbstract/FREE Full Text
  62. ↵
    Uchida K., Yoshimura A., Inazawa J., Yanagisawa K., Osada H., Masuda A., Saito T., Takahashi T., Miyajima A. Molecular cloning of CISH, chromosome assignment to 3p21.3, and analysis of expression in fetal and adult tissues. Cytogenet. Cell Genet., 78: 209-212, 1997.
    OpenUrl
  63. ↵
    Xiang R. H., Hensel C. H., Garcia D. K., Carlson H. C., Kok K., Daly M. C., Kerbacher K., van den Berg A., Veldhuis P., Buys C. H. C. M., Naylor S. L. Isolation of the human semaphorin III/F gene (SEMA3F) at chromosome 3p21, a region deleted in lung cancer. Genomics, 32: 39-48, 1996.
    OpenUrlCrossRefPubMed
  64. ↵
    Jackson A. L., Loeb L. A. The mutation rate and cancer. Genetics, 148: 1483-1490, 1998.
    OpenUrlAbstract/FREE Full Text
  65. ↵
    Loeb L. A. Cancer cells exhibit a mutator phenotype. Adv. Cancer Res., 72: 25-56, 1998.
    OpenUrlPubMed
  66. ↵
    The FlyBase database of the Drosophila Genome Projects and community literature. The FlyBase Consortium. Nucleic Acids Res., 27: 85-88, 1999.
    OpenUrlAbstract/FREE Full Text
  67. ↵
    Gurnett C. A., De Waard M., Campbell K. P. Dual function of the voltage-dependent Ca2+ channel α2δ subunit in current stimulation and subunit interaction. Neuron, 16: 431-440, 1996.
    OpenUrlCrossRefPubMed
  68. ↵
    Gao B., Sekido Y., Maximov A., Saad M., Forgacs E., Latif F., Wei M. H., Lerman M., Lee J. H., Perez-Reyes E., Bezprozvanny I., Minna J. D. Functional properties of a new voltage-dependent calcium channel α2δ auxiliary subunit gene (CACNA2D2). J. Biol. Chem., 275: 12237-12242, 2000.
    OpenUrlAbstract/FREE Full Text
  69. ↵
    Wolford J. K., Prochazka M. Structure and expression of the human MTG8/ETO gene. Gene (Amst.), 212: 103-109, 1998.
    OpenUrlCrossRefPubMed
  70. ↵
    LeBoeuf R. D., Ban E. M., Green M. M., Stone A. S., Propst S. M., Blalock J. E., Tauber J. D. Molecular cloning, sequence analysis, expression, and tissue distribution of suppressin, a novel suppressor of cell cycle entry. J. Biol. Chem., 273: 361-368, 1998.
    OpenUrlAbstract/FREE Full Text
  71. ↵
    Vavvas D., Li X., Avruch J., Zhang X. F. Identification of Nore1 as a potential Ras effector. J. Biol. Chem., 273: 5439-5442, 1998.
    OpenUrlAbstract/FREE Full Text
  72. ↵
    Kim S. T., Lim D. S., Canman C. E., Kastan M. B. Substrate specificities and identification of putative substrates of ATM kinase family members. J. Biol. Chem., 274: 37538-37543, 1999.
    OpenUrlAbstract/FREE Full Text
  73. ↵
    Lepperdinger G., Strobl B., Kreil G. HYAL2, a human gene expressed in many cells, encodes a lysosomal hyaluronidase with a novel type of specificity. J. Biol. Chem., 273: 22466-22470, 1998.
    OpenUrlAbstract/FREE Full Text
  74. ↵
    Strobl B., Wechselberger C., Beier D. R., Lepperdinger G. Structural organization and chromosomal localization of Hyal2, a gene encoding a lysosomal hyaluronidase. Genomics, 53: 214-219, 1998.
    OpenUrlCrossRefPubMed
  75. ↵
    Triggs-Raine B., Salo T. J., Zhang H., Wicklow B. A., Natowicz M. R. Mutations in HYAL1, a member of a tandemly distributed multigene family encoding disparate hyaluronidase activities, cause a newly described lysosomal disorder, mucopolysaccharidosis IX. Proc. Natl. Acad. Sci. USA, 96: 6296-6300, 1999.
    OpenUrlAbstract/FREE Full Text
  76. ↵
    Csoka T. B., Frost G. I., Heng H. H., Scherer S. W., Mohapatra G., Stern R. The hyaluronidase gene HYAL1 maps to chromosome 3p21.2-p21.3 in human and 9F1–F2 in mouse, a conserved candidate tumor suppressor locus. Genomics, 48: 63-70, 1998.
    OpenUrlCrossRefPubMed
  77. ↵
    Henikoff J. G., Henikoff S., Pietrokovski S. New features of the Blocks Database servers. Nucleic Acids Res., 27: 226-228, 1999.
    OpenUrlAbstract/FREE Full Text
  78. ↵
    Zegerman P., Bannister A. J., Kouzarides T. The putative tumour suppressor Fus-2 is an N-acetyltransferase. Oncogene, 19: 161-163, 2000.
    OpenUrlCrossRefPubMed
  79. ↵
    Csoka A. B., Scherer S. W., Stern R. Expression analysis of six paralogous human hyaluronidase genes clustered on chromosomes 3p21 and 7q31. Genomics, 60: 356-361, 1999.
    OpenUrlCrossRefPubMed
  80. ↵
    Buanne P., Incerti B., Guardavaccaro D., Avvantaggiato V., Simeone A., Tirone F. Cloning of the human interferon-related developmental regulator (IFRD1) gene coding for the PC4 protein, a member of a novel family of developmentally regulated genes. Genomics, 51: 233-242, 1998.
    OpenUrlCrossRefPubMed
  81. ↵
    Timmer T., Terpstra P., van den Berg A., Veldhuis P. M., Ter Elst A., Voutsinas G., Hulsbeek M. M., Draaijers T. G., Looman M. W., Kok K., Naylor S. L., Buys C. H. A comparison of genomic structures and expression patterns of two closely related flanking genes in a critical lung cancer region at 3p21.3. Eur. J. Hum. Genet., 7: 478-486, 1999.
    OpenUrlCrossRefPubMed
  82. ↵
    Drabkin H. A., West J. D., Hotfilder M., Heng Y. M., Erickson P., Calvo R., Dalmau J., Gemmill R. M., Sablitzky F. DEF-3(g16/NY-LU-12), an RNA binding protein from the 3p21.3 homozygous deletion region in SCLC.. Oncogene, 18: 2589-2597, 1999.
    OpenUrlCrossRefPubMed
  83. ↵
    Gure A. O., Altorki N. K., Stockert E., Scanlan M. J., Old L. J., Chen Y. T. Human lung cancer antigens recognized by autologous antibodies: definition of a novel cDNA derived from the tumor suppressor gene locus on chromosome 3p21.3. Cancer Res., 58: 1034-1041, 1998.
    OpenUrlAbstract/FREE Full Text
  84. ↵
    Whang-Peng J., Knutsen T., Gazdar A., Steinberg S. M., Oie H., Linnoila I., Mulshine J., Nau M., Minna J. D. Nonrandom structural and numerical chromosome changes in non-small-cell lung cancer. Genes Chromosomes Cancer, 3: 168-188, 1991.
    OpenUrlPubMed
  85. ↵
    Whang-Peng J., Kao-Shan C. S., Lee E. C., Bunn P. A., Carney D. N., Gazdar A. F., Minna J. D. Specific chromosome defect associated with human small-cell lung cancer: deletion 3p(14–23). Science (Washington DC), 215: 181-182, 1982.
    OpenUrlAbstract/FREE Full Text
  86. ↵
    Whang-Peng J., Bunn P. A., Jr.,, Kao-Shan C. S., Lee E. C., Carney D. N., Gazdar A. F., Minna J. D. A nonrandom chromosomal abnormality, del 3p(14–23), in human small cell lung cancer (SCLC). Cancer Genet. Cytogenet., 6: 119-134, 1982.
    OpenUrlCrossRefPubMed
  87. ↵
    Naylor S. L., Johnson B. E., Minna J. D., Sakaguchi A. Y. Loss of heterozygosity of chromosome 3p markers in small-cell lung cancer. Nature (Lond.), 329: 451-454, 1987.
    OpenUrlCrossRefPubMed
  88. ↵
    Brauch H., Johnson B., Hovis J., Yano T., Gazdar A. F., Pettengill O. S., Graziano S., Sorenson G. D., Poiesz B. J., Minna J. D., Linehan M., Zbar B. Molecular analysis of the short arm of chromosome 3 in small-cell and non-small-cell carcinoma of the lung. N. Engl. J. Med., 317: 1109-1113, 1987.
    OpenUrlPubMed
  89. ↵
    Johnson B. E., Sakaguchi A. Y., Gazdar A. F., Minna J. D., Burch D., Marshall A., Naylor S. L. Restriction fragment length polymorphism studies show consistent loss of chromosome 3p alleles in small cell lung cancer patients’ tumors. J. Clin. Investig., 82: 502-507, 1988.
  90. ↵
    Brauch H., Tory K., Kotler F., Gazdar A. F., Pettengill O. S., Johnson B., Graziano S., Winton T., Buys C. H., Sorenson G. D., Poiesz B. J., Minna J. D., Zbar B. Molecular mapping of deletion sites in the short arm of chromosome 3 in human lung cancer. Genes Chromosomes Cancer, 1: 240-246, 1990.
    OpenUrlCrossRefPubMed
  91. ↵
    Bell D., Testa J. Chromosomes in lung cancer Wolman S. Sell S. eds. . Human Cytogenetic Cancer Markers, : 247-287, Humana Press, Inc. Totowa, NJ 1998.
  92. ↵
    Hibi K., Takahashi T., Yamakawa K., Ueda R., Sekido Y., Ariyoshi Y., Suyama M., Takagi H., Nakamura Y., Takahashi T. Three distinct regions involved in 3p deletion in human lung cancer. Oncogene, 7: 445-449, 1992.
    OpenUrlPubMed
  93. ↵
    Sozzi G., Veronese M. L., Negrini M., Baffa R., Cotticelli M. G., Inoue H., Tornielli S., Pilotti S., DeGregorio L., Pastorino U., Pierotti M. A., Ohta M., Huebner K., Croce C. M. The FHIT gene at 3p14.2 is abnormal in lung cancer. Cell, 85: 17-26, 1996.
    OpenUrlCrossRefPubMed
  94. ↵
    Yamakawa K., Takahashi T., Horio Y., Murata Y., Takahashi E., Hibi K., Yokoyama S., Ueda R., Nakamura Y. Frequent homozygous deletions in lung cancer cell lines detected by a DNA marker located at 3p21.3-p22. Oncogene, 8: 327-330, 1993.
    OpenUrlPubMed
  95. ↵
    Fong K., Biesterveld E., Virmani A., Wistuba I., Sekido Y., Bader S., Ahmadian M., Ong S., Rassool F., Zimmerman P., Giaccone G., Gazdar A., Minna J. D. FHIT and FRA3B 3p14.2 allele loss are common in lung cancer and preneoplastic bronchial lesions and are associated with cancer related FHIT cDNA splicing aberrations. Cancer Res., 57: 2256-2267, 1997.
    OpenUrlAbstract/FREE Full Text
  96. ↵
    Rabbitts P., Bergh J., Douglas J., Collins F., Waters J. A submicroscopic homozygous deletion at the D3S3 locus in a cell line isolated from a small cell lung carcinoma. Genes Chromosomes Cancer, 2: 231-238, 1990.
    OpenUrlPubMed
  97. ↵
    Latif F., Tory K., Modi W. S., Graziano S. L., Gamble G., Douglas J., Heppell P. A., Rabbitts P. H., Zbar B., Lerman M. I. Molecular characterization of a large homozygous deletion in the small cell lung cancer cell line U2020: a strategy for cloning the putative tumor suppressor gene. Genes Chromosomes Cancer, 5: 119-127, 1992.
    OpenUrlPubMed
  98. ↵
    Drabkin H. A., Mendez M. J., Rabbitts P. H., Varkony T., Bergh J., Schlessinger J., Erickson P., Gemmill R. M. Characterization of the submicroscopic deletion in the small-cell lung carcinoma (SCLC) cell line U2020. Genes Chromosomes Cancer, 5: 67-74, 1992.
    OpenUrlPubMed
  99. ↵
    Ishikawa S., Kai M., Tamari M., Takei Y., Takeuchi K., Bandou H., Yamane Y., Ogawa M., Nakamura Y. Sequence analysis of a 685-kb genomic region on chromosome 3p22–p21.3 that is homozygously deleted in a lung carcinoma cell line. DNA Res., 4: 35-43, 1997.
    OpenUrlAbstract
  100. ↵
    Wistuba I. I., Lam S., Behrens C., Virmani A. K., Fong K. M., LeRiche J., Samet J. M., Srivastava S., Minna J. D., Gazdar A. F. Molecular damage in the bronchial epithelium of current and former smokers. J. Natl. Cancer Inst., 89: 1366-1373, 1997.
    OpenUrlAbstract/FREE Full Text
  101. ↵
    Wu X., Zhao Y., Honn S. E., Tomlinson G. E., Minna J. D., Hong W. K., Spitz M. R. Benzo[a]pyrene diol epoxide-induced 3p21.3 aberrations and genetic predisposition to lung cancer. Cancer Res., 58: 1605-1608, 1998.
    OpenUrlAbstract/FREE Full Text
  102. ↵
    Caldas, C., Hahn, S. A., da Costa, L. T., Redston, M. S., Schutte, M., Seymour, A. B., Weinstein, C. L., Hruban, R. H., Yeo, C. J., and Kern, S. E. Frequent somatic mutations and homozygous deletions of the p16 (MTS1) gene in pancreatic adenocarcinoma [published erratum appears in Nat. Genet., 8: 410, 1994]. Nat. Genet., 8: 27–32, 1994.
  103. ↵
    Hahn S. A., Schutte M., Hoque A. T., Moskaluk C. A., da Costa L. T., Rozenblum E., Weinstein C. L., Fischer A., Yeo C. J., Hruban R. H., Kern S. E. DPC4, a candidate tumor suppressor gene at human chromosome 18q21.1. Science (Washington DC), 271: 350-353, 1996.
    OpenUrlAbstract
  104. ↵
    Schutte M., da Costa L. T., Hahn S. A., Moskaluk C., Hoque A. T., Rozenblum E., Weinstein C. L., Bittner M., Meltzer P. S., Trent J. M., Yeo C., Hruban R., Kern S. Identification by representational difference analysis of a homozygous deletion in pancreatic carcinoma that lies within the BRCA2 region. Proc. Natl. Acad. Sci. USA, 92: 5950-5954, 1995.
    OpenUrlAbstract/FREE Full Text
  105. ↵
    Chen P. L., Chen Y. M., Bookstein R., Lee W. H. Genetic mechanisms of tumor suppression by the human p53 gene. Science (Washington DC), 250: 1576-1580, 1990.
    OpenUrlAbstract/FREE Full Text
  106. ↵
    Harbour J. W., Lai S. L., Whang-Peng J., Gazdar A. F., Minna J. D., Kaye F. J. Abnormalities in structure and expression of the human retinoblastoma gene in SCLC. Science (Washington DC), 241: 353-357, 1988.
    OpenUrlAbstract/FREE Full Text
  107. ↵
    Takahashi T., Nau M. M., Chiba I., Birrer M. J., Rosenberg R. K., Vinocour M., Levitt M., Pass H., Gazdar A. F., Minna J. D. p53: a frequent target for genetic abnormalities in lung cancer. Science (Washington DC), 246: 491-494, 1989.
    OpenUrlAbstract/FREE Full Text
  108. ↵
    Gazzeri S., Gouyer V., Vour’ch C., Brambilla C., Brambilla E. Mechanisms of p16INK4A inactivation in non small-cell lung cancers. Oncogene, 16: 497-504, 1998.
    OpenUrlCrossRefPubMed
  109. ↵
    Girard L., Zöchbauer-Muller S., Virmani A., Gazdar A., Minna J. D. Genome-wide allelotyping of lung cancer identifies new regions of allelic loss, differences between SCLC and NSCLC, and loci clustering. Cancer Res., 60: 4894-4906, 2000.
    OpenUrlAbstract/FREE Full Text
  110. ↵
    Stirzaker C., Millar D. S., Paul C. L., Warnecke P. M., Harrison J., Vincent P. C., Frommer M., Clark S. J. Extensive DNA methylation spanning the Rb promoter in retinoblastoma tumors. Cancer Res., 57: 2229-2237, 1997.
    OpenUrlAbstract/FREE Full Text
  111. ↵
    Toyota M., Ho C., Ohe-Toyota M., Baylin S. B., Issa J. P. Inactivation of CACNA1G, a T-type calcium channel gene, by aberrant methylation of its 5′ CpG island in human tumors. Cancer Res., 59: 4535-4541, 1999.
    OpenUrlAbstract/FREE Full Text
  112. ↵
    Esteller, M., Sanchez-Cespedes, M., Rosell, R., Sidransky, D., Baylin, S. B., and Herman, J. G. Detection of aberrant promoter hypermethylation of tumor suppressor genes in serum DNA from non-small cell lung cancer patients [published erratum appears in Cancer Res., 59: 3853, 1999]. Cancer Res., 59: 67–70, 1999.
  113. ↵
    Weinberg R. A. The molecular basis of oncogenes and tumor suppressor genes. Ann. NY Acad. Sci., 758: 331-338, 1995.
    OpenUrlPubMed
  114. ↵
    White R. L. Tumor suppressing pathways. Cell, 92: 591-592, 1998.
    OpenUrlCrossRefPubMed
  115. ↵
    Fero M. L., Randel E., Gurley K. E., Roberts J. M., Kemp C. J. The murine gene p27Kip1 is haplo-insufficient for tumour suppression. Nature (Lond.), 396: 177-180, 1998.
    OpenUrlCrossRefPubMed
  116. ↵
    Tang B., Bottinger E. P., Jakowlew S. B., Bagnall K. M., Mariano J., Anver M. R., Letterio J. J., Wakefield L. M. Transforming growth factor-β1 is a new form of tumor suppressor with true haploid insufficiency. Nat. Med., 4: 802-807, 1998.
    OpenUrlCrossRefPubMed
  117. ↵
    Zurawel R. H., Allen C., Wechsler-Reya R., Scott M. P., Raffel C. Evidence that haploinsufficiency of ptch leads to medulloblastoma in mice. Genes Chromosomes Cancer, 28: 77-81, 2000.
    OpenUrlCrossRefPubMed
  118. ↵
    Xu X., Brodie S. G., Yang X., Im Y. H., Parks W. T., Chen L., Zhou Y. X., Weinstein M., Kim S. J., Deng C. X. Haploid loss of the tumor suppressor Smad4/Dpc4 initiates gastric polyposis and cancer in mice. Oncogene, 19: 1868-1874, 2000.
    OpenUrlCrossRefPubMed
  119. ↵
    Song W. J., Sullivan M. G., Legare R. D., Hutchings S., Tan X., Kufrin D., Ratajczak J., Resende I. C., Haworth C., Hock R., Loh M., Felix C., Roy D. C., Busque L., Kurnit D., Willman C., Gewirtz A. M., Speck N. A., Bushweller J. H., Li F. P., Gardiner K., Poncz M., Maris J. M., Gilliland D. G. Haploinsufficiency of CBFA2 causes familial thrombocytopenia with propensity to develop acute myelogenous leukaemia. Nat. Genet., 23: 166-175, 1999.
    OpenUrlCrossRefPubMed
  120. ↵
    Fearon E. R. Human cancer syndromes: clues to the origin and nature of cancer. Science (Washington DC), 278: 1043-1050, 1997.
    OpenUrlAbstract/FREE Full Text
  121. ↵
    Kinzler K. W., Vogelstein B. Cancer-susceptibility genes. Gatekeepers and caretakers [see comments]. Nature (Lond.),, 386: 761-763, 1997.
    OpenUrlCrossRefPubMed
  122. ↵
    Knudson A. G., Jr. Mutation and cancer: a personal odyssey. Adv. Cancer Res., 67: 1-23, 1995.
    OpenUrlCrossRefPubMed
  123. ↵
    Moolgavkar S. H., Knudson A. G., Jr. Mutation and cancer: a model for human carcinogenesis. J. Natl. Cancer Inst., 66: 1037-1052, 1981.
  124. ↵
    Zhuang Z., Park W. S., Pack S., Schmidt L., Vortmeyer A. O., Pak E., Pham T., Weil R. J., Candidus S., Lubensky I. A., Linehan W. M., Zbar B., Weirich G. Trisomy 7-harbouring non-random duplication of the mutant MET allele in hereditary papillary renal carcinomas. Nat. Genet., 20: 66-69, 1998.
    OpenUrlCrossRefPubMed
  125. ↵
    Wistuba I. I., Bryant D., Behrens C., Milchgrub S., Virmani A. K., Ashfaq R., Minna J. D., Gazdar A. F. Comparison of features of human lung cancer cell lines and their corresponding tumors. Clin. Cancer Res., 5: 991-1000, 1999.
    OpenUrlAbstract/FREE Full Text
  126. ↵
    Park I. W., Wistuba I. I., Maitra A., Milchgrub S., Virmani A. K., Minna J. D., Gazdar A. F. Multiple clonal abnormalities in the bronchial epithelium of patients with lung cancer. J. Natl. Cancer Inst., 91: 1863-1868, 1999.
    OpenUrlAbstract/FREE Full Text
  127. ↵
    Burri, P. Structural aspects of prenatal and postnatal development and growth of the lung. In: A. McDonald (ed.), Lung Biology in Health and Disease. Lung Growth and Development, pp. 1–35. New York: Marcel Dekker, Inc., 1997.
  128. ↵
    Travis, W. D., Travis, L. B., and Devesa, S. S. Lung cancer [published erratum appears in Cancer (Phila.), 75: 2979, 1999]. Cancer (Phila.), 75: 191–202, 1995.
  129. ↵
    Vuitch F., Sekido Y., Fong K., Mackay B., Minna J. D., Gazdar A. F. Neuroendocrine tumors of the lung. Pathology and molecular biology. Chest Surg. Clin. N. Am., 7: 21-47, 1997.
    OpenUrlPubMed
  130. ↵
    Sanders B. M., Jay M., Draper G. J., Roberts E. M. Non-ocular cancer in relatives of retinoblastoma patients. Br. J. Cancer, 60: 358-365, 1989.
    OpenUrlPubMed
  131. ↵
    Schwartz A. G., Rothrock M., Yang P., Swanson G. M. Increased cancer risk among relatives of nonsmoking lung cancer cases. Genet. Epidemiol., 17: 1-15, 1999.
    OpenUrlCrossRefPubMed
  132. ↵
    Yang P., Schwartz A. G., McAllister A. E., Swanson G. M., Aston C. E. Lung cancer risk in families of nonsmoking probands: heterogeneity by age at diagnosis. Genet. Epidemiol., 17: 253-273, 1999.
    OpenUrlCrossRefPubMed
PreviousNext
Back to top
November 2000
Volume 60, Issue 21
  • Table of Contents

Sign up for alerts

View this article with LENS

Open full page PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for sharing this Cancer Research article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
The 630-kb Lung Cancer Homozygous Deletion Region on Human Chromosome 3p21.3: Identification and Evaluation of the Resident Candidate Tumor Suppressor Genes
(Your Name) has forwarded a page to you from Cancer Research
(Your Name) thought you would be interested in this article in Cancer Research.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
The 630-kb Lung Cancer Homozygous Deletion Region on Human Chromosome 3p21.3: Identification and Evaluation of the Resident Candidate Tumor Suppressor Genes
Michael I. Lerman, John D. Minna and
Cancer Res November 1 2000 (60) (21) 6116-6133;

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
The 630-kb Lung Cancer Homozygous Deletion Region on Human Chromosome 3p21.3: Identification and Evaluation of the Resident Candidate Tumor Suppressor Genes
Michael I. Lerman, John D. Minna and
Cancer Res November 1 2000 (60) (21) 6116-6133;
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • INTRODUCTION
    • MATERIALS AND METHODS
    • RESULTS
    • DISCUSSION
    • Acknowledgments
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF
Advertisement

Related Articles

Cited By...

More in this TOC Section

  • Primary Adenocarcinomas of the Lung in Nonsmokers Show a Distinct Pattern of Allelic Imbalance
  • Lowered Oxygen Tension Induces Expression of the Hypoxia Marker MN/Carbonic Anhydrase IX in the Absence of Hypoxia-inducible Factor 1α Stabilization
  • Semaphorin 3F Gene from Human 3p21.3 Suppresses Tumor Formation in Nude Mice
Show more Molecular Biology & Genetics
  • Home
  • Alerts
  • Feedback
  • Privacy Policy
Facebook  Twitter  LinkedIn  YouTube  RSS

Articles

  • Online First
  • Current Issue
  • Past Issues
  • Meeting Abstracts

Info for

  • Authors
  • Subscribers
  • Advertisers
  • Librarians

About Cancer Research

  • About the Journal
  • Editorial Board
  • Permissions
  • Submit a Manuscript
AACR logo

Copyright © 2021 by the American Association for Cancer Research.

Cancer Research Online ISSN: 1538-7445
Cancer Research Print ISSN: 0008-5472
Journal of Cancer Research ISSN: 0099-7013
American Journal of Cancer ISSN: 0099-7374

Advertisement