Cancer Research  Cancer Health Disparities Conference 2009
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Cancer Research Clinical Cancer Research
Cancer Epidemiology Biomarkers & Prevention Molecular Cancer Therapeutics
Molecular Cancer Research Cancer Prevention Research
Cancer Prevention Journals Portal Cancer Reviews Online
Annual Meeting Education Book Meeting Abstracts Online

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplementary Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Calhoun, E. S.
Right arrow Articles by Kern, S. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Calhoun, E. S.
Right arrow Articles by Kern, S. E.
[Cancer Research 66, 7920-7928, August 15, 2006]
© 2006 American Association for Cancer Research


Molecular Biology, Pathobiology, and Genetics

Identifying Allelic Loss and Homozygous Deletions in Pancreatic Cancer without Matched Normals Using High-Density Single-Nucleotide Polymorphism Arrays

Eric S. Calhoun1, Tomas Hucl1, Eike Gallmeier1, Kristen M. West3, Dan E. Arking3, Anirban Maitra1,2, Christine A. Iacobuzio-Donahue2, Aravinda Chakravarti3, Ralph H. Hruban1,2 and Scott E. Kern1,2

Departments of 1 Oncology and 2 Pathology, The Sol Goldman Pancreatic Cancer Research Center at the Sidney Kimmel Comprehensive Cancer Center, and 3 Department of Genetic Medicine at the Mckusick-Nathans Institute of Genetic Medicine, Johns Hopkins Medical Institutes, Baltimore, Maryland

Requests for reprints: Eric S. Calhoun, Department of Oncology, Johns Hopkins Medical Institutes, CRB 1, Room 464, 1650 Orleans Street, Baltimore, MD 21231. Phone: 410-614-3314; Fax: 410-614-9705; E-mail: ecalhou2{at}jhmi.edu.


    Abstract
 Top
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 References
 
Recent advances in oligonucleotide arrays and whole-genome complexity reduction data analysis now permit the evaluation of tens of thousands of single-nucleotide polymorphisms simultaneously for a genome-wide analysis of allelic status. Using these arrays, we created high-resolution allelotype maps of 26 pancreatic cancer cell lines. The areas of heterozygosity implicitly served to reveal regions of allelic loss. The array-derived maps were verified by a panel of 317 microsatellite markers used in a subset of seven samples, showing a 97.1% concordance between heterozygous calls. Three matched tumor/normal pairs were used to estimate the false-negative and potential false-positive rates for identifying loss of heterozygosity: 3.6 regions (average minimal region of loss, 720,228 bp) and 2.3 regions (average heterozygous gap distance, 4,434,994 bp) per genome, respectively. Genomic fractional allelic loss calculations showed that cumulative levels of allelic loss ranged widely from 17.1% to 79.9% of the haploid genome length. Regional increases in "NoCall" frequencies combined with copy number loss estimates were used to identify 41 homozygous deletions (19 first reports), implicating an additional 13 regions disrupted in pancreatic cancer. Unexpectedly, 23 of these occurred in just two lines (BxPc3 and MiaPaCa2), suggesting the existence of at least two subclasses of chromosomal instability (CIN) patterns, distinguished here by allelic loss and copy number changes (original CIN) and those also highly enriched in the genomic "holes" of homozygous deletions (holey CIN). This study provides previously unavailable high-resolution allelotype and deletion breakpoint maps in widely shared pancreatic cancer cell lines and effectively eliminates the need for matched normal tissue to define informative loci. (Cancer Res 2006; 66(16): 7920-9)


    Introduction
 Top
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 References
 
A greater efficiency in analyzing loss of heterozygosity (LOH) in cancer genomes is needed. Long-standing problems have hampered efforts to characterize LOH, particularly in tumor types such as adenocarcinoma of the pancreas, which contain as few as 10% neoplastic cells (1). Primary tissues are also inherently limited resources, which eventually deplete through analysis. Access is often limited to select individuals/institutions where patients are seen, limiting the ability of the scientific community to validate published results. By establishing in vitro and/or xenografted cultures of carcinoma cells, many of the problems associated with primary tissues can be overcome. From a genetic perspective, early-passage cell lines represent an almost ideal form of tissue free of contaminating normal cells. With unlimited expandability, as well as portability, cell lines have the potential to be shared freely between investigators and/or established in commercial cell-line banks. Unfortunately, in vitro cultures generally lack a matched normal tissue. In the past, this impaired the ability to discern between hemizygosity and homozygosity as well as between polymorphic (germ line) variants and true somatic mutations.

The efficiency of analyzing allelic loss is also dependent on technological limitations. The analysis of microsatellite markers has been the gold standard for estimating allelic loss because the rate of heterozygosity at each locus is comparably higher than that seen for diallelic single-nucleotide polymorphisms (SNP; refs. 24). Such low heterozygosity frequencies have been one reason why analyses based on SNPs have found limited use in genetic studies (5). Technical difficulties in the scalability for microsatellite marker studies, however, limit the number of samples and/or the number of markers used, effectively ensuring a low genomic coverage and poor resolution of LOH breakpoints (6). Recent advances in detecting chromosomal copy number (e.g., bacterial artificial chromosome array–based comparative genomic hybridization and representational oligonucleotide microarray analysis; refs. 711) have provided a solution to some of these problems but fail to identify many cases of LOH, as when uniparental disomy or polysomy is present. Additionally, genomic coverage remains somewhat limited by the number of probes arrayed (recently ~5,400 probes per array; ref. 10).

New high-throughput technologies, such as the Affymetrix genotyping arrays which evaluate >100,000 genotypic-variant loci simultaneously, effectively allow for a high-resolution genome-wide analysis of allelic status on large sample sets (3, 12). The reduced informativeness of diallelic markers is overcome by an increase in the number of evaluated loci and subsequently the number of informative calls. For example, Matsuzaki et al. (3) reported a heterozygous frequency rate of 30.41% using the 100K SNP arrays, which would yield nearly 30,000 informative calls in diploid individuals. The quality of the results, however, is dependent on the purity of the starting tissue. Allelic loss studies are dramatically affected by contaminating normal tissues, a condition common to many scirrhous neoplasms (1, 13), and the effect of contaminating mouse tissues in xenografts has yet to be determined for SNP analysis. Consequently, studies of these tumor systems using the Affymetrix genotyping arrays are limited to established cell line cultures.

By combining the analysis of established, commercially available pancreatic cancer cell lines, as well as several early-passage cell lines with the Affymetrix 100K genotyping arrays, we have overcome long-standing problems associated with allelic-loss analysis in pancreatic cancer. We present here the high-resolution allelotype and breakpoint maps of 26 pancreatic cancer cell lines without the use of matched normal tissues.


    Materials and Methods
 Top
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 References
 
Cell lines. The pancreatic cancer lines AsPc1, BxPc3, CAPAN1, CFPAC1, Hs766T, MiaPaCa2, PL1 (Panc 02.13), PL9 (Panc 08.13), PL5 (Panc 04.03), PL11 (Panc 03.27), Panc-1, and Su86.86 were purchased from the American Type Culture Collection (Manassas, VA). COLO357 was obtained from the European Collection of Animal Cell Cultures (Salisbury, United Kingdom). PK8 and PK9 were kind gifts from Dr. Arata Horii to C.A.I-D. (14). XPA1 (PXA184), XPA2 (PXA196), and XPA3 (PX154) were established by A.M. and described elsewhere (15). The remaining pancreatic cancer lines, PL3 (Panc 04.14), PL13 (Panc 09.06), PL19 (TS0111), PL20 (TS0120), PL21 (TS0129), PL22 (TS0226), PL23 (TS031), and PL24 (TS065), were kind gifts from Dr. Elizabeth M. Jaffee (Johns Hopkins, Baltimore, MD). Lines were propagated in RPMI 1640 (Invitrogen, Carlsbad, CA) supplemented with 10% fetal bovine serum, 2 mmol/L L-glutamine, 100 units/mL penicillin, and 100 µg/mL streptomycin, unless otherwise specified by the supplier, and maintained in a 5% CO2 humidified incubator at 37°C. Genomic DNA was isolated from cell lines using a Qiagen Blood Tissue kit (Valencia, CA) and used directly in the assays described herein.

Microsatellite marker analysis. Microsatellite marker mapping was done in collaboration with the NIH Center for Inherited Disease Research using a total of 386 microsatellite markers (Supplementary File 1; ref. 4). The positions of 317 of these markers (chromosomes 1-22) are currently mapped by the Ensembl database (16) and were used for comparisons made here. The genotype (the identity of the alleles at a particular locus) for each marker was "called" as homozygous/hemizygous (A) or heterozygous (B) based on the resolution of one or two PCR bands, respectively. If the PCR reaction yielded no detectable band, the marker was classified as missing (M). Homozygous/hemizygous and missing calls were considered uninformative whereas heterozygosity indicated the retention of both parental alleles. Data were visualized with Excel (Microsoft, Redmond, WA) by plotting the calls with respect to their positions in the May 2004 human genome build.

SNP allelotype mapping. The genotypes of 115,353 SNPs were analyzed using the Affymetrix CentXba and CentHind oligonucleotide arrays hybridized to reduced-complexity genomic DNA as described (3). Briefly, 250 ng of genomic DNA were digested with either XbaI or HindIII, adapters were ligated to the digested DNA, and PCR was done to preferentially amplify fragments 250 to 2,000 bp in length. Samples were then fragmented, fluorescently labeled, and hybridized to the arrays according to the protocol of the manufacturer. Genotypes were called by the GeneChip DNA Analysis Software Tool (version 3.0) using a 0.05% error setting (Supplementary Files 2-5). Four potential genotype calls were made: AA, BB, AB, and NoCall. No distinction was made between homozygosity for either allele; each result was considered uninformative and provided only a visual reference to indicate genome coverage. Heterozygous calls indicated the retention of both alleles and were considered informative. "NoCall" data were not used in the analysis of LOH. Genotyping calls were visualized by plotting with respect to their genomic position listed in the May 2004 assembly of the human genome. Any probes of which the position was not known or not annotated by GeneChip DNA Analysis Software Tool were subsequently omitted as were SNPs with a Caucasian heterozygous frequency of zero (3). This left a maximum of 104,502 evaluated SNP loci per sample.

Critical gap distance for LOH. The distances between heterozygous calls for each of the 42 Caucasian normal, EBV-transformed, lymphoblastoid cell lines were calculated from previously published allelotypes (3). An interheterozygous call distance of 2.75 million bp (Mbp) represented a value greater than the 99.9th percentile of all gap sizes observed. This distance was used as the cutoff to eliminate smaller regions of contiguous homozygous SNP markers that occur naturally (i.e., by chance) in each cell line. Candidate regions evaluated by at least 40 SNPs were considered as "critical" gaps, indicating instances of LOH. Immediately adjacent critical gaps were combined and represented as one contiguous region of LOH. Instances of regions occurring between critical gaps, or those occurring at the ends of chromosome arms, were combined into one contiguous region at our best judgment. Centromeric regions were included in LOH estimates when critical gaps were located immediately adjacent on both p and q arms; otherwise, each breakpoint estimate was terminated using the position of the last SNP evaluated by the arrays on each respective arm. Excessively homozygous regions (between 2.75 and 8 Mbp and at least 40 SNPs) identified in the 131 Caucasian normals (Supplementary Table S1) were removed from the list of critical gaps as they may represent false-positive events of identity-by-descent. Finally, homozygous regions >8 Mbp identified in the normals were believed to represent true LOH events affecting the immortalized "normal" cell line samples and, as such, were not used to remove critical gaps from the cell line data.

Precision of critical gap breakpoints. Matched normal tissue (not immortalized cell lines) was available for three of the early-passage carcinoma cell lines (PL9, PN139; PL11, PN192; PL13, PN147) and were used to test the method for estimating critical gaps. SNP allelotypes for each normal were determined as described above and compared with the respective carcinoma line allelotype maps. Gaps containing at least three informative hemizygous markers were used to identify a region of true LOH, and the positions of the outermost hemizygous markers established the minimal region of loss (MRL) breakpoints. Regions identified as lost using the matched normals, but not identified as a critical gap, were used to estimate the false-negative rate. Critical gaps containing less than three informative markers, as determined from the matched normal genotypes, were used to estimate the false-positive rate. Critical gaps showing true loss were compared with the MRL estimates to gauge the precision of our allelic loss breakpoint positions.

Estimation of total allelic loss. The extent of total allelic loss was estimated by calculating genomic fractional allelic loss (GFAL; ref. 4), with the following modifications. An expected Caucasian frequency of total heterozygous calls for the Affymetrix CentXba and CentHind oligonucleotide arrays (0.3041; ref. 3) was used as an estimate of the total number of informative markers. GFAL was calculated by dividing the total observed heterozygous frequencies by 0.3041, subtracting from 1, and converting to a percent.

Statistical analysis. The Spearman rank correlation coefficient was used to evaluate whether the degree of allelic loss (i.e., increasing levels of GFAL) correlated with the number of years because the cell line was isolated and established in culture. The year of establishment was obtained from published records (1723) or from the respective suppliers and ranked with the oldest as last (ranked 24th, Table 1 ). GFAL scores were similarly ranked in descending order with cell lines having the highest degree of allelic loss last (ranked 24th). Cell lines with the same year of establishment were assigned an average of the ranks for the year. A high level of significance [{alpha} = 0.01; critical region rs > 0.485 (n = 24)] was used to evaluate the test statistic. The critical value as well as the method for calculation of the test statistic was obtained from published sources (24). The microsatellite unstable lines, PL3 and PL5, were not included in this analysis as they showed no deviation of GFAL from the 42 published Caucasian controls (Table 1 and data not shown; ref. 3).


View this table:
[in this window]
[in a new window]
 
Table 1. Genomic fractional allelic loss in pancreatic cancer cell lines

 
Copy number analysis. Single-point analysis of copy number estimates for each SNP was done by the Copy Number Analysis Tool (version 2.2; Supplementary Files S2-S5; ref. 25). Variations of copy number estimates were reduced by applying a positional moving average algorithm where each SNP copy number value was averaged with a maximum of 10 immediately adjacent (5 telomeric and 5 centromeric) SNP copy number estimates; only adjacent SNPs positioned within ±250 kb of the central SNP were included in the average. Data from the CentXba and CentHind chips were analyzed separately and plotted individually to compare copy number variations between arrays.

Homozygous deletions. Dramatic drops in copy number values below 1.0, evidenced by both arrays, were indicative of a true homozygous deletion. With decreasing regional size (and/or corresponding copy number averages closer to 1.0), the ability to distinguish between homozygous deletion and LOH was reduced. To compensate, we calculated a 20-SNP moving average of the frequency of NoCall genotypes across the genome for each array. Agreement between increasing NoCall frequency and decreases in copy number averages was used to identify additional candidate homozygous deletions. Representative candidate regions, as well as several previously identified, were validated using PCR with at least two consecutive nonoverlapping primer sets. Regional chromosomal gains and amplifications were not specifically evaluated by this study. Raw data and positional moving average (PMA) estimates of copy number are included in Supplementary data.


    Results and Discussion
 Top
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 References
 
Genotype determination of pancreatic cancer cell lines. Several studies have reported the usefulness of the Affymetrix 10K arrays in determining allelic loss when using matched normal tissues (26, 27). Unfortunately, even at this SNP density, evaluating allelic loss without a matched normal would yield only ~3,000 heterozygous informative calls. With the introduction of the 100K SNP arrays, genomic coverage improves 10-fold and would yield a similar 10-fold increase in heterozygous calls (i.e., ~30,000), suggesting it may be possible to do allelic studies without matched normal tissues.

To test this proposal, we evaluated an initial panel of seven pancreatic cancer cell lines (AsPc1, BxPc3, COLO357, Hs766T, MiaPaCa2, Panc-1, and Su86.86) using the Affymetrix CentXba and CentHind oligonucleotide arrays. A total of 104,502 loci were evaluated per cell line, yielding informative heterozygous calls ranging from 18.8% (17,049 of 90,822) to 6.1% (5,326 of 87,005). The data were plotted with respect to their physical positions, indicating several areas of chromosomal loss (Fig. 1 and Supplementary Figs. S1-S9). Validation of the experimental method and data was accomplished using a large panel of microsatellite markers. In total, 2,001 microsatellite calls were obtained, yielding 729 informative heterozygous calls. Heterozygous microsatellite frequencies ranged from 23.3% (65 of 279) to 47.2% (125 of 265) for each cell line (Supplementary Table S2).


Figure 1
View larger version (71K):
[in this window]
[in a new window]
 
Figure 1. Multidimensional genotypic analysis of CAPAN1 chromosome 2 and BxPc3 chromosome 9. A, copy number moving averages for the CentHind (pink) and CentXba (dark blue) chips. B, individual heterozygous genotyping calls are plotted above the grouped uninformative homozygous and NoCall genotypes (black). C, NoCall genotypes (green, CentHind; blue, CentXba) are inversely plotted with respect to increasing frequency. Agreement between valleys in the copy number and NoCall genotype plots commonly indicates the presence of a homozygous deletion. Yellow guidelines are for reference purposes only and indicate one and four copies of the haploid genome. Three homozygous deletions are illustrated (arrows) including one previously unreported deletion at 9 Mbp in BxPc3.

 
We found a high concordance between SNP and microsatellite calls. Of the 729 microsatellite heterozygous calls, 708 (97.1%) were in areas also characterized by the SNP arrays to contain two alleles. The discordant calls were distributed among 13 separate chromosomes in six of the seven cell lines and involved 16 individual critical gap regions (Supplementary Table S2). We felt several possible explanations existed for these discrepancies, including one or more of the following: the ambiguity (i.e., precision; see below) of assigning the critical gap breakpoints as well as the choice of method to combine adjacent critical gaps into contiguous regions (affects seven regions); the presence of a mosaic population of cells in culture with different genomic constitutions (inspection of SNP densities in five regions were visibly decreased from normal but not consistent with complete loss); large regions of possible identity-by-descent that could lead to an incorrect classification as a critical gap (the size of seven critical gaps were <8 Mbp); and the possible incorrect assessment of allelic status by mis-calls in the microsatellite marker analysis.

As expected, the ability to use the microsatellite marker data to determine areas of LOH proved difficult due to the low number of informative calls. Using the SNP arrays, a 120-fold increase in heterozygous calls was obtained over the microsatellite analysis (86,158 versus 729). Using 2.86 billion bp (Chr1-22; ref. 16) as the combined length of the human genome and after subtracting centromeric and telomeric regions not evaluated with the SNP arrays (3), the average heterozygous intermarker distance between these methods varied from nearly 25.5 Mbp (microsatellite analysis) to 216 kbp (SNP arrays). Additionally, plotting of the genotype calls of the SNP arrays revealed clear demarcations of chromosomal loss even without the use of normals, which otherwise would increase the number of informative markers (Fig. 1 and Supplementary Figs. S1-S9).

Considering these results, the ease of the assay, and the time and tissue savings shown, we expanded our analysis to an additional panel of 19 pancreatic cancer lines evaluated with the 100K arrays. In total, 2.72 million loci were evaluated, yielding 2.39 million genotype calls with more than 415K heterozygous calls (Supplementary Files 2 and 3). The frequencies of informative heterozygous calls ranged from 30.1% (PL3) to 6.1% (MiaPaCa2; Table 1). The average interheterozygous marker distance was calculated to be 166kbp.

Regions of LOH determined by the critical gap distance. Data were separated by chromosome and plotted with respect to physical position. This enabled a visual comparison of overlapping regions of LOH. Large regions of homozygosity encompassing entire chromosomes or chromosomal arms were clearly evident as were smaller areas spanning <10 Mbp (Fig. 1 and Supplementary Figs. S1-S9). Lacking a matched normal control, the distinction between identity-by-descent and true areas of homozygosity accomplished through allelic loss became more difficult with a decreasing size of the region of interest. We therefore used the published record of 42 Caucasian normals (3) to determine the average distance between heterozygous calls on the 100K SNP arrays. We assigned a critical gap distance of >2.75 Mbp (evaluated with at least 40 SNP probes) as a cutoff for filtering out the areas of chance homozygosity; this size represents greater than the 99.9th percentile of all gaps observed (data not shown). Regions >8.0 Mbp identified in the controls were not filtered out of the candidate regions identified in the carcinoma cell lines as many of these homozygous regions may indeed represent true areas of LOH, a phenomenon known to occur in clones of lymphoblasts transformed by EBV (28, 29). Regions of excessive homozygosity between 2.75 and 8 Mbp appearing within the control population were more difficult to evaluate. Several possible mechanisms could account for the observation of excessively homozygous regions in the control population. Foremost, chip design artifacts, such as decreasing SNP-probe densities as well as stretches of polymorphisms with very low minor allele frequencies in Caucasian populations (chosen to minimize gaps in genomic coverage), may lead to such phenomena. These regions may represent areas of true LOH or simply represent inherently stable chromosomal regions having little genomic variation in the population.

To gain a better perspective of the ability of the arrays to evaluate the above Caucasian population, we catalogued all gaps >2.75 Mbp (containing at least 40 evaluated SNPs) occurring in these 42 Caucasian normals, as well as a second panel of 89 HapMap CEPH normals previously published (30). In total, 178 instances of homozygosity >2.75 Mbp but <8 Mbp occurred in the 131 normal controls (Supplementary Table S1). More than half of those (n = 90) were clustered (three or more instances per cluster) within 14 common regions across the genome, suggesting the chip density or SNP heterozygous frequencies are inadequate to accurately assess the allelic status in those areas. Taken together, the 178 instances of homozygosity would represent a theoretical false-positive rate of 1.37 regions per genome analyzed. This rate decreases to 0.68 regions per genome after elimination of common clusters.

Using all of the catalogued homozygous regions from the 131 normals as a filter, we eliminated any common critical gaps occurring in the carcinoma cell lines as candidate regions of LOH as these were likely to be false positives. The remaining (qualifying) areas of LOH were then plotted (Fig. 2 ). The breakpoint positions of each region (Supplementary Table S3) were taken from the SNP genomic positions of the first and last heterozygous genotype call (or the terminal chromosomal SNP evaluated by the arrays for each chromosomal arm) that defined a critical gap. Chromosome arms with the greatest frequency of allelic loss (Fig. 3 ) were confirmed on 18q (95.8%, MADH4), 17p (91.7%, TP53), 9p (87.5%, CDKN2A), 8p (79.2%), 3p (75.0%), 6q (75.0%), and 12q (70.8%; refs. 4, 10, 31, 32). Importantly, detailed maps of allelic loss are provided for these widely shared cell lines allowing for the rapid and efficient evaluation of candidate genes for somatic mutations (Fig. 2; Supplementary Table S3).


Figure 2
View larger version (44K):
[in this window]
[in a new window]
 
Figure 2. LOH determined by critical gap evaluations. Regions of LOH (red) identified in a panel of 24 pancreatic cancer cell lines are plotted with respect to chromosomal position. Blue, areas evaluated by the arrays; black, centromeric regions. Bar (alternating gray and black), 10-Mbp genomic segments.

 

Figure 3
View larger version (26K):
[in this window]
[in a new window]
 
Figure 3. Peak LOH values. The histogram displays the percentage of pancreatic cancer cell lines with chromosomal arm–specific LOH.

 
Critical gap distance closely estimates the MRL. Despite the increased resolution of the arrays, the actual precision of our critical gap estimates to the MRL remained unknown. Matched normal tissues, required to define a MRL, were available for three of the early-passage cell lines (PL9, PL11, and PL13) and subsequently assayed using the SNP arrays to address this issue. Of the critical gaps previously identified for these cell lines (Supplementary Table S4), 45 of 52 (86.5%) regions were confirmed as true losses, exemplified by at least three hemizygous informative markers. Of the regions that were not confirmed, one showed loss of two informative markers and may indeed represent an area of LOH. The remaining six regions all failed to contain any informative markers despite an average of 71 SNPs evaluated per region. Of the 45 confirmed regions, the median difference between critical gap and MRL breakpoints was 288,424 bp.

The use of matched normal tissues allowed for a separate estimate of our false-positive and false-negative rates. The false-positive rate, represented by the frequency of critical gaps failing to have any informative markers (i.e., no heterozygous genotypes in the matched normal tissue), was ~2.3 per genome (N = 7). The median size of these critical gaps was 3,478,995 bp. We identified 11 instances of false negatives (i.e., regions containing at least three hemizygous markers within a single heterozygous gap), indicating a projected rate of 3.6 regions per genome. The median interheterozygous call distance for these regions was 970,860 bp with a median MRL distance of 493,645 bp. Four of these occurred on chromosome 8 of PL9 and may be indicative of a mosaic chromosome or chromosomal region undergoing change in culture. Other cell lines (e.g., AsPc1 chromosome 1q, PK9 chromosome 13, and PL21 chromosome 3p) show similar genotype patterns (Supplementary Figs. S1-S9) and may therefore underestimate LOH in these regions as aberrant heterozygous calls will interfere with the calculation of critical gaps. Single-cell dilution before determining the allelic status seems to minimize this problem (data not shown) but may introduce a clonal variability artifact to the data set, necessitating the analysis of multiple clones for each line. Each cell line was therefore only analyzed in aggregate.

Total allelic losses estimated by GFAL. GFAL, an estimate of the fractional total of all chromosomal losses, is ideally calculated by dividing the number of markers indicating allelic loss by the total number of informative markers. Such a calculation, however, would require a matched normal tissue to determine the informativeness of each marker. Instead, we used an average heterozygosity rate of 30.41% as a standard expected proportion for informative markers. GFAL scores were calculated to be 1.09% and 3.44% for the chromosomal stable cell lines PL3 and PL5, respectively, and varied from 6.47% to –2.40% in the Caucasian normals (data not shown). The SD of heterozygous frequencies in the controls was 0.54%, suggesting that the calculated GFAL in the tumor cell lines may incorrectly estimate total allelic loss by ±3.58% (95% confidence). In the lines having chromosome instability, we found GFAL to vary from 79.87% (MiaPaCa2) to 17.11% (XPA3) of the genome length. Increasing levels of GFAL did not correlate with the year of establishment [rs = 0.405; critical region rs > 0.485 (n = 24)]. Such a result, however, is not unexpected as the total number of cell doublings cannot be accounted for in the commercially available lines.

Copy number determination and identification of homozygous deletions. Using raw intensity data from each array, the Copy Number Analysis Tool estimates ploidy for each SNP locus by comparing the distribution of intensities obtained from a reference set of normal DNAs (25). Ideally, this comparison would produce a clear and consistent copy number estimate between each of the arrays at a particular locus (or between consecutive SNPs on the same array). In practice, however, copy number estimates often deviate significantly, leading some investigators to determine ploidy values from more direct and less variable techniques such as array-based comparative genomic hybridization (33, 34). To help reduce the variation in copy number estimates, we applied a PMA smoothing algorithm to each of the arrays and plotted the PMAs separately (Fig. 1 and Supplementary Figs. S1-S9). Agreement between the CentHind and CentXba PMA results was used to help reduce our uncertainty of regional copy number estimates.

Visual inspection of the PMA plots revealed complex genomic patterns. For example, PMAs of copy number suggest that CAPAN1 is likely a near-triploid cell line, given that the mean copy number estimates are often noninteger values (i.e., between 3 and 2 or between 2 and 1). On chromosome 2 of CAPAN1, variations in copy number as well as a reduction to homozygosity (i.e., true LOH) occurred in localized regions and did not involve the entire chromosome (Fig. 1). Reductions to homozygosity did not always coincide with copy number reduction (i.e., between ~118 and 141 Mbp copy number PMAs remained at ~2.5 despite a reduction to homozygosity) and may have, in part, involved locus-restricted (interstitial) recombinational events (i.e., gene conversion). By no means was this a unique event restricted to just chromosome 2 of CAPAN1; examples are present among other chromosomes in most of the cell lines reported here (Supplementary Figs. S1-S9). Most importantly, the ability to identify regions of LOH that lack copy number reduction illustrates why evaluation of genotypes alone (and associated critical gaps) is crucial for LOH determination. Such regions are produced by gene conversion, mitotic recombination, and chromosomal nondisjunction combined with chromosome reduplication (35).

Copy number PMAs also proved beneficial in the identification of homozygously deleted regions. Simultaneous decreases in copy number PMAs (<1.0) from both the CentHind and CentXba arrays routinely coincided with the presence of a homozygous deletion (Fig. 1 and Supplementary Figs. S1-S9). We combined copy number data with genotyping NoCall frequencies (homozygous deletions have an inherent increase in NoCall genotypes) to help identify additional candidate regions that were partially masked by the PMA smoothing algorithm. In total, 41 homozygous deletions were identified, of which 22 were previously reported (3643). The remaining 19 homozygous deletions represent first reports implicating an additional 13 regions (11 separate chromosomes) not previously associated with pancreatic carcinogenesis (Table 2 ). Unexpectedly, the total number of homozygous deletions seemed to be bimodal: of 41 homozygous deletions, 23 occurred in just two cell lines (15 in BxPc3 and 8 in MiaPaCa2). Based on GFAL scores alone, such a high rate of homozygous deletions would not have been predicted for BxPc3. This suggests that the processes leading to chromosomal instability (CIN) having only few homozygous deletions (original CIN) may be distinguishable from an occasional superimposed phenotype having high numbers of these genomic holes (holey CIN).


View this table:
[in this window]
[in a new window]
 
Table 2. Summary of homozygous deletions identified by SNP arrays

 
Until recently, it was laborious and technically challenging to determine the allelic status within cancer tissues that lacked a matched normal control. Early advances in chip-based genotyping technology, evaluating nearly 10,000 loci simultaneously, improved the feasibility of studies evaluating large sample sets but still lacked the resolution to detail allelic loss breakpoints in cancer without matched normal controls. Continual improvements in array design and whole-genome complexity reduction methodologies have now made this possible, enabling the pursuit of large-scale studies that further our understanding of the genomic changes occurring in cancer. It is important to note, however, that the concept of using overlapping regions of LOH to pinpoint the location of novel tumor suppressor genes is a commonly held misconception that we addressed in the past (44). We are currently unaware of any somatically mutated tumor suppressor gene cloned directly from the boundary maps of overlapping LOH. In contrast, such boundary mapping can be successful for germ line mutations (e.g., MEN1). The mutational frequency of any gene rarely approaches the rate of LOH in tumors, with possible exceptions, such as p16 in pancreatic cancer and p53 in colorectal cancer (45). LOH maps permit one to more efficiently evaluate candidate tumor suppressor genes. Most recently, we have used LOH maps of pancreatic cancer xenografts (4) as a tool to uncover mutations in FBXW7, FANCC, and FANCG (46, 47). Using LOH maps to triage samples (i.e., sequence only samples with LOH in the genomic region of interest) provides researchers with the ability to maximize the potential and efficiency for finding somatic mutations.

LOH represents a key structural foundation for proposed cancer genome projects, a required component for identification of gene conversion and similar chromosome recombinational events [as shown in Fig. 1 and as proposed by Cavenee et al. (35)], and a virtually required accompaniment of somatic (as opposed to polymorphic germ line) homozygous deletions, among other uses. The facile extension of LOH analysis to the most common and widely shared cancer samples (i.e., the commercially available cell lines lacking matched normals) is likely to prove essential for many anticipated cancer studies.


    Acknowledgments
 
Grant support: NIH grants CA92624, CA92117, and CA21765; the American Lebanese Syrian Associated Charities; and an unrestricted grant from the Hopkins Capital Group, LLC.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

We thank Drs. Elizabeth M. Jaffee and Arata Horii for the kind gifts of cell lines, Dr. Alison P. Klein for statistical advice, Carl S. Kashuk for help in the evaluation of the HapMap data set, and Dr. Steven C. Cunningham for critically reading the manuscript.


    Footnotes
 
Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).

Received 2/23/06. Revised 5/ 4/06. Accepted 6/20/06.


    References
 Top
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 References
 

  1. Seymour AB, Hruban RH, Redston M, et al. Allelotype of pancreatic adenocarcinoma. Cancer Res 1994;54:2761–4.[Abstract/Free Full Text]
  2. Schaid DJ, Guenther JC, Christensen GB, et al. Comparison of microsatellites versus single-nucleotide polymorphisms in a genome linkage screen for prostate cancer-susceptibility loci. Am J Hum Genet 2004;75:948–65.[CrossRef][Medline]
  3. Matsuzaki H, Dong S, Loi H, et al. Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. Nat Methods 2004;1:109–11.[CrossRef][Medline]
  4. Iacobuzio-Donahue CA, van der Heijden MS, Baumgartner MR, et al. Large-scale allelotype of pancreaticobiliary carcinoma provides quantitative estimates of genome-wide allelic loss. Cancer Res 2004;64:871–5.[Abstract/Free Full Text]
  5. Wang DG, Fan JB, Siao CJ, et al. Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 1998;280:1077–82.[Abstract/Free Full Text]
  6. Sellick GS, Longman C, Tolmie J, et al. Genomewide linkage searches for Mendelian disease loci can be efficiently conducted using high-density SNP genotyping arrays. Nucleic Acids Res 2004;32:e164.[Abstract/Free Full Text]
  7. Zhou X, Rao NP, Cole SW, Mok SC, Chen Z, Wong DT. Progress in concurrent analysis of loss of heterozygosity and comparative genomic hybridization utilizing high density single nucleotide polymorphism arrays. Cancer Genet Cytogenet 2005;159:53–7.[CrossRef][Medline]
  8. Pinkel D, Segraves R, Sudar D, et al. High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nat Genet 1998;20:207–11.[CrossRef][Medline]
  9. Pollack JR, Perou CM, Alizadeh AA, et al. Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nat Genet 1999;23:41–6.[Medline]
  10. Nowak NJ, Gaile D, Conroy JM, et al. Genome-wide aberrations in pancreatic adenocarcinoma. Cancer Genet Cytogenet 2005;161:36–50.[CrossRef][Medline]
  11. Lucito R, Healy J, Alexander J, et al. Representational oligonucleotide microarray analysis: a high-resolution method to detect genome copy number variation. Genome Res 2003;13:2291–305.[Abstract/Free Full Text]
  12. Zhao X, Weir BA, LaFramboise T, et al. Homozygous deletions and chromosome amplifications in human lung carcinomas revealed by single nucleotide polymorphism array analysis. Cancer Res 2005;65:5561–70.[Abstract/Free Full Text]
  13. Bos JL, Fearon ER, Hamilton SR, et al. Prevalence of ras gene mutations in human colorectal cancers. Nature 1987;327:293–7.[CrossRef][Medline]
  14. Sun C, Yamato T, Furukawa T, Ohnishi Y, Kijima H, Horii A. Characterization of the mutations of the K-ras, p53, p16, and SMAD4 genes in 15 human pancreatic cancer cell lines. Oncol Rep 2001;8:89–92.[Medline]
  15. Berman DM, Karhadkar SS, Maitra A, et al. Widespread requirement for Hedgehog ligand stimulation in growth of digestive tract tumours. Nature 2003;425:846–51.[CrossRef][Medline]
  16. Birney E, Andrews D, Caccamo M, et al. Ensembl 2006. Nucleic Acids Res 2006;34:D556–61.[Abstract/Free Full Text]
  17. Schoumacher RA, Ram J, Iannuzzi MC, et al. A cystic fibrosis pancreatic adenocarcinoma cell line. Proc Natl Acad Sci U S A 1990;87:4012–6.[Abstract/Free Full Text]
  18. Lieber M, Mazzetta J, Nelson-Rees W, Kaplan M, Todaro G. Establishment of a continuous tumor-cell line (panc-1) from a human carcinoma of the exocrine pancreas. Int J Cancer 1975;15:741–7.[Medline]
  19. Drucker BJ, Marincola FM, Siao DY, Donlon TA, Bangs CD, Holder WD, Jr. A new human pancreatic carcinoma cell line developed for adoptive immunotherapy studies with lymphokine-activated killer cells in nude mice. In Vitro Cell Dev Biol 1988;24:1179–87.[Medline]
  20. Chen WH, Horoszewicz JS, Leong SS, et al. Human pancreatic adenocarcinoma: in vitro and in vivo morphology of a new tumor line established from ascites. In Vitro 1982;18:24–34.[Medline]
  21. Fogh J, Fogh JM, Orfeo T. One hundred and twenty-seven cultured human tumor cell lines producing tumors in nude mice. J Natl Cancer Inst 1977;59:221–6.[Medline]
  22. Tan MH, Nowak NJ, Loor R, et al. Characterization of a new primary human pancreatic tumor line. Cancer Invest 1986;4:15–23.[Medline]
  23. Morgan RT, Woods LK, Moore GE, Quinn LA, McGavran L, Gordon SG. Human cell line (COLO 357) of metastatic pancreatic adenocarcinoma. Int J Cancer 1980;25:591–8.[Medline]
  24. Walpole RE, Myers RH, Myers SL. Probability and statistics for engineers and scientists. 6th ed. Upper Saddle River (NJ): Prentice Hall; 1998. p. xii, 739 p.
  25. Huang J, Wei W, Zhang J, et al. Whole genome DNA copy number changes identified by high density oligonucleotide arrays. Hum Genomics 2004;1:287–99.[Medline]
  26. Lips EH, Dierssen JW, van Eijk R, et al. Reliable high-throughput genotyping and loss-of-heterozygosity detection in formalin-fixed, paraffin-embedded tumors using single nucleotide polymorphism arrays. Cancer Res 2005;65:10188–91.[Abstract/Free Full Text]
  27. Teh MT, Blaydon D, Chaplin T, et al. Genomewide single nucleotide polymorphism microarray mapping in basal cell carcinomas unveils uniparental disomy as a key somatic event. Cancer Res 2005;65:8597–603.[Abstract/Free Full Text]
  28. Okubo M, Tsurukubo Y, Higaki T, et al. Clonal chromosomal aberrations accompanied by strong telomerase activity in immortalization of human B-lymphoblastoid cell lines transformed by Epstein-Barr virus. Cancer Genet Cytogenet 2001;129:30–4.[CrossRef][Medline]
  29. Takahashi T, Kawabe T, Okazaki Y, et al. In vitro establishment of tumorigenic human B-lymphoblastoid cell lines transformed by Epstein-Barr virus. DNA Cell Biol 2003;22:727–35.[CrossRef][Medline]
  30. Altshuler D, Brooks LD, Chakravarti A, Collins FS, Daly MJ, Donnelly P. A haplotype map of the human genome. Nature 2005;437:1299–320.[CrossRef][Medline]
  31. Mahlamaki EH, Hoglund M, Gorunova L, et al. Comparative genomic hybridization reveals frequent gains of 20q, 8q, 11q, 12p, and 17q, and losses of 18q, 9p, and 15q in pancreatic cancer. Genes Chromosomes Cancer 1997;20:383–91.[CrossRef][Medline]
  32. Aguirre AJ, Brennan C, Bailey G, et al. High-resolution characterization of the pancreatic adenocarcinoma genome. Proc Natl Acad Sci U S A 2004;101:9067–72.[Abstract/Free Full Text]
  33. Bignell GR, Huang J, Greshock J, et al. High-resolution analysis of DNA copy number using oligonucleotide microarrays. Genome Res 2004;14:287–95.[Abstract/Free Full Text]
  34. Gaasenbeek M, Howarth K, Rowan AJ, et al. Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex changes and multiple forms of chromosomal instability in colorectal cancers. Cancer Res 2006;66:3471–9.[Abstract/Free Full Text]
  35. Cavenee WK, Dryja TP, Phillips RA, et al. Expression of recessive alleles by chromosomal mechanisms in retinoblastoma. Nature 1983;305:779–84.[CrossRef][Medline]
  36. Hilgers W, Koerkamp BG, Geradts J, et al. Genomic FHIT analysis in RER+ and RER– adenocarcinomas of the pancreas. Genes Chromosomes Cancer 2000;27:239–43.[CrossRef][Medline]
  37. Bashyam MD, Bair R, Kim YH, et al. Array-based comparative genomic hybridization identifies localized DNA amplifications and homozygous deletions in pancreatic cancer. Neoplasia 2005;7:556–62.[CrossRef][Medline]
  38. Levy A, Dang UC, Bookstein R. High-density screen of human tumor cell lines for homozygous deletions of loci on chromosome arm 8p. Genes Chromosomes Cancer 1999;24:42–7.[CrossRef][Medline]
  39. Caldas C, Hahn SA, da Costa LT, et al. Frequent somatic mutations and homozygous deletions of the p16 (MTS1) gene in pancreatic adenocarcinoma. Nat Genet 1994;8:27–32.[Medline]
  40. Heidenblad M, Schoenmakers EF, Jonson T, et al. Genome-wide array-based comparative genomic hybridization reveals multiple amplification targets and novel homozygous deletions in pancreatic carcinoma cell lines. Cancer Res 2004;64:3052–9.[Abstract/Free Full Text]
  41. Teng DH, Perry WL III, Hogan JK, et al. Human mitogen-activated protein kinase kinase 4 as a candidate tumor suppressor. Cancer Res 1997;57:4177–82.[Abstract/Free Full Text]
  42. Hahn SA, Schutte M, Hoque AT, et al. DPC4, a candidate tumor suppressor gene at human chromosome 18q21.1. Science 1996;271:350–3.[Abstract]
  43. Cox C, Bignell G, Greenman C, et al. A survey of homozygous deletions in human cancer genomes. Proc Natl Acad Sci U S A 2005;102:4542–7.[Abstract/Free Full Text]
  44. Hilgers W, Kern SE. Molecular genetic basis of pancreatic adenocarcinoma. Genes Chromosomes Cancer 1999;26:1–12.[CrossRef][Medline]
  45. Nigro JM, Baker SJ, Preisinger AC, et al. Mutations in the p53 gene occur in diverse human tumour types. Nature 1989;342:705–8.[CrossRef][Medline]
  46. van der Heijden MS, Yeo CJ, Hruban RH, Kern SE. Fanconi anemia gene mutations in young-onset pancreatic cancer. Cancer Res 2003;63:2585–8.[Abstract/Free Full Text]
  47. Calhoun ES, Jones JB, Ashfaq R, et al. BRAF and FBXW7 (CDC4, FBW7, AGO, SEL10) mutations in distinct subsets of pancreatic cancer: potential therapeutic targets. Am J Pathol 2003;163:1255–60.[Abstract/Free Full Text]
  48. Teng DH, Chen Y, Lian L, Ha PC, Tavtigian SV, Wong AK. Mutation analyses of 268 candidate genes in human tumor cell lines. Genomics 2001;74:352–64.[CrossRef][Medline]



This article has been cited by other articles:


Home page
Molecular Cancer TherapeuticsHome page
G.-M. Zou and A. Maitra
Small-molecule inhibitor of the AP endonuclease 1/REF-1 E3330 inhibits pancreatic cancer cell growth and migration
Mol. Cancer Ther., July 1, 2008; 7(7): 2012 - 2021.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
T. Abe, N. Fukushima, K. Brune, C. Boehm, N. Sato, H. Matsubayashi, M. Canto, G. M. Petersen, R. H. Hruban, and M. Goggins
Genome-Wide Allelotypes of Familial Pancreatic Adenocarcinomas and Familial and Sporadic Intraductal Papillary Mucinous Neoplasms
Clin. Cancer Res., October 15, 2007; 13(20): 6019 - 6025.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
M. Stark and N. Hayward
Genome-Wide Loss of Heterozygosity and Copy Number Analysis in Melanoma Using High-Density Single-Nucleotide Polymorphism Arrays
Cancer Res., March 15, 2007; 67(6): 2632 - 2642.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplementary Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Calhoun, E. S.
Right arrow Articles by Kern, S. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Calhoun, E. S.
Right arrow Articles by Kern, S. E.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Cancer Research Clinical Cancer Research
Cancer Epidemiology Biomarkers & Prevention Molecular Cancer Therapeutics
Molecular Cancer Research Cancer Prevention Research
Cancer Prevention Journals Portal Cancer Reviews Online
Annual Meeting Education Book Meeting Abstracts Online