Cancer Research Meeting Calendar  EMT and Cancer Progression and Treatment
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Cancer Research Clinical Cancer Research
Cancer Epidemiology Biomarkers & Prevention Molecular Cancer Therapeutics
Molecular Cancer Research Cancer Prevention Research
Cancer Prevention Journals Portal Cancer Reviews Online
Annual Meeting Education Book Meeting Abstracts Online

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Asmann, Y. W.
Right arrow Articles by Vasmatzis, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Asmann, Y. W.
Right arrow Articles by Vasmatzis, G.
[Cancer Research 62, 3308-3314, June 1, 2002]
© 2002 American Association for Cancer Research


Tumor Biology

Identification of Differentially Expressed Genes in Normal and Malignant Prostate by Electronic Profiling of Expressed Sequence Tags

Yan W. Asmann1, Farhad Kosari1, Kai Wang, John C. Cheville and George Vasmatzis2

Division of Experimental Pathology, Department of Laboratory Medicine and Pathology, and Mayo Cancer Center, Mayo Clinic and Foundation, Rochester, Minnesota, 55905


    ABSTRACT
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Differentially expressed genes between corresponding normal and cancertissue can advance our understanding of the molecular basis of malignancy and potentially serve as biomarkers or prognostic markers of malignancy. To identify differentially expressed genes in prostate cancer, we used a procedure combining electronic expression profiling of the prostate expressed sequence tag (EST) database and molecular biology techniques. A novel electronic expression-profiling algorithm was developed to search publicly available EST sequences for genes that show significant differential expression in prostate cancer compared with normal prostate tissue. Approximately 600 genes expressed in prostate were identified through adequate EST counts of ESTs for electronic profiling. Of these 600 genes, 9 showed statistically significant differences in their EST counts between cancer and normal prostate and were further analyzed. The predictions associated with electronic profiling were experimentally verified for two genes, cysteine-rich secretory protein 3 (CRISP-3) and deadenylating nuclease (DAN), using real-time reverse transcription-PCR with total RNA extracted from cells isolated by laser capture microdissection. In five of five Gleason score 6 cancer cases, CRISP-3 expression was increased >50 fold, whereas the expression of DAN was reduced by >80%.


    INTRODUCTION
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Cancer-associated changes in gene expression are now routinely investigated by high throughput expression profiling techniques such as SAGE3 (1) , microarray/gene chips (2) , and sequencing of clones from cDNA libraries (3 , 4) . An important consideration in expression profiling is the quantity and purity of RNA obtained from the different cell types. Increasing the quantity and purity of RNA from either malignant or benign epithelia invariably yields more reliable and comprehensive experimental results. Although LCM has helped with regard to this issue (5) , cell selection and extraction by LCM is currently a manual process, and at most only a few thousand cells can be extracted in an amount of time necessary to limit RNA degradation.

Electronic analysis of sequenced cDNAs uses the information in the EST database (dbEST) where >3.5 million sequences have been deposited from the analysis of over 6,000 human cancer and normal cDNA libraries constructed from >50 different tissues (6) .4 To use this large amount of information, computer algorithms have been developed for discovery of novel genes (7) and genes with limited tissue distribution and/or cancer-specific expression (8) . One limitation associated with the use of this dbEST is that only highly expressed genes have been sampled adequately to provide sufficient corresponding EST counts for reliable molecular profiling. However, identification of these highly expressed genes could provide significant information to enhance our understanding of carcinogenesis and serve as biomarkers or prognostic markers of malignancy. Prostate is an excellent tissue to study by EST analysis because of the sizable pool of EST data in the LCM-derived libraries.

Despite the existing search algorithms, there is still a need for sophisticated computer analysis tools to perform the clustering and analysis of the ESTs. In this study, we developed a novel electronic profiling algorithm, the Binary Indexing Search Algorithm, to identify differentially expressed genes in cancer and normal prostate EST databases. Functions of this novel algorithm include clustering ESTs to distinct genes, estimating tissue distribution of EST clusters, and sorting EST clusters by relative expression levels. This procedure, using the Binary Indexing Search Algorithm, requires ~6 h of CPU time, and in this study, we successfully used this algorithm for the identification of genes in cancer and normal microdissected prostate EST libraries. The tumor-associated differential expression for two genes, CRISP-3 and DAN, was subsequently verified experimentally.


    MATERIALS AND METHODS
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Binary Indexing Search Algorithm
Classification of dbEST.
The dbEST was categorized using the GLS tool of the cancer genome anatomy project.5 We considered precancer libraries as cancer libraries. All of the cDNA libraries were categorized according to tissue type (tissue origin), tissue histology (cancerous, normal, or fetal), and the library preparation method (microdissected, bulk, cell line, or flow cytometric sorted).

Data Preparation.
The data preparation module reads EST report files downloaded from NCBI6 and produces files in FASTA format for ESTs.

Tissue Distribution of ESTs/Genes.
In the tissue distribution module, binary indexes were used to assign tissue distribution to each prostate EST. A binary index is defined as a binary string of n bits, where n is an integer. A sequence of n nucleotides can be represented by n-bit binary index when purines (As and Gs) are assigned to 1 and pyrimidines (Ts and Cs) are assigned to 0. After a series of testing, we found that a 32-bit binary index is long enough to rule out a random occurrence, such that under most circumstances a unique 32-bit index represents a unique EST. In this study, we used 32-bit continuous indexes to represent ESTs, and every EST was converted into groups of continuous 32-bit binary indexes.

In the first step, the program declares, in random access memory, a DA table for all possible indexes, in which each entry has one byte (8 bits) of memory reservation. The first distinct 29 bits of a 32-bit index are used as the DA for that index in the table, and the decimal value (dv) of the last three bits will give the bit position to be set to 1 in that entry. For example, a 32-bit index, 11110011110011101111100100011111, has a direct entry address 11110011110011101111100100011 in the table, and the 7th bit of that entry is set to 1 [dv = 2 (2) + 2 (1) + 2 (0) = 7]. We ran this module on classified prostate dbEST and created a complete nonredundant DA table containing all prostate indexes.

The algorithm then sets up a tissue distribution table (OS table) using the DA table. Each entry of the OS table contains two fields: a 32-bit index and a 64-bit specificity string (there are 52 different tissue types in human dbEST). This module runs on the entire human dbEST to get every possible index and check if each index is a prostate index using the DA table. If an index is a prostate index, the module builds an entry of this index in the OS table as follows. The direct address of this entry will be the first 24 bits of the index (we use 24-bit addresses to reduce the table size). In that address, we store the complete 32-bit index and the 64-bit specificity string of which the bit corresponding to the tissue where this index comes from will be set to 1. Because we used external chaining to store the indexes that have identical first 24-bits but different last 8 bits, the OS table is also a nonredundant table. After running the human dbEST on this module, each entry in the DA table has a cumulative tissue distribution string in OS table showing all of the tissues corresponding to the index. Therefore, the tissue distribution of each prostate EST can be derived from the OS table by accumulating the tissue distribution of all of the indexes related to that EST. For the same procedure, the tissue distribution of an EST cluster (see below) or a gene can be calculated by accumulating the tissue distributions of all of the ESTs in that cluster.

EST Clustering.
To cluster the ESTs belonging to the same gene together, we created a clustering module in our binary indexing search algorithm. All of the prostate ESTs in FASTA formatted files generated by data preparation module are read. Each EST was then converted into groups of continuous 32-bit binary indexes. For every individual EST, if none of its indexes pointed to any previously built cluster, the EST and all its indexes are assigned to a new cluster. Otherwise, if any of the indexes of an EST pointed to an already built cluster, this EST would be assigned to that cluster, and all its indexes would be set to point to the same cluster.

Sorting and Selecting EST Clusters.
Within each prostate EST cluster, we grouped ESTs according to the tissue histology (cancerous or normal) and the library preparation method (microdissected, bulk, or cell line). This resulted in five groups of ESTs including NB, NM, CB, CM, and CL. Then, we sorted clusters according to the total EST counts, yielding information regarding gene expression level in prostate, differential expression level of the same gene between normal and cancer. Furthermore, we selected the EST clusters with statistically significant EST counts and/or the statistically significant differentially displayed clusters according to the Fisher’s exact test.

Experimental Procedures
Tissue Section and H&E Staining.
Five cases of Gleason score 6 OCT-embedded prostate frozen cancer tissues as well as their corresponding normal tissues were used. Ten-µm cryostat sections were stained using H&E and dehydrated completely in xylene, following protocols from Arcturus Engineering (Mountain View, CA).

LCM.
The LCM technique allows for isolating normal and cancerous prostate epithelial cells precisely and efficiently from among a mixture of normal, cancer, epithelial, and nonepithelial cells. Navigated-LCM was performed with a PixCell II apparatus (Arcturus) on H&E sections. LCM parameters included a laser power of 65 milliwatts, laser pulse duration of 1.2 ms, and laser spot size of 7.5–15 µm in diameter. The infrared laser was pulsed over cells of interest. We captured approximately the same number of cancerous and normal epithelial cells (~2,000) in each case.

RT-PCR.
After LCM, total RNA extraction from the captured cells was performed using the RNeasy kit (Qiagen). The LCM-captured cells were immediately placed into sterile 0.5-ml microcentrifuge tubes containing 200 µl of RLT reagent with 1% ß-mercaptoethanol and inverted at room temperature for 1 h before extraction of total RNA. The RNA extraction was performed according to the instructions from RNeasy kit. Total RNA was eluted into 30 µl of nuclease-free H2O. Oligo-dT primers and SuperScriptII reverse transcriptase (Life Technologies, Inc.) were used in reverse transcription. The reverse transcriptase was performed following the manufacturer’s protocol. After the reverse transcription reaction, 40 rounds of PCR amplification were performed using gene-specific primers. The gene-specific forward and reverse primers were designed using the Primer 3 program.

PCR amplifications were performed in a total volume of 20 µl, with 1 µl of the reverse transcription products, 1 unit of Taq polymerase, 0.5 µM of each primer, 0.25 mM of each deoxynucleotide triphosphate in 1x PCR buffer.

Real-Time RT-PCR.
To obtain a more accurate estimate of the changes in mRNA expression levels in cancerous prostate epithelial cells versus normal/benign prostate epithelial cells, real-time RT-PCR was used. We used the RT product from RT-PCR as templates in this experiment, together with the TaqMan universal PCR MasterMix (Applied Biosystems), gene-specific primers, and dual-labeled TaqMan probes designed using the Primer Express 1.5 software (PE Biosystems). The highly gene-specific regions that do not display any sequence homology with their close family members were chosen for amplification.


    RESULTS
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Development of a Binary Indexing Search Algorithm to Analyze Prostate ESTs.
All of the prostate cDNA libraries were categorized using GLS tools according to tissue histology (cancerous or normal) and the library preparation method (microdissected, bulk, or cell line). The Binary Indexing Search Algorithm was used to analyze the classified EST data. The Binary Indexing Search Algorithm contains four modules: a data preparation module for converting the publicly available EST information to FASTA formatted files; a tissue distribution module for assigning tissue distributions to each index/EST/gene; an EST clustering module for clustering ESTs by different genes; and a selection module for sorting EST clusters according to the number of ESTs in each cluster, and selecting the genes of interest. The data flow between these four modules is shown in Fig. 1Citation .



View larger version (37K):
[in this window]
[in a new window]
[Download PPT slide]
 
Fig. 1. Data flow within the Binary Indexing Search Algorithm. The arrow symbolizes the data direction. Rectangles with dotted borders indicate different modules. Raw EST sequences were imported from prostate EST libraries and classified using the publicly available GLS tool. Different EST pools were processed through the Binary Indexing Search Algorithm. Each periodic result (as indicated in rectangles) was saved and directly accessible.

 
Identification of Differentially Expressed Genes between Normal and Cancer Prostate Using Binary Indexing Search Algorithm.
To date, human prostate EST libraries have >130,000 prostate ESTs, including ESTs from NB, NM, CB, CM, and CL libraries. The prostate EST database is the fifth largest EST database after colon, kidney, lung, and breast libraries, having EST entries from close to 300 prostate libraries. We postulate that, given the size of prostate cDNA libraries, the gene expression level in prostate correlates with the number of ESTs in the corresponding gene cluster.

Prostate adenocarcinoma is of epithelial origin. Therefore, we confined our analysis to CM and NM pools, which contain 22,776 and 9,967 ESTs, respectively, to avoid erroneous calculation of expression levels attributable to contamination from nonepithelial cells. After analyzing these two EST pools using the Binary Indexing Search Algorithm, the expression profile of close to 600 EST clusters in prostate CM and NM were identified. Fig. 2Citation displays a plot that represents the distribution of gene regulation in prostate for these 600 genes. EST clusters (distinct genes) were grouped according to a pseudo-ratio (r) of normalized numbers of ESTs in NM and CM pools, as follows:

for up-regulated genes, where nnm = nm(CM/NM).

for down-regulated genes (nm, the number of gene-related ESTs from NM; cm, the number of gene-related ESTs from CM; CM and NM represent the total number of ESTs in CM and NM libraries; MAX, the function to return the maximum value of input parameters; INT, the function to return the integer value of input parameter).



View larger version (14K):
[in this window]
[in a new window]
[Download PPT slide]
 
Fig. 2. Expression regulation plot of EST clusters from prostate dbEST. Negative r values represent a down-regulation in prostate cancer, whereas positive r values indicate an overexpression of EST clusters in prostate cancer. An r value of 0 suggests the proximately equal expression between normal and cancerous prostate.

 
To avoid infinite values for r, the minimum value of the denominators was 1. The r value for down-regulated genes was negated to separate down-regulated genes from the up-regulated genes in the plot. The number 1 was subtracted from the quotient so that r for nonregulated genes was assigned to 0. The distribution curve of prostate gene regulation can be illustrated by plotting the histogram of r values versus the number of clusters (Fig. 2)Citation . Although r does not precisely describe the differential expression for clusters with very few ESTs, this plot provides an estimate of the differential expression of genes between normal and cancerous prostate epithelium. As shown in Fig. 2Citation , the largest group of EST clusters consists of genes that equally express between normal and prostate cancer. There are close to 160 genes in this group including prostate-specific genes such as the PSA (KLK3 and PSA), prostrate kallikrein 2 (KLK2 and Hk2), prostrate acid phosphatase (PAP), and the housekeeping gene GAPDH. The most up-regulated gene in prostate cancer identified by our algorithm was CRISP-3, with an r of 32. The group of down-regulated genes, including a tumor suppressor gene DAN, had r values as low as -23. According to Fisher’s exact test, there had to be at least 8 ESTs in the CM pool for up-regulated genes and at least 3 ESTs in NM pool for down-regulated genes to be a statistically significant difference between CM and NM. Thirty-three of 600 genes that passed the Fisher’s exact test and the 1-tail test are listed in Table 1Citation . Additionally, we also included differentially expressed genes that had an |r| >=3 even though they failed the Fisher’s exact test and the 1-tail test. The complete list of these 600 genes is available on line.7


View this table:
[in this window]
[in a new window]

 
Table 1 Differentially displayed genes in normal and cancerous prostate

 
In this study, we further analyzed 9 genes from Table 1Citation , including CRISP-3, DAN, TARP, RT14 (general control of amino acid synthesis homologue-like, GCN5L1), developmental arteries and neural crest epidermal growth factor-like gene (DANCE), tropomyosin, NPD017, and ubiquinol-cytochrome c reductase core protein 2 (UBIQ), and semenogelin II. CRISP-3 and TARP are two putatively up-regulated genes. CRISP-3 belongs to a highly conserved protein family and is found in salivary gland, pancreas, prostate, epididymis, ovary, thymus, and colon by Northern blot analysis (9) . Little is known about the function of CRISP-3, and its expression has never been reported in prostate cancer. TARP is a segment of the gamma chain transcript for the T-cell receptor with {delta} chain silent (10) and has been reported previously as a prostate-specific gene. For the 7 down-regulated genes, DAN has been reported as a tumor suppressor gene (11) . RT14 is a human cDNA clone homologous to GCN5 (general control of amino acid synthesis-5), a yeast transcription activator. The function of RT14 is not clear. The DANCE gene encodes for a secretory protein that has been shown to promote adhesion of endothelial cells (12) . The DANCE gene or its protein product has never been connected to prostate cancer. Tropomyosin is a member of the family of actin filament-binding proteins and has been reported to be down-regulated in prostate cancer (13) . NPD017 is a novel gene without a putative ID in dbEST. The finding of semenogelin II as a down-regulated gene in prostate cancer is unexpected. Semenogelins are the predominant proteins in human semen and are the major proteins involved in the gelatinous entrapment of ejaculated spermatozoa (14) . They are synthesized by the secretory epithelium of the seminal vesicles and should not be present in prostate microdissected EST libraries. It is possible that semenogelin II ESTs in prostate NM and CM libraries are the result of contamination of seminal vesicle cells in prostate tissue cDNA samples. In fact, when we carefully selected only cancer or normal prostate epithelial cells by LCM and performed RT-PCR, the semenogelin II band was absent, even after 40 cycles of the PCR reaction (Fig. 4d)Citation . Table 1Citation also includes several genes from the expression distribution curve with no differential expression to use as controls for normalization. PSA, Hk2, and PAP are prostate epithelium-specific genes, and GAPDH is a widely used housekeeping gene expressed in all cells. Although serum concentrations of PSA and Hk2 are elevated in most prostate cancer patients (15) , our analysis in a few samples shows that the per cell mRNA expression levels of PSA and Hk2 do not change between normal and malignant prostatic epithelia.



View larger version (78K):
[in this window]
[in a new window]
[Download PPT slide]
 
Fig. 4. a, H&E-stained prostate sections before LCM. Arrows, normal and cancerous prostate ducts. b, after LCM, normal or cancerous epithelial cells were absent. c, the transferred epithelial cells on CapSure film. These pictures are representations of the LCM process. In the experiments, we separated normal and cancerous LCM-captured epithelial cells on different CapSure films. d, RT-PCR of selected genes in prostate cancer and normal tissues. This is a representative case of 5 investigated Gleason Score 6 (GS6) cases. The left- and rightmost lanes (M) are 100-bp markers. Lanes with odd numbers are from cancer epithelial cells, and lanes with even numbers are from normal epithelial cells.

 
Relative Expression Level of 9 Genes in Normal and Cancer Prostate Epithelium.
The Binary Indexing Search Algorithm also computed the putative relative expression level in CM and NM prostate EST pools of 9 genes discussed above, along with Hk2, PSA, GAPDH, and PAP as standards (Fig. 3)Citation . We found that PSA is the highest expressed gene in both prostate cancer and normal epithelial cells, which represents about 1.3% of total ESTs in both microdissected libraries (CM and NM).



View larger version (21K):
[in this window]
[in a new window]
[Download PPT slide]
 
Fig. 3. Relative expression level of genes in dbEST calculated by Binary Indexing Search Algorithm. As shown, PSA is the highest expressed gene in prostate NM and CM EST databases, which represents ~1.3% of total transcripts. PAP, Hk2, and GAPDH are also highly expressed in prostate epithelial cells. CRISP-3 (~0.15% of total cancer transcripts) and TARP (~0.1% of total cancer transcripts) are overexpressed in prostate cancer. Semenogelin II, DAN, TROPOMS (tropomyosin), NPD017, RT14, DANCE, and UBIQ (ubiquinol-cytochrome c reductase core protein 2) are approximately 0.5–0.13% of total transcripts in prostate NM libraries.

 
In addition, from our electronic profiling, CRISP-3 is relatively prostate specific (CRISP-3 is found in four organs; Table 1Citation ) and highly expressed in prostate cancer (0.15% of total expressed cancer transcripts; Fig. 3Citation ). The putative high expression level of CRISP-3 in cancer, which is higher than or as least as high as that of GAPDH, and its secretory feature make it a potential candidate for a highly sensitive and specific diagnostic marker for prostate cancer.

RT-PCR Confirms the Expression Regulation of CRISP-3 and DAN in Primary Prostate Cancer.
We examined the putative up-regulation of CRISP-3 and the putative down-regulation of DAN in prostate cancer using RT-PCR. PSA, Hk2, PAP, and GAPDH were used as standards. Total RNAs from ~2,000 LCM-captured prostate epithelial cells (Gleason score 6 and the corresponding normal/ benign) were used in RT-PCR (Fig. 4, a–c)Citation . As shown in Fig. 4dCitation , CRISP-3 and DAN exhibit different expression patterns between normal and cancer, whereas standards PSA, Hk2, PAP, and GAPDH show approximately equal expression levels in cancer and normal. We investigated 5 Gleason Score 6 prostate cancer cases and obtained similar results in all 5 cases.

Quantification of Differential Expression of CRISP-3 and DAN Using Real-Time RT-PCR.
To obtain a quantitative estimate of expression levels of the above 6 genes, real-time PCRs was performed next using TaqMan PE 7700 system (Applied Biosystems). The TaqMan assay uses the 5'->3' exonuclease activity of Taq DNA polymerase and a fluorogenic probe for automated quantification of DNA in a real-time manner. The CT value refers to the threshold cycle at which a statistically significant increase in fluorescence is first detected by the sequence detection system. The increase in fluorescence is directly proportional to the exponential increase in PCR products, and the measurement of signal is carried out in a real-time manner.

Because of the possible different amplification efficiencies of different genes, a validation experiment to calculate individual amplification efficiency for each gene was performed. The total RNA from the prostate cancer cell line LNCaP, which contains mRNA from all 6 genes, was used as the starting material. The CT values were obtained from starting total RNA of 10, 5, 2.5, 1.25, 0.625, and 0.3125 ng for each gene. The amplification efficiency of each gene was calculated following the manufacturer’s procedure (Table 2)Citation . The differential expression of a gene in cancer is computed as e(nCT - cCT), where e is the amplification efficiency and nCT and cCT denote the CT for normal and cancer, respectively. As shown in the table, GAPDH, PSA, Hk2, and PAP are approximately equally expressed in cancer and normal prostate epithelial cells. CRISP-3 is significantly up-regulated (50–300-fold) in prostate cancer, whereas the tumor suppressor gene DAN is down-regulated 84–94% in prostate cancer when compared with normal prostate epithelium.


View this table:
[in this window]
[in a new window]

 
Table 2 Differential expression of six genes in cancer and normal prostate epithelial cells using real-time RT-PCR

 
Comparison of the Binary Indexing Search Algorithm Profiling and Other Profiling Methods.
Some differentially expressed genes are identified by other profiling methods including microarray, GeneChip, and SAGE that were not present in our profiling results (Table 3)Citation . Among these, hepsin has been consistently reported as a significantly up-regulated gene in prostate cancer using microarray and GeneChip analyses (16) . However, our study showed that there are only 3 hepsin ESTs in the prostate CM pool, and hepsin did not pass the Fisher’s exact test. Cytokeratins 8, 18, and 19 are reported down-regulated in high grade cancer (Gleason score 8–9; Ref. 17 ), whereas the r value according to our calculation is 0. This may be because the CM pool used in this study does not distinguish between cancer grades. Other genes in Table 3Citation such as glutathione S-transferases (18) , {alpha}-methyl-CoA-racemase (19) , KIAA 1538 protein (20) , human tyrosine kinase receptor axl (21) , and PSGR (22) were not present in our data sets because they did not pass the Fisher’s exact test or did not have ESTs in the CM and NM pool.


View this table:
[in this window]
[in a new window]

 
Table 3 Differentially expressed genes between normal and cancerous prostate identified using microarray/genechip, SAGE, and/or other methods

 
We compared the lists of differentially expressed genes identified by other profiling methods to our analysis of the dbEST (Table 1)Citation . None of the four characterized most up-regulated genes identified by our profiling method, CRISP-3, 6.2kd protein (LOC54543), KIAA1666 protein, and fatty acid binding protein 5, or the 4 characterized most down-regulated genes NPD017, creatine kinase-like gene, UBIQ, and hypothetical protein FLJ21174 were identified using other profiling methods.

Microarray and GeneChip contain a limited probe set, usually 5,000–6,000 known genes. We found that, except for the hypothetical protein FLJ21174, none of the other 8 up-regulated genes identified by our analysis were included in most microarray/GeneChip probe sets. In addition, microarray, GeneChip, and other profiling technologies usually require relatively large quantities of total RNA. As a consequence, bulk tissues (normal and cancerous) instead of LCM-captured cells are routinely used in the analysis. These profiling methods may identify genes that are not specific to normal or malignant prostatic epithelia. This may also add to the discrepancy in genes identified in our analysis when compared with other gene profiling methods. As result, the Binary Indexing Search Algorithm offers additional insight into gene expression that other profiling studies may not be able to provide.


    DISCUSSION
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
In the past decade, an effort to sequence expressed genes in different tissues has made analysis of the dbEST, an alternative resource for gene expression profile analysis. Our study combined analysis of the dbEST using a novel Binary Indexing Search Algorithm with traditional molecular biology techniques to identify genes that are up- and down-regulated in prostate cancer. Using the Binary Indexing Search Algorithm, we performed analysis on the human prostate EST database and identified a list of tumor-regulated genes in prostate cancer. Compared with traditional search algorithms, the Binary Indexing Search Algorithm uses direct addressing to distribute indexes and EST data in memory. Therefore, it avoids n2 comparisons between sequences and is able to quickly identify tissue specificity and cluster ESTs. Although it demands moderately large memory capacity, it uses only a few hours of CPU time to perform the entire procedure. In addition, our approach takes advantage of the publicly available prostate cDNA libraries. Therefore, unlike other approaches such as SAGE or DNA microarrays, the laborious experimental step before electronic profiling is not necessary.

Some consideration should be taken while viewing the output of the Binary Indexing Search Algorithm. For instance, because of the small size of some EST libraries from different organs or tissue types, there may be more organs or tissue types where the gene is expressed than identified by the algorithm. For example, CRISP-3 was found in four organs according to the computational calculation. However, CRISP-3 was found in seven different organs by multiple tissue dot blot including salivary gland, pancreas, prostate, ovary, thymus, testis, and colon.8

Prostate cancer is the most commonly diagnosed noncutaneous malignancy and the second leading cause of cancer-related deaths in the Western male population (23) . Currently, measurement of the serum PSA is the most sensitive biomarker for the detection of prostatic adenocarcinoma. However, an elevation in the serum PSA lacks specificity, and serum PSA may be elevated in common benign conditions of the prostate such as prostatitis and benign prostatic hypertrophy. Numerous studies have shown that only 25% of patients with an elevated serum PSA level between 4 and 10 ng/ml have an adenocarcinoma detected on prostate needle biopsy (24 , 25) . The lack of specificity for PSA results in unnecessary prostate needle biopsy procedures and patient anxiety. More specific biomarkers for prostate cancer are needed to improve our ability to detect prostate cancer.

Many studies, as well as our electronic profiling results, have shown that prostate cancer expression levels of PSA on a per cell basis do not change and may even decrease in high-grade prostate cancers.9

Our data demonstrate that the per-cell CRISP-3 mRNA level is significantly up-regulated in prostate cancer compared with normal tissue. In addition, there is evidence to indicate that CRISP-3 is a secretory protein. Therefore, CRISP-3 is a potential diagnostic marker for prostate cancer, and our subsequent studies will focus on the utility of CRISP-3 as a diagnostic marker for prostate cancer. Similar to PSA, CRISP-3 is also androgen responsive (26) , although the functions of CRISP-3 and its close family members remain largely unknown.


    FOOTNOTES
 
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1 These authors contributed equally to this work. Back

2 To whom requests for reprints should be addressed, at Mayo Clinic and Foundation, 200 First Street, SW, Rochester, MN 55905. Phone: (507) 266-4617; Fax: (507) 266-5193; E-mail: vasmatzis.george{at}mayo.edu. Back

3 The abbreviations used are: SAGE, serial analysis of gene expression; EST, expressed sequence tag; LCM, laser capture microdissection; GLS, Gene Library Summarizer; DA, direct addressing; RT-PCR, reverse transcription-PCR; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; NM, normal microdissected; CM, cancer microdissected; CB, cancer bulk; NB, normal bulk; CRISP-3, cysteine-rich secretory protein 3; TARP, T-cell receptor gamma transcript; PSA, prostate-specific antigen. Back

4 Internet address: http://www.ncbi.nlm.nih.gov/dbEST/index.html. Back

5 Internet address: http://cgap.nci.nih. gov/Tissues/LibrarySummarizer. Back

6 Internet address: ftp://ncbi.nlm.nih.gov/repository/dbEST. Back

7 Internet address: http://www.mayo.edu/research/expath/prostate.html. Back

8 Y. W. Asmann, unpublished work. Back

9 Unpublished data. Back

Received 10/ 2/01. Accepted 4/ 5/02.


    REFERENCES
 Top
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Velculescu V. E., Zhang L., Vogelstein B., Kinzler K. W. Serial analysis of gene expression. Science (Wash. DC), 270: 484-487, 1995.[Abstract/Free Full Text]
  2. Schena M., Shalon D., Davis R. W., Brown P. O. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science (Wash. DC), 270: 368-369, 371, 1995.[Abstract/Free Full Text]
  3. Adams M. D., Kerlavage A. R., Fleischmann R. D., Fuldner R. A., Bult C. J., Lee N. H., Kirkness E. F., Weinstock K. G., Gocayne J. D., White O., et al Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence. Nature (Lond.), 377(Suppl.): 3-174, 1995.[Medline]
  4. Boguski M. S., Tolstoshev C. M., Bassett D. E., Jr. Gene discovery in dbEST. Science (Wash. DC), 265: 1993-1994, 1994.[Free Full Text]
  5. Emmert-Buck M. R., Bonner R. F., Smith P. D., Chuaqui R. F., Zhuang Z., Goldstein S. R., Weiss R. A., Liotta L. A. Laser capture microdissection. Science (Wash. DC), 274: 998-1001, 1996.[Abstract/Free Full Text]
  6. Boguski M. S., Schuler G. D. ESTablishing a human transcript map. Nat. Genet., 10: 369-371, 1995.[Medline]
  7. Vasmatzis G., Essand M., Brinkmann U., Lee B., Pastan I. Discovery of three genes specifically expressed in human prostate by expressed sequence tag database analysis. Proc. Natl. Acad. Sci. USA, 96: 300-304, 1998.
  8. Brinkmann U., Vasmatzis G., Lee B., Yerushalmi N., Essand M., Pastan I. PAGE-1, an X chromosome-linked GAGE-like gene that is expressed in normal and neoplastic prostate, testis, and uterus. Proc. Natl. Acad. Sci. USA, 95: 10757-10762, 1998.[Abstract/Free Full Text]
  9. Kratzschmar J., Haendler B., Eberspaecher U., Roosterman D., Donner P., Schleuning W. D. The human cysteine-rich secretory protein (CRISP) family: primary structure and tissue distribution of CRISP-1, CRISP-2 and CRISP-3. Eur. J. Biochem., 236: 827-836, 1996.[Medline]
  10. Essand M., Vasmatzis G., Brinkmann U., Duray P., Lee B., Pastan I. High expression of a specific T-cell receptor gamma transcript in epithelial cells of the prostate. Proc. Natl. Acad. Sci. USA, 96: 9287-9292, 1999.[Abstract/Free Full Text]
  11. Hanaoka E., Ozaki T., Nakamura Y., Moriya H., Nakagawara A., Sakiyama S. Overexpression of DAN causes a growth suppression in p53-deficient SAOS-2 cells. Biochem. Biophys. Res. Commun., 278: 20-26, 2000.[Medline]
  12. Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M., Furukawa Y., Kobuke K., Tashiro K., Lu Z., Honjo T. DANCE, a novel secreted RGD protein expressed in developing, atherosclerotic, and balloon-injured arteries. J. Biol. Chem., 274: 22476-22483, 1999.[Abstract/Free Full Text]
  13. Wang F. L., Wang Y., Wong W. K., Liu Y., Addivinola F. J., Liang P., Chen L. B., Kantoff P. W., Pardee A. B. Two differentially expressed genes in normal human prostate tissue and in carcinoma. Cancer Res., 56: 3634-3637, 1996.[Abstract/Free Full Text]
  14. Lilja H. Cell biology of semenogelin. Andrologia, 22(Suppl.1): 132-141, 1990.
  15. Robbins A. S. PSA and the detection of prostate cancer. J. Am. Med. Assoc., 271: 192-193, 1994.[Abstract/Free Full Text]
  16. Dhanasekaran S. M., Barrette T. R., Ghosh D., Shah R., Varambally S., Kurachi K., Pienta K. J., Rubin M. A., Chinnaiyan A. M. Delineation of prognostic biomarkers in prostate cancer. Nature (Lond.), 412: 822-826, 2001.[Medline]
  17. Alberti I., Barboro P., Barbesino M., Sanna P., Pisciotta L., Parodi S., Nicolo G., Boccardo F., Galli S., Patrone E., Balbi C. Changes in the expression of cytokeratins and nuclear matrix proteins are correlated with the level of differentiation in human prostate cancer. J. Cell. Biochem., 79: 471-485, 2000.[Medline]
  18. Chetcuti A., Margan S., Mann S., Russell P., Handelsman D., Rogers J., Dong Q. Identification of differentially expressed genes in organ-confined prostate cancer by gene expression array. Prostate, 47: 132-140, 2001.[Medline]
  19. Xu J., Stolk J. A., Zhang X., Silva S. J., Houghton R. L., Matsumura M., Vedvick T. S., Leslie K. B., Badaro R., Reed S. G. Identification of differentially expressed genes in human prostate cancer using subtraction and microarray. Cancer Res., 60: 1677-1682, 2000.[Abstract/Free Full Text]
  20. Waghray A., Schober M., Feroze F., Yao F., Virgin J., Chen Y. Q. Identification of differentially expressed genes by serial analysis of gene expression in human prostate cancer. Cancer Res., 61: 4283-4286, 2001.[Abstract/Free Full Text]
  21. Huang G. M., Ng W. L., Farkas J., He L., Liang H. A., Gordon D., Yu J., Hood L. Prostate cancer expression profiling by cDNA sequencing analysis. Genomics, 59: 178-186, 1999.[Medline]
  22. Xu L. L., Stackhouse B. G., Florence K., Zhang W., Shanmugam N., Sesterhenn I. A., Zou Z., Srikantan V., Augustus A., Roschke V., Carter K., McLeod D. G., Moul J. W., Soppett D., Srivastava S. PSGR, a novel prostate-specific gene with homology to a G protein-coupled receptor, is overexpressed in prostate cancer. Cancer Res., 60: 6568-6572, 2000.[Abstract/Free Full Text]
  23. Ries L. A., Wingo P. A., Miller D. S., Howe H. L., Weir H. K., Rosenberg H. M., Vernon S. W., Cronin K., Edwards B. K. The annual report to the nation on the status of cancer, 1973–1997, with a special section on colorectal cancer. Cancer (Phila.), 88: 2398-2424, 2000.[Medline]
  24. Djavan B., Zlotta A., Kratzik C., Remezi M., Seitz C., Schulman C. C., Marberger M. PSA, PSA density, PSA density of transition zone, free/total PSA ratio, and PSA velocity for early detection of prostate cancer in men with serum PSA 2.5 to 4.0 ng/ml. Urology, 54: 517-522, 1999.[Medline]
  25. Bjork T., Lilja H., Christensson A. The prognostic value different forms of prostate specific antigen and their ratios in patients with prostate cancer. BJU Int., 84: 1021-1027, 1999.[Medline]
  26. Schwidetzky U., Haendler B., Schleuning W. D. Isolation and characterization of the androgen-dependent mouse cysteine-rich secretory protein-3 (CRISP-3) gene. Biochem. J., 309: 831-836, 1995.



This article has been cited by other articles:


Home page
Endocr. Rev.Home page
G. M. Gibbs, K. Roelants, and M. K. O'Bryan
The CAP Superfamily: Cysteine-Rich Secretory Proteins, Antigen 5, and Pathogenesis-Related 1 Proteins--Roles in Reproduction, Cancer, and Immune Defense
Endocr. Rev., December 1, 2008; 29(7): 865 - 897.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
A. S. Bjartell, H. Al-Ahmadie, A. M. Serio, J. A. Eastham, S. E. Eggener, S. W. Fine, L. Udby, W. L. Gerald, A. J. Vickers, H. Lilja, et al.
Association of Cysteine-Rich Secretory Protein 3 and {beta}-Microseminoprotein with Outcome after Radical Prostatectomy
Clin. Cancer Res., July 15, 2007; 13(14): 4130 - 4138.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Vasmatzis, E. W. Klee, D. M. Kube, T. M. Therneau, and F. Kosari
Quantitating tissue specificity of human genes to facilitate biomarker discovery
Bioinformatics, June 1, 2007; 23(11): 1348 - 1355.
[Abstract] [Full Text] [PDF]


Home page
J AndrolHome page
L. Udby, A. Bjartell, J. Malm, A. Egesten, A. Lundwall, J. B. Cowland, N. Borregaard, and L. Kjeldsen
Characterization and Localization of Cysteine-Rich Secretory Protein 3 (CRISP-3) in the Human Male Reproductive Tract
J Androl, May 1, 2005; 26(3): 333 - 342.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
J. T. Posey, M. S. Soloway, S. Ekici, M. Sofer, F. Civantos, R. C. Duncan, and V. B. Lokeshwar
Evaluation of the Prognostic Potential of Hyaluronic Acid and Hyaluronidase (HYAL1) for Prostate Cancer
Cancer Res., May 15, 2003; 63(10): 2638 - 2644.
[Abstract] [Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
F. Kosari, Y. W. Asmann, J. C. Cheville, and G. Vasmatzis
Cysteine-rich Secretory Protein-3: A Potential Biomarker for Prostate Cancer
Cancer Epidemiol. Biomarkers Prev., November 1, 2002; 11(11): 1419 - 1426.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Asmann, Y. W.
Right arrow Articles by Vasmatzis, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Asmann, Y. W.
Right arrow Articles by Vasmatzis, G.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Cancer Research Clinical Cancer Research
Cancer Epidemiology Biomarkers & Prevention Molecular Cancer Therapeutics
Molecular Cancer Research Cancer Prevention Research
Cancer Prevention Journals Portal Cancer Reviews Online
Annual Meeting Education Book Meeting Abstracts Online