| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Endocrinology |
1 Surgery Branch, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland; 2 Department of Surgery, Division of Endocrine and Oncologic Surgery, The Johns Hopkins Medical Institutions, Baltimore, Maryland; 3 The Weinberg Cancer Center, Baltimore, Maryland; and 4 Division of Surgical, Molecular and Ultrastructural Pathology, Department of Oncology, University and University Hospital, Pisa, Italy
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Thyroid carcinoma represents 1% of all malignant diseases but 90% of all neuroendocrine malignancies. It is estimated that 510% of the population will develop a clinically significant thyroid nodule during their lifetime (6) . The best available test in the evaluation of a patient with a thyroid nodule is fine needle aspiration biopsy (FNA; Ref. 7 ). Of the malignant FNAs, the majority are from papillary thyroid cancers (PTCs) or its follicular variant (FVPTC). These can be easily diagnosed if they have the classic cytological features, including abundant cellularity and enlarged nuclei containing intranuclear grooves and inclusions (8) . Indeed, one-third of the time these diagnoses are clear on FNA. FNA of thyroid nodules has greatly reduced the need for thyroid surgery and has increased the percentage of malignant tumors among excised nodules (9 , 10) . In addition, the diagnosis of malignant thyroid tumors, combined with effective therapy, has led to a marked decrease in morbidity due to thyroid cancer. Unfortunately, many thyroid FNAs are not definitively benign or malignant, yielding an indeterminate or suspicious diagnosis. The prevalence of indeterminate FNAs varies but typically ranges from 10 to 25% of FNAs (11, 12, 13) . In general, thyroid FNAs are indeterminate because of overlapping or undefined morphological criteria for benign versus malignant lesions or focal nuclear atypia within otherwise benign specimens. Of note, twice as many patients are referred for surgery for a suspicious lesion (10%) than for a malignant lesion (5%), an occurrence that is not widely appreciated because the majority of FNAs are benign. Therefore, when the diagnosis is unclear on FNA, these patients are classified as having a suspicious or indeterminate lesion only. It is well known that frozen section analysis often yields no additional information.
The question then arises: should the surgeon perform a thyroid lobectomy, which is appropriate for benign lesions or a total thyroidectomy, which is appropriate for malignant lesions when the diagnosis is uncertain both preoperatively and intraoperatively? Thyroid lobectomy as the initial procedure for every patient with a suspicious FNA could result in the patient with cancer having to undergo a second operation for completion thyroidectomy. Conversely, total thyroidectomy for all patients with suspicious FNA would result in a majority of patients undergoing an unnecessary surgical procedure, requiring lifelong thyroid hormone replacement and exposure to the inherent risks of surgery (14) .
There is a compelling need to develop more accurate initial diagnostic tests for evaluating a thyroid nodule. Recent studies suggest that gene expression data from cDNA microarray analysis holds promise for improving tumor classification and for predicting response to therapy among cancer patients (15, 16, 17) . No clear consensus exists regarding which computational tool is optimal for the analysis of large gene expression profiling data sets, especially when they are used to predict outcome (18) .
Barden et al. (19) recently showed that in a group of 17 thyroid follicular tumors, 105 genes were found to be significantly different between follicular adenoma (FA) and carcinoma. These 105 genes were informative enough to diagnosis correctly 5 follicular tumors for which the final diagnosis was undisclosed (19) . In this study, we also describe the use of gene expression profiling and supervised machine learning algorithms to construct a molecular classification scheme for thyroid tumors (20) . The gene expression signatures discovered in this study include new tumor-related genes whose encoded proteins may be useful for improving the diagnosis of thyroid tumors.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Tissue Processing and Isolation of RNA.
Frozen sections of 100300 mg of tissue were collected in test tubes containing 1 ml of Trizol. Samples were transferred to FastRNA tubes containing mini beads and homogenized in a FastPrep beater (Bio101Savant, Carlsbad, CA) for 1.5 min at speed 6. The lysate was transferred to a new tube, and total RNA was extracted according to the Trizol protocol (Molecular Research Center, Inc., Cincinnati, OH). Approximately 12 µg of total RNA were obtained from each tumor sample. The total RNA was then subjected to two rounds of amplification following the modified Eberwine method (21
, 22)
. resulting in
42 µg of mRNA. The quality of the extracted RNA was tested by spectrophotometry and by evaluations on minichips (BioAnalyzer; Agilent Technologies, Palo Alto, CA).
Microarray Analysis.
Hybridization was performed on 10,000 human cDNA gene microarrays, Hs-UniGem2, produced by the National Cancer Institute/NIH (ATC, Gaithersburg, MD). Comparisons were made for each tumor with the same control which consisted of amplified RNA extracted from normal thyroid tissue and provided by Ambion, Inc. (Austin, TX). Fluorescent marker dyes (Cy5 and Cy3) were used to label the test and control samples, respectively. The respective dyes and samples were also switched to test for any labeling bias. The mixture of the two populations of RNA species was then hybridized to the same microarray and incubated for 16 h at 42°C. cDNA microarrays were then washed and scanned using the GenePix 4000B (Axon Instruments Inc., Union City, CA), and images were analyzed with GenePix software version 3.0. For each sample, a file containing the image of the array and an Excel file containing the expression ratio values for each gene was uploaded onto the MadbArray web site (National Center for Biotechnology Information/NIH) for additional analysis.5
To accurately compare measurements from different experiments, the data were normalized, and the ratio (signal Cy5/signal Cy3) was calculated so that the median (ratio) was 1.0.
Statistical Analysis.
Data from the 73 thyroid tumors was used to build a benign (FA and HN) versus malignant (PTC and FVPTC) expression ratio-based model, capable of predicting the diagnosis (benign versus malignant) of each sample. After normalization, a file containing the gene expression ratio values from all 73 samples was imported into a statistical analysis software package (Partek, Inc., St. Charles, MO).
Samples were divided in two sets: one set (63 samples) was used to train the diagnosis predictor model, and a second set (8) was used as a validation set to test the model. These 10 samples were not previously used to do any other analysis.
As a first step, the data from the 63 samples were subjected to principal component analysis (PCA) to perform an exploratory analysis and to view the overall trend of the data. PCA is an exploratory technique that describes the structure of high-dimensional data by reducing its dimensionality. It is a linear transformation that converts n original variables (gene expression ratio values) into n new variables or PCs, which have three important properties: they (a) are ordered by the amount of variance explained; (b) are uncorrelated; and (c) explain all variation in the data. The new observations (each array) are represented by points in a three-dimensional space. The distance between any pair of points is related to the similarity between the two observations in high-dimensional space. Observations that are near each other are similar for a large number of variables and conversely, the ones that are far apart are different for a large number of variables.
An ANOVA test with Bonferroni correction was then used to identify genes that were statistically different between the two groups. The resulting significant genes were used to build a diagnosis-predictor model. Variable (gene) selection analysis with cross-validation was performed different times, each time testing a different number of gene combinations. For cross-validation, we used the leave-one-out method to estimate the accuracy of the output class prediction rule: the whole data set was divided into different parts, and each part was individually removed from the data set. The remaining data set was used to train a class prediction rule, and the resulting rule was applied to predict the class of the held-out sample.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
It is important to point out that the mRNA extracted from each sample was amplified. We have found that the quality of the arrays and the data derived from them is superior when mRNA has been amplified from total RNA. Our lab, as well as others, have published specific papers in this regard that have demonstrated increased sensitivity without loss of quality or a bias in the results (23) . Of note, all samples and all reference controls were amplified in the same fashion.
Analysis of the overall gene expression profiles revealed that the benign lesions (FA, HN) could be distinguished from the malignant lesions (PTC, FVPTC). Furthermore, although not statistically significant, the four tumor subtypes appeared to have different gene profiles. The use of a powerful statistical analysis program (Partek, Inc.) helped us to discover a group of 11 genes that were informative enough to create a predictor model. Two combinations were created out of these 11 genes, a combination of 6 genes, and a combination of 10 genes. PCA of the 6 most informative genes resulted in a nearly perfect distinction between the two groups (Fig. 3, A and B)
. In general, PCA describes similarities between samples and is not a commonly used tool for predicting diagnosis. However, in this study, the distinction was so powerful that we were able to visually make a correct diagnosis for each of the 10 unknown samples (Fig. 3, A and B)
.
However, the predictor model determines the kind of tumor with a specific probability value. We were able to correctly predict the diagnosis of all 10 unknown samples, with a more accurate prediction using the 6 gene combination (Table 3
; see Ps).
It is clear from the graph in Fig. 4
how the combination of gene expression values gives a distinctly different profile between the benign and malignant lesions. However, within each tumor group, there are differences among the profiles of the five samples tested. This could be explained by the fact that each tumor, even if of the same type, could be at a different stage of progression.
Of the 11 genes that were informative for the diagnosis, five genes are known genes, and for the other six genes, no functional studies are yet available (Table 4)
. The genes that were identified are the ones that the model has determined best group the known samples into their correct diagnosis. Those genes identified are the ones that consistently grouped the samples into the categories and subcategories described in the text. This type of pattern assignment is based on the analysis of thousands of genes and the recognition by the computer software that certain patterns are associated repeatedly with certain diagnostic groups. This type of analysis derives it power (and significance) by the number of genes that are analyzed, rather than the degree of up or down-regulation of any particular gene. With respect to the specific genes identified, the computer is not biased by the knowledge of previously identified genes associated with thyroid cancer. The genes it identifies are those that best differentiate the varied diagnoses of the known samples. This occurs during the training phase of establishing the algorithm. Once the computer is trained with data from comparisons of RNA from known diagnoses to a standard reference, unknowns can be tested and fit to the diagnostic groups predicted during the training. For the purposes of such an approach, individual genes are less important. A specific gene, which is found in a univariate study to be associated with thyroid cancer, may not turn out to be the best multivariate predictor of a diagnosis in an analysis such as the one presented here.
The present study is a clear example of how gene expression profiling can provide highly useful diagnostic information. Recently, Barden et al. (19) carried out a similar study and obtained a group of 105 genes significantly different between malignant and benign tumors. It is almost certain that analogous studies will yield similar results for many other types of tumors. It is likely that gene expression profiling will be used in the future for clinical decision making. For this purpose, adequate reporting of DNA microarray data to clinicians will be necessary. Gene expression profiles may be more reproducible and clinically applicable than well established but highly subjective techniques such as histopathology. The small number of genes for which RNA expression levels are diagnostically and prognostically relevant raises strong hope that a robust, affordable, commercially available testing system could soon emerge. We hope that the results presented here will provide a useful method for classifying thyroid nodules as benign or malignant and therefore help facilitate appropriate, and eliminate unnecessary, operations in patients with suspicious thyroid tumors.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Requests for reprints: Steven K. Libutti, Senior Investigator, Surgery Branch, National Cancer Institute Building 10, Room 2B07, 10 Center Drive, Bethesda, MD 20892. E-mail: Steven_Libutti{at}nih.gov
5 Internet address: http://nciarray.nci.nih.gov. ![]()
Received 12/ 5/03. Revised 2/ 1/04. Accepted 2/ 9/04.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Eszlinger, K. Krohn, S. Hauptmann, H. Dralle, T. J. Giordano, and R. Paschke Perspectives for Improved and More Accurate Classification of Thyroid Epithelial Tumors J. Clin. Endocrinol. Metab., September 1, 2008; 93(9): 3286 - 3294. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. B. Prasad, H. Somervell, R. P. Tufano, A. P.B. Dackiw, M. R. Marohn, J. A. Califano, Y. Wang, W. H. Westra, D. P. Clark, C. B. Umbricht, et al. Identification of Genes Differentially Expressed in Benign versus Malignant Thyroid Tumors Clin. Cancer Res., June 1, 2008; 14(11): 3327 - 3337. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Durand, C. Ferraro-Peyret, S. Selmi-Ruby, C. Paulin, M. El Atifi, F. Berger, N. Berger-Dutrieux, M. Decaussin, J.-L. Peix, C. Bournaud, et al. Evaluation of Gene Expression Profiles in Thyroid Nodule Biopsy Material to Diagnose Thyroid Cancer J. Clin. Endocrinol. Metab., April 1, 2008; 93(4): 1195 - 1202. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Riesco-Eizaguirre and P. Santisteban New insights in thyroid follicular cell biology and its impact in thyroid cancer therapy Endocr. Relat. Cancer, December 1, 2007; 14(4): 957 - 977. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fujarewicz, M. Jarzab, M. Eszlinger, K. Krohn, R. Paschke, M. Oczko-Wojciechowska, M. Wiench, A. Kukulska, B. Jarzab, and A. Swierniak A multi-gene approach to differentiate papillary thyroid carcinoma from benign lesions: gene selection using support vector machines with bootstrapping Endocr. Relat. Cancer, September 1, 2007; 14(3): 809 - 826. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Eszlinger, K. Krohn, A. Kukulska, B. Jarzab, and R. Paschke Perspectives and Limitations of Microarray-Based Gene Expression Profiling of Thyroid Tumors Endocr. Rev., May 1, 2007; 28(3): 322 - 338. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. L. Griffith, A. Melck, S. J.M. Jones, and S. M. Wiseman Meta-Analysis and Meta-Review of Thyroid Cancer Gene Expression Profiling Studies Identifies Important Diagnostic Biomarkers J. Clin. Oncol., November 1, 2006; 24(31): 5043 - 5051. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Lubitz, S. K. Ugras, J. J. Kazam, B. Zhu, T. Scognamiglio, Y.-T. Chen, and T. J. Fahey III Microarray Analysis of Thyroid Nodule Fine-Needle Aspirates Accurately Classifies Benign and Malignant Lesions J. Mol. Diagn., September 1, 2006; 8(4): 490 - 498. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Cerutti, F. R.M. Latini, C. Nakabashi, R. Delcelo, V. P. Andrade, M. J. Amadei, R. M.B. Maciel, F. C. Hojaij, D. Hollis, J. Shoemaker, et al. Diagnosis of Suspicious Thyroid Nodules Using Four Protein Biomarkers. Clin. Cancer Res., June 1, 2006; 12(11): 3311 - 3318. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Eszlinger, M. Wiench, B. Jarzab, K. Krohn, M. Beck, J. Lauter, E. Gubala, K. Fujarewicz, A. Swierniak, and R. Paschke Meta- and Reanalysis of Gene Expression Profiles of Hot and Cold Thyroid Nodules and Papillary Thyroid Carcinoma for Gene Groups J. Clin. Endocrinol. Metab., May 1, 2006; 91(5): 1934 - 1942. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Wiseman, C. Baliski, R. Irvine, D. Anderson, G. Wilkins, D. Filipenko, H. Zhang, and S. Bugis Hemithyroidectomy: The Optimal Initial Surgical Approach for Individuals Undergoing Surgery for a Cytological Diagnosis of Follicular Neoplasm Ann. Surg. Oncol., March 1, 2006; 13(3): 425 - 432. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Nygaard and E. Hovig Options available for profiling small samples: a review of sample amplification technology when combined with microarray profiling Nucleic Acids Res., February 9, 2006; 34(3): 996 - 1014. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. C. G. van Staveren, D. W. Solis, L. Delys, D. Venet, M. Cappello, G. Andry, J. E. Dumont, F. Libert, V. Detours, and C. Maenhaut From The Cover: Gene expression in human thyrocytes and autonomous adenomas reveals suppression of negative feedbacks in tumorigenesis PNAS, January 10, 2006; 103(2): 413 - 418. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Weber, L. Shen, M. A. Aldred, C. D. Morrison, A. Frilling, M. Saji, F. Schuppert, C. E. Broelsch, M. D. Ringel, and C. Eng Genetic Classification of Benign and Malignant Thyroid Follicular Neoplasia Based on a Three-Gene Combination J. Clin. Endocrinol. Metab., May 1, 2005; 90(5): 2512 - 2521. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Jarzab, M. Wiench, K. Fujarewicz, K. Simek, M. Jarzab, M. Oczko-Wojciechowska, J. Wloch, A. Czarniecka, E. Chmielik, D. Lange, et al. Gene Expression Profile of Papillary Thyroid Cancer: Sources of Variability and Diagnostic Implications Cancer Res., February 15, 2005; 65(4): 1587 - 1597. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |