Skip to main content
  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

AACR logo

  • Register
  • Log in
  • My Cart
Advertisement

Main menu

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Search More

    Advanced Search

  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

User menu

  • Register
  • Log in
  • My Cart

Search

  • Advanced search
Cancer Research
Cancer Research
  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Search More

    Advanced Search

Integrated Systems and Technologies

A Framework to Select Clinically Relevant Cancer Cell Lines for Investigation by Establishing Their Molecular Similarity with Primary Human Cancers

Garrett M. Dancik, Yuanbin Ru, Charles R. Owens and Dan Theodorescu
Garrett M. Dancik
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuanbin Ru
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charles R. Owens
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dan Theodorescu
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
DOI: 10.1158/0008-5472.CAN-11-2427 Published December 2011
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

Experimental work on human cancer cell lines often does not translate to the clinic. We posit that this is because some cells undergo changes in vitro that no longer make them representative of human tumors. Here, we describe a novel alignment method named Spearman's rank correlation classification method (SRCCM) that measures similarity between cancer cell lines and human tumors via gene expression profiles, for the purpose of selecting lines that are biologically relevant. To show utility, we used SRCCM to assess similarity of 36 bladder cancer lines with 10 epithelial human tumor types (N = 1,630 samples) and with bladder tumors of different stages and grades (N = 144 samples). Although 34 of 36 lines aligned to bladder tumors rather than other histologies, only 16 of 28 had SRCCM assigned grades identical to that of their original source tumors. To evaluate the clinical relevance of this approach, we show that gene expression profiles of aligned cell lines stratify survival in an independent cohort of 87 bladder patients (HR = 3.41, log-rank P = 0.0077) whereas unaligned cell lines using original tumor grades did not. We repeated this process on 22 colorectal cell lines and found that gene expression profiles of 17 lines aligning to colorectal tumors and selected based on their similarity with 55 human tumors stratified survival in an independent cohort of 177 colorectal cancer patients (HR = 2.35, log-rank P = 0.0019). By selecting cell lines that reflect human tumors, our technique promises to improve the clinical translation of laboratory investigations in cancer. Cancer Res; 71(24); 7398–409. ©2011 AACR.

Introduction

Cell lines derived from human tumors serve as model systems that have greatly increased our understanding of cancer biology. These are routinely used to characterize molecular mechanisms of disease (1–3) and therapeutic agents (4–6). High-throughput cell line screening programs have been used to characterize the efficacy of anticancer agents (7) and to identify novel multiagent therapies (8). Nevertheless, the validity of a cell line model depends on how representative it is of the tumor type under investigation. Notably, cell lines can lose molecular features that drive clinically relevant characteristics as they adapt to culture conditions (9). An analysis of NCI-60 cell lines found that the proportion of cell lines most similar to their presumed tissue of origin could be as low as 57% and that the molecular profiles of patient tumors are more similar to normal tissues of the same type than their derived cell lines (10). Cross-contamination of cell lines is well documented and also contributes to reduce the number of relevant cancer cell line models for research (11–13).

Stage and grade are cornerstone prognostic and predictive factors of tumor aggressiveness and disease outcome. In the case of bladder cancer, high-grade cancers are more likely to metastasize than low grade ones and up to 30% of nonmuscle-invasive (NMI) tumors progress to muscle invasion (MI; ref. 14). Metastatic risk correlates with stage, with up to 50% of patients harboring muscle-invasive cancers developing metastatic disease during follow-up (15). Furthermore, tumor sensitivity to anticancer agents depends on stage and grade, with resistance more common in poorly differentiated tumors (16, 17). Last, the evaluation of chemotherapy and radiation treatments often occurs in subsets of cancer patients having tumors of a specific stage or grade (18–20). For all of these reasons, the choice of an appropriate cell line model that aligns with the specific human tumor characteristics being investigated is critical if results from cell line experiments are to have clinical relevance.

Here, we develop a novel framework for classifying or aligning human cancer cell lines as a function of their similarity to several important phenotypes of human tumors via gene expression profiles. We then use the gene expression profiles of these aligned cancer cells as biomarkers to determine which cells predict clinical outcomes in human cancer patients (i.e., are clinically relevant). As a proof of principle, we used bladder and colorectal cancer cell line panels and patient tumor cohorts to show that gene expression signatures of cancer cell lines selected on the basis of their alignments to human cancer phenotypes predict patient survival outcomes whereas the signatures of unaligned cells do not. Our practical approach can guide the selection of “clinically relevant” cell lines for use in experimental studies which may enhance the likelihood of clinical translation of laboratory investigations. Because this methodology is applicable to any cancer cell line and measurable phenotype (e.g., grade, invasiveness, and metastasis), it has broad applicability in cancer research.

Materials and Methods

The bladder and colorectal cell line panels

Gene expression profiles (CEL files) for 36 bladder cancer cell lines (BLA-36) are available from the Gene Expression Omnibus (GEO; ref. 21), accession #GSE5845 (22). The database contains expression profiles for 40 cell lines, including several cell lines of the same lineage. In our analysis, we chose to analyze unique cell lines or related cell lines that were derived in separate laboratories. As a result, we removed the cell lines FL3, SLT4, T24T (all derived from T24), and 253J-BV (derived from 253J) prior to our analysis. Information about the BLA-36 cell lines along with our alignments are provided in Supplementary Table S1. Gene expression profiles (CEL files) for 22 colorectal cell lines (CO-22) profiled in triplicate from GlaxoSmithKline are available through the National Cancer Institute's cancer Bioinformatics Grid (caBIG; ref. 23).

The Spearman's rank correlation classification method

The Spearman's rank correlation classification method (SRCCM) algorithm classifies a test sample based on the Spearman's rank correlation between its gene expression profile and the gene expression profile of a set of training samples with known phenotypes (Fig. 1A). The gene expression profile is based on a gene signature that is unique to each phenotype (Fig. 1B and C). The tissue of origin gene signature consists of 12,402 “high fidelity” probes in which expression values are consistent across formalin-fixed, paraffin-embedded (FFPE), and fresh-frozen (FF) preservation methods (24). The bladder stage and grade gene signatures have been previously published and independently validated (25). The bladder disease-specific survival (DSS) gene signature consists of the 181 probes with univariate Cox proportional hazard model log-rank P < 0.01 in the training cohort from Memorial Sloan-Kettering Cancer Center (MSKCC; ref. 26). The colorectal DSS gene signature is a previously published and validated signature consisting of 34 genes that correlate with recurrence and survival (27).

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Overview and use of the SRCCM. A, overview of the SRCCM algorithm for cell line/test sample alignment. A test sample is aligned to the phenotype with the highest mean correlation between test and training gene expression profiles using a relevant gene signature. B, alignment of a test sample with a clinical phenotype using SRCCM is based on the gene expression profiles of the training and test samples and a relevant gene signature for the desired phenotype. C, published and derived gene signatures and phenotypes used by SRCCM for tissue of origin and bladder cancer stage, grade, and DSS alignment. ⁁, signatures from ref. 24; *, stage and grade signatures from ref. 25 are different, despite the same number of genes; **, univariate Cox proportional hazards model, log-rank P < 0.01 in training cohort from ref. 26, see Materials and Methods for details.

The test sample is aligned to (i.e., classified as or assigned) the phenotype with the highest average correlation. Formally, let Math be the set of training phenotypes of interest with known gene expression profiles. In Fig. 1A, k = 2 and P = (low grade, high grade). Let t be the gene expression profile of the test sample and Math be the gene expression profile of the ith training sample of phenotype p, i = 1, …, np. Let Math be the Spearman's rank correlation between gene expression profiles t and Math, which is the standard Pearson product moment correlation calculated using the ranks of the data. Finally, let Math be the average Spearman's rank correlation between the test sample's gene expression profile and the gene expression profiles of phenotype p. Then the classification of the test sample is the phenotype p ∈ P that maximizes Math. For purposes of ranking the test samples when k = 2 phenotypes, we use a correlation score = Math, where P = (nonmuscle invasive, muscle invasive), (low grade, high grade), and (long-term survivor, short-term survivor) for stage, grade, and DSS alignment, respectively. Software for the SRCCM is available to the scientific community through the Comprehensive R Archive Network (28) as part of the correlation classification method contributed package.

Bladder stage and grade alignments

The SRCCM algorithm was used along with separate published and validated stage and grade gene signatures (25) to classify tumors as muscle invasive (stages T2–T4) or nonmuscle invasive (stages Ta–T1), and as high grade (G3–G4) or low grade (G1–G2). Each gene signature consists of 100 genes, with 54 genes common to both signatures. For each signature, we used the 84 genes common to the training (Lindgren; ref. 25), validation (MSKCC), and BLA-36 data sets, with 46 genes common to both signatures. The 2 signatures were further validated using the Chungbuk National University Hospital (CNUH) cohort (29) and the Stransky cohort of patients from the Henri Mondor Hospital in France (30) using the genes common to all data sets (72 for stage and 70 for grade). For visualization purposes, the correlation score (described above) was used to rank the cell lines, with higher scores corresponding to muscle-invasive and high-grade tumors, respectively.

Bladder and colorectal DSS alignments

We use DSS in all of our analyses with survival time equal to the time from diagnoses to time of death or last follow-up. All individuals not dead from disease are censored at the time of last follow-up or time of death from another cause. For DSS alignment, the “median cut” labeling method was used to identify long-term survivors (i.e., low-risk individuals) and short-term survivors (i.e., high-risk individuals; ref. 31). Patients without censored survival times are assigned to the long-term survivor group if the survival time exceeds the median survival time of all patients in the training cohort, and assigned to the short-term survivor group otherwise. For a patient with an unobserved survival time of T that is censored at C, we calculate Embedded Image where P(T > median survival time) and P(T > C) are estimated by the Kaplan–Meier (KM) survival curve for all patients (32). The patient is then assigned to the long-term survivor group if P(long-term survivor) > 0.5, and assigned to the short-term survivor group otherwise.

Results

A new method that aligns cell lines to human tumors based on gene expression similarity

We developed a new classification method, herein referred to as the SRCCM and first applied it to 36 bladder cancer cell lines (BLA-36; Supplementary Table S1). These lines were assigned the following tumor phenotypes: tissue of origin (from 10 epithelial cancers), bladder cancer stage (invasive vs. nonmuscle invasive), grade (low vs. high), and DSS (long-term vs. short-term survivors). SRCCM (Fig. 1A) aligns cell lines to human tumor phenotypes based on the gene expression similarity between cell lines and human tumors calculated using gene signatures, described below and in Materials and Methods, that consists of a subset of all genes profiled in both groups (cell lines and human tumors) (Fig. 1B and C). The SRCCM consists of 3 steps: (i) for each test sample, calculate the rank correlation between its gene expression profile and the gene expression profiles of each training sample with known phenotype (e.g., grade), (ii) assign the phenotype with the highest mean correlation to the test sample producing the alignment, (iii) repeat the procedure for all test samples (Fig. 1A). Therefore, each test sample (e.g., cell line in a set) is aligned to the human tumor phenotype it is most similar to, based on the similarity between gene expression profiles.

For each tumor phenotype, the SRCCM algorithm is validated on at least 1 independent test data set and then applied to the BLA-36 cell lines. For each test sample, the primary output of the SRCCM is its molecular alignment (i.e., a qualitative phenotype), whereas the secondary output is a quantification of the relative strength of 1 alignment (e.g., to high grade) to another (e.g., to low grade) by subtracting the corresponding mean correlations, producing a correlation score. For stage, grade, and DSS alignments, a higher correlation score indicates greater similarity with muscle invasive, high grade, and short-term survivors, respectively (see Materials and Methods).

Our SRCCM algorithm is based on the classification method of Wang and colleagues (33) but is modified in 3 important ways. The first modification is that we use the Spearman rank correlation rather than Pearson product moment correlation to measure gene expression similarity. Our choice of a rank-based classifier is motivated by the fact that the rank value of a probe's expression level within a sample is insensitive to standard within-sample preprocessing methods and is not directly affected by other samples on the array, whereas the expression value is highly dependent on the preprocessing and normalization methods used and can be affected by additional probes and samples (e.g., when robust multichip average normalization is used). The second modification is that instead of measuring similarity based on a sample's global gene expression profile (i.e., the expression levels of all common microarray probes) as used previously (33), we measure similarity based on gene signatures appropriate for the microarray data sets under study and the phenotype of interest (Fig. 1C). For tissue alignment, because bladder samples (N = 350) were both FFPE and FF (Supplementary Table S2), the gene signature used consists of 12,402 probes in which expression values were preserved across FF and FFPE (24). For stage and grade alignment, 2 previously published and independently validated gene signatures are used (ref. 25; Supplementary Tables S3 and S4). For DSS alignment, the gene signature consists of the 181 probes that significantly correlate with survival in the training data set (MSKCC; Supplementary Table S5) with univariate Cox proportional hazards model log-rank P < 0.01. The final modification is that for each phenotype, we ensure that the SRCCM molecular alignment is accurate through validation on at least 1 independent data set. References for all data sets are provided in Supplementary Tables S1 to S5, and data sets are processed as described in Supplementary Materials. The SRCCM algorithm is comparable with common classification methods, including support vector machines, nearest centroid classification, and k-nearest neighbor, but carries out slightly better in terms of overall accuracy and consistency across multiple data sets (described in Supplemental Information and Supplementary Tables S6 and S7).

Molecular alignment of human bladder cell lines as a function of tumor tissue of origin

We first determined whether the BLA-36 cell lines had gene expression profiles that were more similar to bladder cancer or other common carcinomas of cervical, prostate, ovarian, breast, thyroid, kidney, colorectal, and lung origin. These tumor types were selected based on the following criteria (i) epithelial histology, (ii) availability of at least 1 public expression microarray data set containing 30 or more tumor samples, and (iii) profiled on an Affymetrix platform, the platform used for BLA-36 profiling. Because distinct molecular profiles for lung cancer subtypes have been well characterized (34), we considered lung adenocarcinoma (adeno) and lung squamous cell carcinoma (scc) separately. A total of 3,320 tumor samples were used in this analysis with 1,630 used in training and 1,690 in validation of the SRCCM algorithm (Supplementary Table S2). When applied on the 1,690 validation samples from 10 tumor types, the SRCCM algorithm had a mean accuracy of 89%, with tumor-specific accuracy ranging from 77% for cervical tumors to 99% for both colorectal and prostate tumors (Fig. 2A). The overall accuracy for bladder cancer was 91% (Fig. 2A). Interestingly, the majority of misaligned samples are aligned with lung (adeno), including 12% of all ovary samples and 11% of all lung (scc) samples (Fig. 2B). The majority of misaligned cervical samples are aligned with bladder, which comprise 15% of all cervical samples. Importantly, when we applied the SRCCM to the BLA-36 panel, 34 of 36 (94%) cells align with tumors of bladder origin. Of the 2 not predicted to be of bladder tumor origin, one (CubIII) is predicted to be colorectal and the other (SW1710) ovarian (Fig. 2C).

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Independent validation and BLA-36 cell line alignment to tissue of origin using the SRCCM algorithm. Test samples and BLA-36 cell lines are aligned to 10 epithelial cancers from a training data set including bladder, breast, cervix, colorectum, kidney, lung adeno, lung scc, ovary, prostate, and thyroid samples (N = 1,630). A, tissue-specific accuracy of SRCCM alignment algorithm on independent data sets (N = 1,690). B, confusion matrix for independent validation presented as a heatmap, with green indicating correct alignment and red indicating incorrect alignment. For each tissue type (i.e., row of the matrix), the proportion of samples aligned with each tissue type is reported. C, BLA-36 tissue of origin alignment heatmap. For each cell line, the color represents the average correlation with each tissue type, with red indicating strong positive correlation and green indicating weak positive correlation. All cell lines are most strongly correlated (i.e., aligned) with bladder, with the exception of CubIII and SW1710 (arrows) which are aligned with colorectum and ovary, respectively.

Molecular alignment of human bladder cell lines as a function of tumor stage and grade

We next used the SRCCM to align the 36 bladder cancer cell lines as a function of human bladder tumor stage and grade, using previously published and validated stage and grade gene signatures (ref. 25; Fig. 1C). Stage alignment is to nonmuscle- or muscle-invasive (Ta–T1 or T2–T4) tumors and grade alignment is to high- or low-grade (G3–G4 or G1–G2) tumors. For both stage and grade alignments, we use the Lindgren cohort (25) for training and leave-one-out cross-validation (LOOCV) and use 3 additional data sets for independent validation: patient profiles from the MSKCC (26), the CNUH (29), and the Stransky cohort of patients from the Henri Mondor Hospital in France (ref. 30; Supplementary Tables S3 and S4). The SRCCM carries out well in LOOCV and independent validation for stage and grade (Table 1). The high area under the curve (AUC) in LOOCV (0.93 ∼ 0.94) and independent validation (0.81 ∼ 0.92) indicates the SRCCM algorithm accurately separates stage and grade subgroups in tumor samples. When the SRCCM was applied to the 36 cell lines, 20 aligned with muscle-invasive tumors and 16 with nonmuscle-invasive tumors (Fig. 3A). For grade, 22 and 14 aligned with high-grade and low-grade tumors, respectively (Fig. 3B). The rankings of the cell lines in stage and grade alignment are highly correlated (Spearman r = 0.89, P < 0.0001), with 32 of 36 cell lines resembling the muscle-invasive high-grade and nonmuscle-invasive low-grade tumor types (Fig. 3C) commonly seen in patients (25, 35, 36).

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

A and B, stage and grade alignment heatmaps. Cell lines are ranked by their correlation scores (see Materials and Methods) from high (red) to low (green) and classified as either muscle invasive (T2–T4) or nonmuscle invasive (Ta–T1) for stage and high (G3–G4) or low (G1–G2) grade, separated by a blue horizontal line. The heatmap also contains the documented gene mutations according to the COSMIC database (52), with purple, yellow, and black indicating known mutation, wild-type, and unknown mutation status, respectively. The histograms below the heatmaps show correlations of mutations with each alignment. P values of Fisher exact tests: *, 0.05 ≤ P < 0.1; **, P < 0.05. C, plot of correlation scores for SRCCM assigned stage and grade. Red points denote cell lines aligned with high-grade muscle-invasive tumors, green points denote cell lines aligned with low-grade nonmuscle-invasive tumors, and all other cell lines are plotted in black.

View this table:
  • View inline
  • View popup
Table 1.

Performance of stage and grade classification using the SRCCM alignment algorithm

To determine the value of the stage and grade realignment, we evaluated the correlation between SRCCM assigned stages and grades and the status of several common gene mutations (see Supplementary Materials) known to be associated with biological phenotypes in human bladder cancer (Fig. 3A and B). For assigned cell line stages, we observed positive correlations with TP53 (r = 0.671), RB1 (r = 0.408), PTEN (r = 0.354), and KRAS (r = 0.277) mutants, and negative correlations with CDKN2A (r = −0.408) and FGFR3 (r = −0.181) mutants, consistent with the role of the first 3 in aggressive disease and the association of the latter 2 with a nonmuscle-invasive phenotype (37–39). Notably, the correlation with TP53 reaches a marginal significance (Fisher exact P = 0.06), despite the fact that TP53 mutation status is known for just 11 cell lines. For assigned cell line grades, we observe positive correlations with TP53 (r = 0.810, Fisher exact P = 0.024), RB1 (r = 0.667, Fisher exact P = 0.076), PTEN (r = 0.304), and KRAS (r = 0.207) mutants, and negative correlations with CDKN2A (r = −0.310) and FGFR3 (r = −0.305) mutants.

Cell lines assigned their original tumor grades do not correlate with survival but correlation is restored using molecularly aligned cell line grades

We collected stage and grade information for the tumor samples used to derive the BLA-36 lines (Supplementary Table S1) and compared the original tumor information with the SRCCM assigned grades described above. Only 8 cell lines had stage information from their original tumors, whereas 28 cell lines had grade information so only the latter was used in the analysis below. Of the 9 cell lines originating from low-grade tumors, 4 (44%) aligned with low-grade tumors in the Lindgren bladder tumor set (N = 144; Supplementary Table S4) and 12 (63%) of the 19 cell lines from high-grade tumors are aligned with high-grade lesions. Strikingly, 43% (12 of /28) of the cell lines do not resemble their tumor origins with respect to grade gene signatures.

Because of the disagreement between BLA-36 original tumor grade and SRCCM assigned grade and because grade is a strong indicator of patient survival (40), we next investigated the clinical relevance of this alignment (Fig. 4A). Specifically, we asked whether the grades of the original tumors from which the cell lines were derived or the grades assigned via SRCCM were more predictive of DSS. To evaluate this, we aligned 87 tumor samples from an independent patient cohort (MSKCC; Supplementary Table S4) to either a patient data set (Lindgren; Supplementary Table S4) or the BLA-36 cell lines, using their original or reassigned grades, and then generated KM DSS curves for the assigned low- and high-grade samples. In the MSKCC cohort, patient grade is a very strong predictor of survival, producing KM survival curves with a HR of 11.29 and a log-rank P = 0.0026 (Fig. 4B). Use of the Lindgren cohort establishes the efficacy of the SRCCM algorithm to predict MSKCC grade and then generate KM survival curves because this resulted in excellent stratification (HR = 5.62, log-rank P = 0.0013, Fig. 4C). Using the grades of the original tumors from where BLA-36 was derived, the relationship between grade and survival is lost (HR = 0.91, log-rank P = 0.79; Fig. 4D). In contrast, when the MSKCC samples are aligned with SRCCM assigned cell line grades, the relationship between grade and survival is restored (HR = 3.41, log-rank P = 0.0077; Fig. 4E). Together, these results indicate that cell lines no longer resemble the tumors they were derived from and cell lines selected by their original grades no longer correlate with survival. In contrast, the SRCCM is effective in identifying lines that correlate with clinical outcome.

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Original cell line grades no longer correlate with DSS but correlation is restored via SRCCM alignment to patient tumors. A, overview of the methodology used. MSKCC tumors are aligned to the grades of Lindgren tumors or BLA-36 cell lines using the grades of the tumors they were derived from (original grades) or the assigned grades from alignment with the Lindgren patients (aligned grades). We report the accuracy of MSKCC grade assignment. We then generate KM survival curves based on the original patient grades in MSKCC (positive control; B), MSKCC grades based on alignment to Lindgren patient tumors (second positive control; C), MSKCC grades based on alignment to BLA-36 (original grades; D), and MSKCC grades based on alignment to BLA-36 (aligned grades; E).

The clinical impact of selecting human bladder cancer cell lines based on their similarity with human tumors

The overarching objective of this work is to identify the most clinically relevant cancer cell lines, those most strongly correlated with disease outcome. Above, we noted that the aligned grades were more clinically significant than the actual grades of the original tumors from which the cell lines were derived. To define the most clinically relevant cell lines, here, we first align the BLA-36 cell lines directly with DSS, using the MSKCC cohort as the training data set. The MSKCC cohort contains 87 tumor samples and has a median survival time of 26.0 months and a 5-year DSS rate of 56.3% (Supplementary Table S5). Patients were assigned to risk groups (i.e., long- and short-term survivors) using the “median cut” labeling method (31) as described in Materials and Methods. For SRCCM analysis, we use a gene signature consisting of all probes correlated with DSS in MSKCC (univariate Cox proportional hazards model, log-rank P < 0.01), and applied the SRCCM algorithm to align the BLA-36 cell lines to long- and short-term survivors in the MSKCC cohort. In an LOOCV on MSKCC, the long- and short-term survivors produce survival curves with a HR of 3.75 and a log-rank P = 0.00011 (Fig. 5A). We then used the SRCCM to align the BLA-36 cell lines with long- and short-term survivors in MSKCC and ranked the cell lines according to their correlation score (Fig. 5B).

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

DSS in BLA-36 and CO-22 as a function of long- and short-term survivors determined by “median cut” labeling (see Materials and Methods). A, KM survival curves for bladder cancer patients in the training MSKCC data set obtained through SRCCM alignment and LOOCV. B, DSS alignment heatmap for BLA-36 cell lines. Cell lines are ranked from high (red) to low (green) by their correlation scores (see Materials and Methods) and classified as either long-term or short-term survivors, separated by a horizontal blue line, based on their alignment to long- and short-term survivors in MSKCC. C, KM survival curves for colorectal cancer patients in the training VMC data set obtained through SRCCM alignment and LOOCV. D, DSS alignment heatmap for CO-22 cell lines. Cell lines are ranked using the same method as in B, based on their alignment to long- and short-term survivors in VMC.

Next, we compared this ranking to the tumor of origin alignment (Fig. 2C) and to the grade ranking (Fig. 3B) that was found clinically relevant (Fig. 4E) and selected only cells aligning with bladder tumors that were either in high-grade/short-term survivors or low-grade/long-term survivor groups. This provides us with 15 and 4 cell lines in each group, respectively (Fig. 6A). We then evaluated the clinical relevance of this selection by using it to align an independent cohort of patients from CNUH (N = 129, 5-year DSS rate = 80.5%; ref. 29; Supplementary Table S5). Patients aligned to these high-grade/short-term and low-grade/long-term survivors produced survival curves with a HR of 4.16 and a log-rank P = 0.0040 (Fig. 6B) indicating the selection of cell lines based on their tissue of origin, grade, and DSS alignments has biological significance.

Figure 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 6.

Selection and validation of clinically relevant BLA-36 and CO-22 cell lines. A, SRCCM alignment of BLA-36 cell lines to tissue of origin, grade, and DSS. Heatmap of the clinically relevant BLA-36 cell lines aligning with bladder, high grade, and short-term survivors or bladder, low grade, and long-term survivors. The cell lines are ranked by their DSS alignment score. B, validation of the clinical relevance of the selected cell lines in an independent data set. KM survival curves are generated for patients in CNUH following alignment to the clinically relevant cell lines in A. C, SRCCM alignment of CO-22 cell lines to tissue of origin and DSS. Heatmap of the clinically relevant CO-22 cell lines having all replicates aligning with colorectum and no ambiguous DSS alignments. The cell lines are ranked by their average DSS alignment score. D, validation of the clinical relevance of the selected cell lines in an independent data set. KM survival curves are generated for patients in MCC following alignment to the clinically relevant cell lines in C.

Selecting human colorectal cancer cell lines based on their similarity to human tumors leads to the identification of clinically relevant lines

To evaluate the generalizability of our SRCCM algorithm and cell line selection concept, we analyzed a panel of 22 colorectal cell lines (CO-22) profiled by GlaxoSmithKline. The CO-22 cell lines were first aligned to tissue of origin, in which the SRCCM alignment accuracy for colorectal tumors in the independent test data sets was 99% (Fig. 2A). All CO-22 cell line samples aligned with colorectum with the exception of COLO-320HSR, in which all three replicates aligned with uterus, and RKO-E6, in which 1 replicate aligned with bladder (Supplementary Fig. S1A).

We next aligned the CO-22 cell lines directly to DSS, using a set of 55 patients with colorectal cancer from the Vanderbilt Medical Center as the training data set (VMC cohort; ref. 27). The VMC cohort has a median survival time of 38.0 months and a 5-year DSS rate of 73.4% (Supplementary Table S5). Patients were assigned to risk groups using the “median cut” labeling method as was done for the BLA-36 cell lines (see Materials and Methods). For SRCCM analysis, we used a previously published and validated gene signature consisting of 34 genes that correlates with recurrence and survival (27). In a LOOCV on VMC, the long- and short-term survivors produce survival curves with a HR of 8.00 and a log-rank P = 0.0225 (Fig. 5C). We then used the SRCCM to align the CO-22 cell lines with long- and short-term survivors in VMC, ranked the cell lines according to their correlation score, and used the median correlation score to separate long- and short-term survivors (Fig. 5D).

For cell line selection, a cell line was deemed clinically relevant if all of its replicates aligned with colorectum and all replicates had the same DSS alignment (Fig. 6C). Cell lines were removed if they had ambiguous DSS alignments defined as alignments with one or more replicates aligned with short-term survivors and at least one aligned with long-term survivors. Our selection resulted in the removal of the following lines: COLO-320HSR, which aligned with uterus; LS1034, SW1463, and SW948, which had ambiguous DSS alignments, and RKO-E6, which had an ambiguous DSS alignment and also had one replicate aligning with bladder.

The clinical relevance of the remaining 17 cell lines was then evaluated in an independent cohort of patients from the Moffitt Cancer Center (MCC; N = 177, 5-year DSS rate = 67.0%; ref. 27, Supplementary Table S5). Alignment of the MCC patients to the selected colorectal cell lines produces survival curves with a HR of 2.35 and a log-rank P = 0.0019, establishing their clinical relevance (Fig. 6D). Importantly, when all 22 colorectal cell lines were used in this analysis (i.e., without the selection step), the HR falls to 1.69 and the separation of survival curves is no longer statistically significant (log-rank P = 0.054; Supplementary Fig. S1B).

Discussion

For a cell line model to have clinical utility it must be representative of the human tumor of interest. Importantly, cell line models selected based on the characteristics of the tumors from which they are derived, their xenograft behavior, or their technical convenience may not be clinically relevant and therefore lack clinical translatability. In this article, we describe a novel framework for the selection of the most clinically relevant cell lines. At the heart of this approach is the SRCCM alignment algorithm, with software we have made available to the scientific community, and which is applicable to any cancer cell line. Although we have selected clinically relevant cell lines for bladder and colorectal cancers, we caution that because gene expression profiles vary across culture conditions (41, 42) and with cell passage number, the cell lines selected here as most clinically relevant may not be so in another laboratory with the same lines. Instead, we recommend SRCCM alignment and cell line selection using gene expression profiles obtained in the specific laboratory. In time, we will determine whether this possible variation in the repertoire of the “most clinically relevant” lines does indeed occur.

In developing SRCCM, a variety of classification methods for aligning cell lines with clinical phenotypes could have been implemented, including standard machine learning algorithms such as nearest neighbor and support vector machines (43, 44). However, the SRCCM algorithm was developed after carefully considering its ease of interpretation and its applicability. Correlation is an intuitive measure of similarity, while the specific use of Spearman's correlation ensures broad applicability to additional data sets and cell lines, as the choice of data normalization and processing methods will have relatively little impact on the output of the algorithm.

Independent validation of SRCCM was a critical feature of our work. Because classifiers developed from high dimensional microarray data sets are prone to overfitting (45), and technical batch effects can correlate with biological outcomes of interest (46), independent validation is essential if reported classification accuracies are expected to be representative of performance in additional data sets. We validated the tissue of origin, stage, and grade SRCCM alignments on multiple independent data sets and achieved excellent overall accuracies. We validated the DSS SRCCM alignments on an independent data set by first aligning the BLA-36 cell lines to long- and short-term survivors in the MSKCC training cohort and then aligning 129 patients in an independent cohort (CNUH) to the 19 clinically relevant cell lines to produce KM survival curves with clear stratification.

The SRCCM tissue alignment finds that 34 of 36 (94%) of the bladder cell lines and 20 of 22 (91%) of the colorectal cell lines resemble their tissue of origin more closely than the 9 additional carcinomas we considered. This is reassuring, as a previous tissue classification of the NCI-60 cell line panel suggested that as few as 57% of cell lines most closely resembled their presumed tissue of origin (10). In general, the BLA-36 cell lines correlate more strongly with tissue types in which squamous cell histologies are common (e.g., cervix and ovary) than with tissues in which squamous cell histologies are rare (e.g., breast, thyroid, prostate, and kidney; Fig. 2C), suggesting that it may be relatively difficult to distinguish between tissue types if their histologies are similar. In the independent validation on over 300 bladder samples, 9% were misaligned, and the fact that 2 of 36 (6%) of the cell lines are misaligned in the BLA-36 panel is within the expected margin of error. Further analysis of these 2 lines, CubIII and SW1710, is required to determine whether these are of bladder origin.

Most striking is our finding that the grade alignment of the BLA-36 cell lines is consistent with only 44% of the low-grade tumors and 63% of the high-grade tumors that the cell lines were derived from. One possible explanation for these results is that the alignment is inaccurate. However, our findings support the validity of the alignment and loss of relevance of the original tumor grade. For example, we found that TP53 and RB1 mutations are associated with aligned high-grade muscle-invasive tumors whereas FGFR3 mutations are associated with aligned low-grade, nonmuscle- invasive tumors, which is consistent with the prevalence of TP53, RB1, and FGFR3 mutations in patients (37–39). Because tumor grade is an important clinical parameter that correlates with both stage and survival in patients (40, 47), for a cell line model to have clinical relevance, it should properly represent these clinical phenotypes. Importantly, we have found that selection of cell line models based on clinical information from their original tumors will fail to properly represent these phenotypes for the cell lines we analyzed. In contrast, once aligned using SRCCM, the new cell line grades relate to clinical outcomes (Fig. 4D and E).

Cell line models have also been selected based on their xenograft behavior. However, animal models do not always recapitulate human physiology and behavior (48, 49). In a meta-analysis looking at 39 pharmacologic agents with both xenograft data and results from phase II clinical trials, Johnson and colleagues found that out of 6 xenograft histologies, only lung xenografts had the ability to predict clinical activity in the same human cancer histology, though interestingly several were predictive of clinical activity in other histologies (50). In contrast, xenografts derived from aligned cell lines validated for their ability to reflect human tumor biology, using alignment and selection process shown here (Fig. 6), may provide a readout of drug activity in xenografts that parallel that seen in human tumors.

In conclusion, we describe a framework for aligning cell lines with the clinicopathologic phenotypes of human tumors such as tissue of origin, stage, grade, and DSS that can be used to select the most clinically relevant cell line models. Our technique has the potential to reduce the discrepancy often encountered between in vitro and in vivo laboratory data and patient interventions. Although the approach is shown in bladder and colorectal cancer cells, SRCCM provides a general framework that is applicable to all cancer types and is freely available to the scientific community.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Grant Support

This work was supported in part by NIH grants CA075115 and CA143791 to D. Theodorescu.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Footnotes

  • Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).

  • Received July 22, 2011.
  • Revision received October 5, 2011.
  • Accepted October 14, 2011.
  • ©2011 American Association for Cancer Research.

References

  1. 1.↵
    1. Marsit CJ,
    2. Houseman EA,
    3. Christensen BC,
    4. Gagne L,
    5. Wrensch MR,
    6. Nelson HH,
    7. et al.
    Identification of methylated genes associated with aggressive bladder cancer. PLoS One 2010;5:e12334.
    OpenUrlCrossRefPubMed
  2. 2.↵
    1. Inoue K,
    2. Slaton JW,
    3. Kim SJ,
    4. Perrotte P,
    5. Eve BY,
    6. Bar-Eli M,
    7. et al.
    Interleukin 8 expression regulates tumorigenicity and metastasis in human bladder cancer. Cancer Res 2000;60:2290–9.
    OpenUrlAbstract/FREE Full Text
  3. 3.↵
    1. Nutt JE,
    2. Mellon JK,
    3. Qureshi K,
    4. Lunec J
    . Matrix metalloproteinase-1 is induced by epidermal growth factor in human bladder tumour cell lines and is detectable in urine of patients with bladder tumours. Brit J Cancer 1998;78:215–20.
    OpenUrlPubMed
  4. 4.↵
    1. Saitoh H,
    2. Mori K,
    3. Kudoh S,
    4. Itoh H,
    5. Takahashi N,
    6. Suzuki T
    . BCG effects on telomerase activity in bladder cancer cell lines. Int J Clin Oncol 2002;7:165–70.
    OpenUrlCrossRefPubMed
  5. 5.↵
    1. Zhang YB,
    2. Niu HT,
    3. Chang JW,
    4. Dong GL,
    5. Ma XB
    . EZH2 silencing by RNA interference inhibits proliferation in bladder cancer cell lines. Eur J Cancer Care (Engl) 2011;20:106–12.
    OpenUrlCrossRefPubMed
  6. 6.↵
    1. Krause-Heuer AM,
    2. Grünert R,
    3. Kühne S,
    4. Buczkowska M,
    5. Wheate NJ,
    6. Le Pevelen DD,
    7. et al.
    Studies of the mechanism of action of platinum(II) complexes with potent cytotoxicity in human cancer cells. J Med Chem 2009;52:5474–84.
    OpenUrlCrossRefPubMed
  7. 7.↵
    1. Sharma SV,
    2. Haber DA,
    3. Settleman J
    . Cell line-based platforms to evaluate the therapeuticefficacy of candidate anticancer agents. Nat Rev Cancer 2010;10:241–53.
    OpenUrlCrossRefPubMed
  8. 8.↵
    1. Borisy AA,
    2. Elliott PJ,
    3. Hurst NW,
    4. Lee MS,
    5. Lehar J,
    6. Price ER,
    7. et al.
    Systematic discovery of multicomponent therapeutics. Proc Natl Acad Sci U S A 2003;100:7977–82.
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    1. Masters JR
    . Human cancer cell lines: fact and fantasy. Nat Rev Mol Cell Biol 2000;1:233–6.
    OpenUrlCrossRefPubMed
  10. 10.↵
    1. Sandberg R,
    2. Ernberg I
    . Assessment of tumor characteristic gene expression in cell lines using a tissue similarity index (TSI). Proc Natl Acad Sci U S A 2005;102:2052–7.
    OpenUrlAbstract/FREE Full Text
  11. 11.↵
    Cell line misidentification: the beginning of the end. Nat Rev Cancer 2010;10:441–8.
    OpenUrlCrossRefPubMed
  12. 12.↵
    1. MacLeod RAF,
    2. Dirks WG,
    3. Matsuo Y,
    4. Kaufmann M,
    5. Milch H,
    6. Drexler HG
    . Widespread intraspecies cross-contamination of human tumor cell lines arising at source. Int J Cancer 1999;83:555–63.
    OpenUrlCrossRefPubMed
  13. 13.↵
    1. Drexler HG,
    2. Dirks WG,
    3. Matsuo Y,
    4. MacLeod RA
    . False leukemia-lymphoma cell lines: an update on over 500 cell lines. Leukemia 2003;17:416–26.
    OpenUrlCrossRefPubMed
  14. 14.↵
    1. Herr HW,
    2. Dotan Z,
    3. Donat SM,
    4. Bajorin DF
    . Defining optimal therapy for muscle invasive bladder cancer. J Urol 2007;177:437–43.
    OpenUrlCrossRefPubMed
  15. 15.↵
    1. Stein JP,
    2. Lieskovsky G,
    3. Cote R,
    4. Groshen S,
    5. Feng AC,
    6. Boyd S,
    7. et al.
    Radical cystectomy in the treatment of invasive bladder cancer: long-term results in 1,054 patients. J Clin Oncol 2001;19:666–75.
    OpenUrlAbstract/FREE Full Text
  16. 16.↵
    1. Kemp CJ,
    2. Sun SL,
    3. Gurley KE
    . p53 induction and apoptosis in response to radio- and chemotherapy in vivo is tumor-type-dependent. Cancer Res 2001;61:327–32.
    OpenUrlAbstract/FREE Full Text
  17. 17.↵
    1. Mizutani Y,
    2. Wada H,
    3. Fukushima M,
    4. Yoshida O,
    5. Ukimura O,
    6. Kawauchi A,
    7. et al.
    The significance of dihydropyrimidine dehydrogenase (DPD) activity in bladder cancer. Eur J Cancer 2001;37:569–75.
    OpenUrlCrossRefPubMed
  18. 18.↵
    1. Grossman HB,
    2. Natale RB,
    3. Tangen CM,
    4. Speights VO,
    5. Vogelzang NJ,
    6. Trump DL,
    7. et al.
    Neoadjuvant chemotherapy plus cystectomy compared with cystectomy alone for locally advanced bladder cancer. N Engl J Med 2003;349:859–66.
    OpenUrlCrossRefPubMed
  19. 19.↵
    1. Sylvester RJ,
    2. Oosterlinck W,
    3. van der Meijden AP
    . A single immediate postoperative instillation of chemotherapy decreases the risk of recurrence in patients with stage Ta T1 bladder cancer: a meta-analysis of published results of randomized clinical trials. J Urol 2004;171:2186–90, quiz 435.
    OpenUrlCrossRefPubMed
  20. 20.↵
    1. von der Maase H,
    2. Hansen SW,
    3. Roberts JT,
    4. Dogliotti L,
    5. Oliver T,
    6. Moore MJ,
    7. et al.
    Gemcitabine and cisplatin versus methotrexate, vinblastine, doxorubicin, and cisplatin in advanced or metastatic bladder cancer: results of a large, randomized, multinational, multicenter, phase III study. J Clin Oncol 2000;18:3068–77.
    OpenUrlAbstract/FREE Full Text
  21. 21.↵
    1. Barrett T,
    2. Troup DB,
    3. Wilhite SE,
    4. Ledoux P,
    5. Evangelista C,
    6. Kim IF,
    7. et al.
    NCBI GEO: archive for functional genomics data sets-10 years on. Nucleic Acids Research 2011;39:D1005–D10.
    OpenUrlAbstract/FREE Full Text
  22. 22.↵
    1. Lee JK,
    2. Havaleshko DM,
    3. Cho H,
    4. Weinstein JN,
    5. Kaldjian EP,
    6. Karpovich J,
    7. et al.
    A strategy for predicting the chemosensitivity of human cancers and its application to drug discovery. Proc Natl Acad Sci U S A 2007;104:13086–91.
    OpenUrlAbstract/FREE Full Text
  23. 23.↵
    1. Greshock J,
    2. Bachman KE,
    3. Degenhardt YY,
    4. Jing J,
    5. Wen YH,
    6. Eastman S,
    7. et al.
    Molecular target class is predictive of in vitro response profile. Cancer Res 2010;70:3677–86.
    OpenUrlAbstract/FREE Full Text
  24. 24.↵
    1. Smith SC,
    2. Baras AS,
    3. Dancik G,
    4. Ru Y,
    5. Ding KF,
    6. Moskaluk CA,
    7. et al.
    A 20-gene model for molecular nodal staging of bladder cancer: development and prospective assessment. Lancet Oncol 2011;12:137–43.
    OpenUrlCrossRefPubMed
  25. 25.↵
    1. Lindgren D,
    2. Frigyesi A,
    3. Gudjonsson S,
    4. Sjödahl G,
    5. Hallden C,
    6. Chebil G,
    7. et al.
    Combined gene expression and genomic profiling define two intrinsic molecular subtypes of urothelial carcinoma and gene signatures for molecular grading and outcome. Cancer Res 2010;70:3463–72.
    OpenUrlAbstract/FREE Full Text
  26. 26.↵
    1. Sanchez-Carbayo M,
    2. Socci ND,
    3. Lozano J,
    4. Saint F,
    5. Cordon-Cardo C
    . Defining molecular profiles of poor outcome in patients with invasive bladder cancer using oligonucleotide microarrays. J Clin Oncol 2006;24:778–89.
    OpenUrlAbstract/FREE Full Text
  27. 27.↵
    1. Smith JJ,
    2. Deane NG,
    3. Wu F,
    4. Merchant NB,
    5. Zhang B,
    6. Jiang A,
    7. et al.
    Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. Gastroenterology 2010;138:958–68.
    OpenUrlCrossRefPubMed
  28. 28.↵
    R: A Language and Environment for Statistical Computing [Internet]. Vienna (AU): R Development Core Team [cited 2010 July 1]. Available from: http://www.R-project.org.
  29. 29.↵
    1. Kim WJ,
    2. Kim EJ,
    3. Kim SK,
    4. Kim YJ,
    5. Ha YS,
    6. Jeong P,
    7. et al.
    Predictive value of progression-related gene classifier in primary non-muscle invasive bladder cancer. Mol Cancer 2010;9:3.
    OpenUrlCrossRefPubMed
  30. 30.↵
    1. Stransky N,
    2. Vallot C,
    3. Reyal F,
    4. Bernard-Pierrot I,
    5. de Medina SG,
    6. Segraves R,
    7. et al.
    Regional copy number-independent deregulation of transcription in cancer. Nat Genet 2006;38:1386–96.
    OpenUrlCrossRefPubMed
  31. 31.↵
    1. Bair E,
    2. Tibshirani R
    . Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol 2004;2:E108.
    OpenUrlCrossRefPubMed
  32. 32.↵
    1. Therneau TM,
    2. Grambsch PM
    . Modeling survival data: extending the Cox model. New York: Springer; 2000.
  33. 33.↵
    1. Wang H,
    2. Huang S,
    3. Shou J,
    4. Su EW,
    5. Onyia JE,
    6. Liao B,
    7. et al.
    Comparative analysis and integrative classification of NCI60 cell lines and primary tumors using gene expression profiling data. BMC Genomics 2006;7:166.
    OpenUrlCrossRefPubMed
  34. 34.↵
    1. Garber ME,
    2. Troyanskaya OG,
    3. Schluens K,
    4. Petersen S,
    5. Thaesler Z,
    6. Pacyna-Gengelbach M,
    7. et al.
    Diversity of gene expression in adenocarcinoma of the lung. Proc Natl Acad Sci U S A 2001;98:13784–9.
    OpenUrlAbstract/FREE Full Text
  35. 35.↵
    1. Dyrskjøt L,
    2. Thykjaer T,
    3. Kruhøffer M,
    4. Jensen JL,
    5. Marcussen N,
    6. Hamilton-Dutoit S,
    7. et al.
    Identifying distinct classes of bladder carcinoma using microarrays. Nat Genet 2003;33:90–6.
    OpenUrlCrossRefPubMed
  36. 36.↵
    1. Blaveri E,
    2. Simko JP,
    3. Korkola JE,
    4. Brewer JL,
    5. Baehner F,
    6. Mehta K,
    7. et al.
    Bladder cancer outcome and subtype classification by gene expression. Clin Cancer Res 2005;11:4044–55.
    OpenUrlAbstract/FREE Full Text
  37. 37.↵
    1. Abdel-Fattah R,
    2. Challen C,
    3. Griffiths TR,
    4. Robinson MC,
    5. Neal DE,
    6. Lunec J
    . Alterations of TP53 in microdissected transitional cell carcinoma of the human urinary bladder: high frequency of TP53 accumulation in the absence of detected mutations is associated with poor prognosis. Br J Cancer 1998;77:2230–8.
    OpenUrlPubMed
  38. 38.↵
    1. Bakkar AA,
    2. Wallerand H,
    3. Radvanyi F,
    4. Lahaye JB,
    5. Pissard S,
    6. Lecerf L,
    7. et al.
    FGFR3 and TP53 gene mutations define two distinct pathways in urothelial cell carcinoma of the bladder. Cancer Res 2003;63:8108–12.
    OpenUrlAbstract/FREE Full Text
  39. 39.↵
    1. Wu XR
    . Urothelial tumorigenesis: a tale of divergent pathways. Nat Rev Cancer 2005;5:713–25.
    OpenUrlCrossRefPubMed
  40. 40.↵
    1. Pagliaro LC,
    2. Sharma P
    . Review of metastatic bladder cancer. Minerva Urol Nefrol 2006;58:53–71.
    OpenUrlPubMed
  41. 41.↵
    1. Kissel T,
    2. Behrens I
    . Do cell culture conditions influence the carrier-mediated transport of peptides in Caco-2 cell monolayers? Eur J Pharm Sci 2003;19:433–42.
    OpenUrlCrossRefPubMed
  42. 42.↵
    1. Lin CS,
    2. Lin GT,
    3. Chow S,
    4. Lin J,
    5. Wang G,
    6. Lue TF
    . Effect of cell passage and density on protein kinase G expression and activation in vascular smooth muscle cells. J Cell Biochem 2004;92:104–12.
    OpenUrlCrossRefPubMed
  43. 43.↵
    1. Byvatov E,
    2. Schneider G
    . Support vector machine applications in bioinformatics. Appl Bioinformatics 2003;2:67–77.
    OpenUrlPubMed
  44. 44.↵
    1. Thamaraiselvi G,
    2. Kaliammal A
    . Data mining: concepts and techniques. SRELS Journal of Information Management 2004;41:339–48.
    OpenUrl
  45. 45.↵
    1. Lee S
    . Mistakes in validating the accuracy of a prediction classifier in high-dimensional but small-sample microarray data. Stat Methods Med Res 2008;17:635–42.
    OpenUrlAbstract/FREE Full Text
  46. 46.↵
    1. Leek JT,
    2. Scharpf RB,
    3. Bravo HC,
    4. Simcha D,
    5. Langmead B,
    6. Johnson WE,
    7. et al.
    Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet 2010;11:733–9.
    OpenUrlCrossRefPubMed
  47. 47.↵
    1. Jemal A,
    2. Siegel R,
    3. Ward E,
    4. Hao Y,
    5. Xu J,
    6. Murray T,
    7. et al.
    Cancer statistics, 2008. CA Cancer J Clin 2008;58:71–96.
    OpenUrlCrossRefPubMed
  48. 48.↵
    1. Peterson JK,
    2. Houghton PJ
    . Integrating pharmacology and in vivo cancer models in preclinical and clinical drug development. Eur J Cancer 2004;40:837–44.
    OpenUrlCrossRefPubMed
  49. 49.↵
    1. Voskoglou-Nomikos T,
    2. Pater JL,
    3. Seymour L
    . Clinical predictive value of the in vitro cell line, human xenograft, and mouse allograft preclinical cancer models. Clin Cancer Res 2003;9:4227–39.
    OpenUrlAbstract/FREE Full Text
  50. 50.↵
    1. Johnson JI,
    2. Decker S,
    3. Zaharevitz D,
    4. Rubinstein LV,
    5. Venditti JM,
    6. Schepartz S,
    7. et al.
    Relationships between drug activity in NCI preclinical in vitro and in vivo models and early clinical trials. Brit J Cancer 2001;84:1424–31.
    OpenUrlCrossRefPubMed
  51. 51.↵
    1. Parkinson H,
    2. Sarkans U,
    3. Kolesnikov N,
    4. Abeygunawardena N,
    5. Burdett T,
    6. Dylag M,
    7. et al.
    ArrayExpress update-an archive of microarray and high-throughput sequencing-based functional genomics experiments. Nucleic Acids Research 2011;39:D1002–D4.
    OpenUrlAbstract/FREE Full Text
  52. 52.↵
    1. Bamford S,
    2. Dawson E,
    3. Forbes S,
    4. Clements J,
    5. Pettett R,
    6. Dogan A,
    7. et al.
    The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer 2004;91:355–8.
    OpenUrlCrossRefPubMed
View Abstract
PreviousNext
Back to top
Cancer Research: 71 (24)
December 2011
Volume 71, Issue 24
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover

Sign up for alerts

View this article with LENS

Open full page PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for sharing this Cancer Research article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
A Framework to Select Clinically Relevant Cancer Cell Lines for Investigation by Establishing Their Molecular Similarity with Primary Human Cancers
(Your Name) has forwarded a page to you from Cancer Research
(Your Name) thought you would be interested in this article in Cancer Research.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
A Framework to Select Clinically Relevant Cancer Cell Lines for Investigation by Establishing Their Molecular Similarity with Primary Human Cancers
Garrett M. Dancik, Yuanbin Ru, Charles R. Owens and Dan Theodorescu
Cancer Res December 15 2011 (71) (24) 7398-7409; DOI: 10.1158/0008-5472.CAN-11-2427

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
A Framework to Select Clinically Relevant Cancer Cell Lines for Investigation by Establishing Their Molecular Similarity with Primary Human Cancers
Garrett M. Dancik, Yuanbin Ru, Charles R. Owens and Dan Theodorescu
Cancer Res December 15 2011 (71) (24) 7398-7409; DOI: 10.1158/0008-5472.CAN-11-2427
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Disclosure of Potential Conflicts of Interest
    • Grant Support
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF
Advertisement

Related Articles

Cited By...

More in this TOC Section

  • FLT PET Imaging of Capecitabine and Trifluridine/Tipiracil
  • In Vivo Vascular Imaging for Head and Neck Cancer Diagnosis
  • SMYD5 Regulates Genome Stability
Show more Integrated Systems and Technologies
  • Home
  • Alerts
  • Feedback
  • Privacy Policy
Facebook  Twitter  LinkedIn  YouTube  RSS

Articles

  • Online First
  • Current Issue
  • Past Issues
  • Meeting Abstracts

Info for

  • Authors
  • Subscribers
  • Advertisers
  • Librarians
  • Reviewers

About Cancer Research

  • About the Journal
  • Editorial Board
  • Permissions
  • Submit a Manuscript
AACR logo

Copyright © 2020 by the American Association for Cancer Research.

Cancer Research Online ISSN: 1538-7445
Cancer Research Print ISSN: 0008-5472
Journal of Cancer Research ISSN: 0099-7013
American Journal of Cancer ISSN: 0099-7374

Advertisement