Skip to main content
  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

AACR logo

  • Register
  • Log in
  • My Cart
Advertisement

Main menu

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Search More

    Advanced Search

  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

User menu

  • Register
  • Log in
  • My Cart

Search

  • Advanced search
Cancer Research
Cancer Research
  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Search More

    Advanced Search

Convergence and Technologies

TCIApathfinder: An R Client for the Cancer Imaging Archive REST API

Pamela Russell, Kelly Fountain, Dulcy Wolverton and Debashis Ghosh
Pamela Russell
1Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, Colorado.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Pamela Russell
  • For correspondence: pamela.russell@ucdenver.edu
Kelly Fountain
2Department of Radiology, University of Colorado School of Medicine, Aurora, Colorado.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dulcy Wolverton
2Department of Radiology, University of Colorado School of Medicine, Aurora, Colorado.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Debashis Ghosh
1Department of Biostatistics and Informatics, Colorado School of Public Health, Aurora, Colorado.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
DOI: 10.1158/0008-5472.CAN-18-0678 Published August 2018
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

The Cancer Imaging Archive (TCIA) hosts publicly available deidentified medical images of cancer from over 25 body sites and over 30,000 patients. Over 400 published studies have utilized freely available TCIA images. Images and metadata are available for download through a web interface or a REST API. Here, we present TCIApathfinder, an R client for the TCIA REST API. TCIApathfinder wraps API access in user-friendly R functions that can be called interactively within an R session or easily incorporated into scripts. Functions are provided to explore the contents of the large database and to download image files. TCIApathfinder provides easy access to TCIA resources in the highly popular R programming environment. TCIApathfinder is freely available under the MIT license as a package on CRAN (https://cran.r-project.org/web/packages/TCIApathfinder/index.html) and from https://github.com/pamelarussell/TCIApathfinder.

Significance: These findings present a new tool, TCIApathfinder, the first client for The Cancer Imaging Archive (TCIA) for use in the highly popular R computing environment, that will dramatically lower the barrier of access to the valuable tools in TCIA. Cancer Res; 78(15); 4424–6. ©2018 AACR.

Introduction

The Cancer Imaging Archive

The Cancer Imaging Archive (TCIA) provides deidentified clinical images of cancer for use by the research community (1). Currently, TCIA includes images from over 25 cancer types and over 30,000 patients. TCIA also hosts supporting data related to the images, as well as analysis results from the research community based on TCIA data. TCIA is contributing to research efforts toward understanding the genomic basis of cancer by providing over 20 collections of clinical images from patients whose matched tumor genomic profiles are freely available from The Cancer Genome Atlas (2).

TCIA datasets, referred to as “collections,” typically represent sets of patients sharing a common disease. Descriptions of each collection are available from the TCIA website. A variety of imaging modalities are represented, such as magnetic resonance imaging, computed tomography, and positron emission tomography. The image files in TCIA conform to the widely adopted DICOM standard (3). For each patient, one or more image studies are included. A study may include one or more image series. In turn, an image series is a stack of two-dimensional images from a single run of an instrument. Patient names and birth dates have been deidentified; patient sex and age are provided. TCIA supports reproducibility through the use of Digital Object Identifiers to refer to subsets of data.

Radiomic analysis

TCIA represents a valuable resource for the field of radiomics. The process known as radiomics involves the conversion of digital medical images into higher dimensional data and the subsequent mining of these data (4). Hundreds of image features or more can be extracted from radiologic image analysis and associated with biological or clinical endpoints to develop diagnostic, prognostic, and predictive models (5). While this can be applied to many biomedical areas, oncologic applications are of particular interest.

Oncologic treatment failure is often attributed to heterogeneity within tumors at the phenotypic, physiologic, and genomic levels (6–9). A major aim of radiomics is to provide quantitative measurements of intra- and intertumoral heterogeneity, thereby individualizing treatment (10). Unlike tissue biopsy, which offers a small sample of the tumor for analysis, radiomics has the potential to evaluate the whole tumor in the native environment. Unfortunately, there are multiple challenges to the process of radiomics. The need for data and ability to share data has been cited as one of the largest hurdles to advancing the field of radiomics (5, 10). A related issue involves the high cost of data preprocessing and analysis, including initial segmentation of tumor volumes. Large centralized data repositories such as TCIA offer a solution. TCIA not only provides publicly available images for thousands of patients, but additional hosts processed data for a subset of studies (11), potentially leading to dramatic time savings or making radiomic analysis possible for research groups with limited resources.

Results

TCIApathfinder provides a powerful gateway to TCIA

TCIA provides programmatic access to its data through a REST API that features many endpoints and return object types (12). Programmers can use the API through any preferred method of sending HTTP requests and parsing the structured responses. Here we describe a novel R package, TCIApathfinder, which provides powerful access to the resources in TCIA without the need to understand or program against the TCIA REST API. TCIApathfinder wraps all API functionality in clearly documented, user-friendly R functions, empowering users to quickly and interactively explore the available resources in TCIA without the need to construct HTTP requests or parse responses. Information is returned in native R data formats such as lists and data frames that are familiar to casual R users, as opposed to JSON and other structured data formats that are returned by the API.

A typical use case of TCIApathfinder is to download all available images for a specific cancer type and imaging modality. TCIApathfinder allows users to interactively explore the available data using these and other filters, save all metadata to R data structures, and download the images to their local machine. Other use cases involve exploring the available data in TCIA. With one or a few commands, users can quickly list and slice the available data along many dimensions. Such analysis would require a substantial amount of programming against the API itself; the interaction with the API is abstracted away from users of TCIApathfinder.

Description of TCIApathfinder

TCIApathfinder is hosted by the Comprehensive R Archive Network (CRAN) from https://cran.r-project.org/web/packages/TCIApathfinder/index.html and the development version can be obtained from https://github.com/pamelarussell/TCIApathfinder. Package documentation is available as a PDF manual from CRAN, from within an R session using R's documentation system, or on GitHub. In order for the package to function correctly, an API key must be obtained from TCIA.

In TCIApathfinder, function calls are used to explore the available data in TCIA and to download image files to the local machine. Two functions download image files from TCIA. The remaining functions in the package are used to explore the available data in TCIA. Each exploratory function returns an object containing simplified summarized data, a parsed JSON response, and the raw API response. Details on all available functions are provided in Table 1.

View this table:
  • View inline
  • View popup
Table 1.

Available functions in TCIApathfinder

TCIApathfinder can be loaded into an active R session and used directly from the R console, or incorporated into R scripts. See Supplementary Video S1 for a demonstration of package installation and usage.

Supplementary Video S1

Instructional video for TCIApathfinder Supplementary Video S1

Discussion

TCIApathfinder makes the extensive resources in TCIA easily available and accessible in the highly popular R programming environment. Simple functions allow the full collection to be easily explored before patients are selected for analysis. Images and supporting data can be imported directly into R scripts for further analysis using packages such as oro.dicom (13) and other image analysis packages, or simply saved to the local machine for any type of downstream analysis. For patients also included in The Cancer Genome Atlas, tumor genomic data can be imported into R via the matched patient ID using the TCGAbiolinks package (14) and analyzed using the extensive tools in Bioconductor (15). Vignettes included with the package demonstrate TCIApathfinder usage as well as downstream radiomic analysis with other packages. TCIApathfinder will significantly lower the barrier for researchers to leverage the valuable resources in TCIA.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Acknowledgments

We thank Bernard Jones and Julio Carballido-Gamio for valuable conversations about medical image analysis and the DICOM standard. Financial support for D. Ghosh and P. Russell has been provided by the Grohne-Stapp Endowed Chair for Cancer Research (University of Colorado Cancer Center). This work has been supported by the Grohne-Stapp Endowed Chair for Cancer Research (University of Colorado Cancer Center).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Footnotes

  • Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).

  • Received March 9, 2018.
  • Revision received May 1, 2018.
  • Accepted June 1, 2018.
  • Published first June 5, 2018.
  • ©2018 American Association for Cancer Research.

References

  1. 1.↵
    1. Clark K,
    2. Vendt B,
    3. Smith K,
    4. Freymann J,
    5. Kirby J,
    6. Koppel P,
    7. et al.
    The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 2013;26:1045–57.
    OpenUrlCrossRefPubMed
  2. 2.↵
    1. Weinstein JN,
    2. Collisson EA,
    3. Mills GB,
    4. Shaw KRM,
    5. Ozenberger BA,
    6. et al.
    Cancer Genome Atlas Research Network, Weinstein JN, Collisson EA, Mills GB, Shaw KRM, Ozenberger BA, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet 2013;45:1113–20.
    OpenUrlCrossRefPubMed
  3. 3.↵
    1. Mustra M,
    2. Delac K,
    3. Grgic M
    . Overview of the DICOM standard. Proceedings of the 50th International Symposium ELMAR [Online]; 2008. Available from: http://ieeexplore.ieee.org/abstract/document/4747434/?reload=true.
  4. 4.↵
    1. Kumar V,
    2. Gu Y,
    3. Basu S,
    4. Berglund A,
    5. Eschrich SA,
    6. Schabath MB,
    7. et al.
    Radiomics: the process and the challenges. Magn Reson Imaging 2012;30:1234–48.
    OpenUrlCrossRefPubMed
  5. 5.↵
    1. Avanzo M,
    2. Stancanello J,
    3. El Naqa I
    . Beyond imaging: the promise of radiomics. Phys Med 2017;38:122–39.
    OpenUrl
  6. 6.↵
    1. Gerlinger M,
    2. Rowan AJ,
    3. Horswell S,
    4. Math M,
    5. Larkin J,
    6. Endesfelder D,
    7. et al.
    Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med 2012;366:883–92.
    OpenUrlCrossRefPubMed
  7. 7.↵
    1. Sequist LV,
    2. Waltman BA,
    3. Dias-Santagata D,
    4. Digumarthy S,
    5. Turke AB,
    6. Fidias P,
    7. et al.
    Genotypic and histological evolution of lung cancers acquiring resistance to EGFR inhibitors. Sci Transl Med 2011;3:75ra26.
    OpenUrlAbstract/FREE Full Text
  8. 8.↵
    1. Sottoriva A,
    2. Spiteri I,
    3. Piccirillo SGM,
    4. Touloumis A,
    5. Collins VP,
    6. Marioni JC,
    7. et al.
    Intratumor heterogeneity in human glioblastoma reflects cancer evolutionary dynamics. Proc Natl Acad Sci U S A 2013;110:4009–14.
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    1. Yachida S,
    2. Jones S,
    3. Bozic I,
    4. Antal T,
    5. Leary R,
    6. Fu B,
    7. et al.
    Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature 2010;467:1114–7.
    OpenUrlCrossRefPubMed
  10. 10.↵
    1. Gillies RJ,
    2. Kinahan PE,
    3. Hricak H
    . Radiomics: images are more than pictures, they are data. Radiology 2016;278:563–77.
    OpenUrlCrossRefPubMed
  11. 11.↵
    Cancer Imaging Archive. TCIA Analysis Results – TCIA DOIs – Cancer Imaging Archive Wiki [Online] [cited 2018 Apr 26]. Available from: https://wiki.cancerimagingarchive.net/display/DOI/TCIA+Analysis+Results.
  12. 12.↵
    Cancer Imaging Archive. TCIA Programmatic Interface (REST API) Usage Guide – The Cancer Imaging Archive (TCIA) Public Access – Cancer Imaging Archive Wiki [Online] [cited 2018 Apr 26]. Available from: https://wiki.cancerimagingarchive.net/display/Public/TCIA+Programmatic+Interface+%28REST+API%29+Usage+Guide.
  13. 13.↵
    1. Whitcher B,
    2. Schmid V,
    3. Thorton A
    . Working with the DICOM and NIfTI Data Standards in R. J Stat Softw 2011;44:1–29.
    OpenUrlCrossRef
  14. 14.↵
    1. Colaprico A,
    2. Silva TC,
    3. Olsen C,
    4. Garofano L,
    5. Cava C,
    6. Garolini D,
    7. et al.
    TCGAbiolinks: an R/bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res 2016;44:e71.
    OpenUrlCrossRefPubMed
  15. 15.↵
    1. Gentleman RC,
    2. Carey VJ,
    3. Bates DM,
    4. Bolstad B,
    5. Dettling M,
    6. Dudoit S,
    7. et al.
    Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004;5:R80.
    OpenUrlCrossRefPubMed
View Abstract
PreviousNext
Back to top
Cancer Research: 78 (15)
August 2018
Volume 78, Issue 15
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Editorial Board (PDF)

Sign up for alerts

View this article with LENS

Open full page PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for sharing this Cancer Research article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
TCIApathfinder: An R Client for the Cancer Imaging Archive REST API
(Your Name) has forwarded a page to you from Cancer Research
(Your Name) thought you would be interested in this article in Cancer Research.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
TCIApathfinder: An R Client for the Cancer Imaging Archive REST API
Pamela Russell, Kelly Fountain, Dulcy Wolverton and Debashis Ghosh
Cancer Res August 1 2018 (78) (15) 4424-4426; DOI: 10.1158/0008-5472.CAN-18-0678

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
TCIApathfinder: An R Client for the Cancer Imaging Archive REST API
Pamela Russell, Kelly Fountain, Dulcy Wolverton and Debashis Ghosh
Cancer Res August 1 2018 (78) (15) 4424-4426; DOI: 10.1158/0008-5472.CAN-18-0678
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Results
    • Discussion
    • Disclosure of Potential Conflicts of Interest
    • Acknowledgments
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF
Advertisement

Related Articles

Cited By...

More in this TOC Section

  • Redox Vulnerabilities in Melanoma Cells
  • Dynamics of Tumor–Vessel Interactions in a 3D Vessel Model
  • AML Risk Stratification Using Mathematical Models
Show more Convergence and Technologies
  • Home
  • Alerts
  • Feedback
  • Privacy Policy
Facebook  Twitter  LinkedIn  YouTube  RSS

Articles

  • Online First
  • Current Issue
  • Past Issues
  • Meeting Abstracts

Info for

  • Authors
  • Subscribers
  • Advertisers
  • Librarians
  • Reviewers

About Cancer Research

  • About the Journal
  • Editorial Board
  • Permissions
  • Submit a Manuscript
AACR logo

Copyright © 2020 by the American Association for Cancer Research.

Cancer Research Online ISSN: 1538-7445
Cancer Research Print ISSN: 0008-5472
Journal of Cancer Research ISSN: 0099-7013
American Journal of Cancer ISSN: 0099-7374

Advertisement