Skip to main content
  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

AACR logo

  • Register
  • Log in
  • My Cart
Advertisement

Main menu

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • RSS Feeds
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Webinars
  • Search More

    Advanced Search

  • AACR Publications
    • Blood Cancer Discovery
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

User menu

  • Register
  • Log in
  • My Cart

Search

  • Advanced search
Cancer Research
Cancer Research
  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • COVID-19 & Cancer Resource Center
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
      • "Best of" Collection
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • Editors' Picks
    • OnlineFirst
    • Citations
    • Author/Keyword
    • RSS Feeds
    • My Alert Summary & Preferences
  • News
    • Cancer Discovery News
  • COVID-19
  • Webinars
  • Search More

    Advanced Search

Bioinformatics and Systems Biology

Abstract 2594: Optimizing the replication of cancer genomics workflows: case studies

Jerry Fowler, F. Anthony San Lucas, Smruthy Sivakumar, Aditya Deshpande, Humam Kadara and Paul A. Scheet
Jerry Fowler
1UT MD Anderson Cancer Center, Houston, TX;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
F. Anthony San Lucas
1UT MD Anderson Cancer Center, Houston, TX;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Smruthy Sivakumar
1UT MD Anderson Cancer Center, Houston, TX;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aditya Deshpande
2Weill Cornell Graduate School of Medical Sciences, New York City, NY;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Humam Kadara
3American University of Beirut, Houston, TX.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paul A. Scheet
1UT MD Anderson Cancer Center, Houston, TX;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
DOI: 10.1158/1538-7445.AM2017-2594 Published July 2017
  • Article
  • Info & Metrics
Loading
Proceedings: AACR Annual Meeting 2017; April 1-5, 2017; Washington, DC

Abstract

Reproducing results is a major issue in cancer biology, whose “work bench” is dynamic and complex, with frequently updated algorithms and software. The better to manage our work in this environment we have developed SyQADA, a System for Quality-Assured Data Analysis – a workflow automation system designed to simplify common sequential analysis processes on the same or different data. SyQADA manages many of the details of procedural bookkeeping involved in bioinformatics workflows: What samples are we using? Where are the raw data? Were all the samples processed? Did every job complete satisfactorily? Is there as much output as expected? Where are the input files for the next step? How long does a typical job take to run? Which program versions did we use? Can we easily compare these results with the output of a different version of a program, or with different input data?

Using SyQADA, we have found ourselves better able to reproduce results while at the same time reducing the human effort required to manage our upstream data analyses. Here, we briefly describe how our lung cancer studies have benefitted from the use of SyQADA.

To understand the effect of different variant callers for Ion Torrent deep sequencing data in a lung cancer genomics study, we created a work protocol that allowed us to compare the different sets of variants called on 34 distinct somatic DNA samples from 4 patients. This complex processing framework involved running multiple variant callers, annotating variants, filtering germline variants using quality control metrics, and collating results across samples and callers. With SyQADA, we were able to re-run individual processes changing parameters with trivial changes to our configuration, yielding improved output. We then applied that unmodified protocol to the 500 samples from 48 individuals in our study, and rapidly produced data from which we could perform biological analysis. We then applied the protocol to a study of pre-malignant lesions in 25 lung cancer patients. In both studies, our workflow allowed us to generate comparable results in a matter of hours rather than days.

SyQADA has been used by individuals with backgrounds ranging from expert programmer to Unix novice, to perform and repeat dozens of diverse analytical workflows. Projects to which SyQADA has been applied include allelic imbalance studies of TCGA samples for cancers of the breast, pancreas, lung, and colon, processing roughly 6000 samples through a dozen steps.

A zipfile containing the SyQADA executable source code, documentation, tutorial examples, and workflows used in our lab will be available.

Citation Format: Jerry Fowler, F. Anthony San Lucas, Smruthy Sivakumar, Aditya Deshpande, Humam Kadara, Paul A. Scheet. Optimizing the replication of cancer genomics workflows: case studies [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 2594. doi:10.1158/1538-7445.AM2017-2594

  • ©2017 American Association for Cancer Research.
Previous
Back to top
Cancer Research: 77 (13 Supplement)
July 2017
Volume 77, Issue 13 Supplement
  • Table of Contents

Sign up for alerts

Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for sharing this Cancer Research article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Abstract 2594: Optimizing the replication of cancer genomics workflows: case studies
(Your Name) has forwarded a page to you from Cancer Research
(Your Name) thought you would be interested in this article in Cancer Research.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
Abstract 2594: Optimizing the replication of cancer genomics workflows: case studies
Jerry Fowler, F. Anthony San Lucas, Smruthy Sivakumar, Aditya Deshpande, Humam Kadara and Paul A. Scheet
Cancer Res July 1 2017 (77) (13 Supplement) 2594; DOI: 10.1158/1538-7445.AM2017-2594

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
Abstract 2594: Optimizing the replication of cancer genomics workflows: case studies
Jerry Fowler, F. Anthony San Lucas, Smruthy Sivakumar, Aditya Deshpande, Humam Kadara and Paul A. Scheet
Cancer Res July 1 2017 (77) (13 Supplement) 2594; DOI: 10.1158/1538-7445.AM2017-2594
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
  • Info & Metrics
Advertisement

Related Articles

Cited By...

More in this TOC Section

Bioinformatics and Systems Biology

  • Abstract 4261: Pathways of metastatic bladder cancer from a longitudinal patient data set
  • Abstract 1287: Exploring somatic DNA structural alteration and aberrant genomic interactions in cancer through GenomePaint
  • Abstract LB-024: The effects of chromatin structure on variation in fragment size of cell free DNA
Show more Bioinformatics and Systems Biology

Poster Presentations - Proffered Abstracts

  • Abstract 4261: Pathways of metastatic bladder cancer from a longitudinal patient data set
  • Abstract 1287: Exploring somatic DNA structural alteration and aberrant genomic interactions in cancer through GenomePaint
  • Abstract LB-024: The effects of chromatin structure on variation in fragment size of cell free DNA
Show more Poster Presentations - Proffered Abstracts

Poster Presentations - Computational Tools and Databases

  • Abstract LB-107: EV-TRACK: transparent reporting and centralizing knowledge of extracellular vesicles to support the validation of extracellular vesicle biomarkers in cancer research
  • Abstract 2603: A patient driven cancer database to collect information, analyze data, and predict outcomes
  • Abstract 2607: The cBioPortal for Cancer Genomics: an open source platform for accessing and interpreting complex cancer genomics data in the era of precision medicine
Show more Poster Presentations - Computational Tools and Databases
  • Home
  • Alerts
  • Feedback
  • Privacy Policy
Facebook  Twitter  LinkedIn  YouTube  RSS

Articles

  • Online First
  • Current Issue
  • Past Issues
  • Meeting Abstracts

Info for

  • Authors
  • Subscribers
  • Advertisers
  • Librarians

About Cancer Research

  • About the Journal
  • Editorial Board
  • Permissions
  • Submit a Manuscript
AACR logo

Copyright © 2021 by the American Association for Cancer Research.

Cancer Research Online ISSN: 1538-7445
Cancer Research Print ISSN: 0008-5472
Journal of Cancer Research ISSN: 0099-7013
American Journal of Cancer ISSN: 0099-7374

Advertisement