Using Galaxy to Perform Large‐Scale Interactive Data Analyses

Jennifer Hillman‐Jackson (1), Dave Clements (2), Daniel Blankenberg (1), James Taylor (2), Anton Nekrutenko (1), Galaxy Team (2)

(1) Penn State University, University Park, Pennsylvania; (2) Emory University, Atlanta, Georgia
Publication Name: Current Protocols in Bioinformatics
Unit Number: Unit 10.5
DOI: 10.1002/0471250953.bi1005s38
Online Posting Date: June 2012
Innovations in biomedical research technologies continue to provide experimental biologists with novel and increasingly large genomic and high‐throughput data resources to be analyzed. As creating and obtaining data has become easier, the key decision faced by many researchers is a practical one: where and how should an analysis be performed? Datasets are large, and the set‐up and use of analysis tools are riddled with complexities outside the scope of core research activities. The authors believe that Galaxy provides a powerful solution that simplifies data acquisition and analysis in an intuitive Web application, granting all researchers access to key informatics tools previously available only to computational specialists working in Unix‐based environments. We will demonstrate through a series of biomedically relevant protocols how Galaxy specifically brings together (1) data retrieval from public and private sources, for example, UCSC's Eukaryote and Microbial Genome Browsers; (2) custom tools (wrapped Unix functions, format standardization/conversions, interval operations); and (3) third‐party analysis tools. Curr. Protoc. Bioinform. 38:10.5.1‐10.5.47. © 2012 by John Wiley & Sons, Inc.

Keywords: Galaxy; comparative genomics; genomic alignments; Web application; genome variation

Table of Contents

  • Introduction
  • Basic Protocol 1: Finding Human Coding Exons with Highest SNP Density
  • Basic Protocol 2: Loading Data and Understanding Datatypes
  • Basic Protocol 3: Calling Peaks for ChIP‐seq Data
  • Basic Protocol 4: Compare Datasets Using Genomic Coordinates
  • Basic Protocol 5: Working with Multiple Sequence Alignments
  • Guidelines for Understanding Results
  • Commentary
  • Literature Cited
  • Figures
