Using Galaxy to Perform Large-Scale Interactive Data Analyses—An Update

Ostrovsky, A, Hillman-Jackson, J, Bouvier, D, Clements, D, Afgan, E, Blankenberg, D, Schatz, MC, Nekrutenko, A, Taylor, J, Team, TG, Lariviere, D (February 2021) Using Galaxy to Perform Large-Scale Interactive Data Analyses—An Update. Current Protocols, 1 (2). ISSN 2691-1299

DOI: 10.1002/cpz1.31


Modern biology continues to become increasingly computational. Datasets are becoming progressively larger, more complex, and more abundant. The computational savviness necessary to analyze these data creates an ongoing obstacle for experimental biologists. Galaxy ( provides access to computational biology tools in a web-based interface. It also provides access to major public biological data repositories, allowing private data to be combined with public datasets. Galaxy is hosted on high-capacity servers worldwide and is accessible for free, with an option to be installed locally. This article demonstrates how to employ Galaxy to perform biologically relevant analyses on publicly available datasets. These protocols use both standard and custom tools, serving as a tutorial and jumping-off point for more intensive and/or more specific analyses using Galaxy. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Finding human coding exons with highest SNP density. Basic Protocol 2: Calling peaks for ChIP-seq data. Basic Protocol 3: Compare datasets using genomic coordinates. Basic Protocol 4: Working with multiple alignments. Basic Protocol 5: Single cell RNA-seq.

Item Type: Paper
Subjects: bioinformatics > computational biology
CSHL Authors:
Communities: CSHL labs > Schatz lab
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 1 February 2021
Date Deposited: 07 May 2021 18:00
Last Modified: 07 May 2021 18:00

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving