Integrated Computational Pipeline for Single-Cell Genomic Profiling

Chorbadjiev, L., Kendall, J., Alexander, J., Zhygulin, V., Song, J., Wigler, M., Krasnitz, A. (May 2020) Integrated Computational Pipeline for Single-Cell Genomic Profiling. JCO Clinical Cancer Informatics, 4. pp. 464-471.

Abstract

Purpose: Copy-number profiling of multiple individual cells from sparse sequencing may be used to reveal a detailed picture of genomic heterogeneity and clonal organization in a tissue biopsy specimen. We sought to provide a comprehensive computational pipeline for single-cell genomics, to facilitate adoption of this molecular technology for basic and translational research. Materials and methods: The pipeline comprises software tools programmed in Python and in R and depends on Bowtie, HISAT2, Matplotlib, and Qt. It is installed and used with Anaconda. Results: Here we describe a complete pipeline for sparse single-cell genomic data, encompassing all steps of single-nucleus DNA copy-number profiling, from raw sequence processing to clonal structure analysis and visualization. For the latter, a specialized graphical user interface termed the single-cell genome viewer (SCGV) is provided. With applications to cancer diagnostics in mind, the SCGV allows for zooming and linkage to the University of California at Santa Cruz Genome Browser from each of the multiple integrated views of single-cell copy-number profiles. The latter can be organized by clonal substructure or by any of the associated metadata such as anatomic location and histologic characterization. Conclusion: The pipeline is available as open-source software for Linux and OS X. Its modular structure, extensive documentation, and ease of deployment using Anaconda facilitate its adoption by researchers and practitioners of single-cell genomics. With open-source availability and Massachusetts Institute of Technology licensing, it provides a basis for additional development by the cancer bioinformatics community.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > genomics and proteomics
bioinformatics > computational biology
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > spontaneous copy number variation
CSHL Authors:
Communities: CSHL labs > Krasnitz lab
CSHL labs > Wigler lab
CSHL Cancer Center Program > Cancer Genetics and Genomics Program
Depositing User: Adrian Gomez
Date: 20 May 2020
Date Deposited: 21 May 2020 17:19
Last Modified: 29 Jan 2024 20:47
PMCID: PMC7265781
Related URLs:
URI: https://repository.cshl.edu/id/eprint/39473

Actions (login required)

Administrator's edit/view item Administrator's edit/view item