Recovering rearranged cancer chromosomes from karyotype graphs

Aganezov, S., Zban, I., Aksenov, V., Alexeev, N., Schatz, M. C. (December 2019) Recovering rearranged cancer chromosomes from karyotype graphs. BMC Bioinformatics, 20 (Suppl). p. 641. ISSN 1471-2105 (Public Dataset)

[thumbnail of document(1).pdf] PDF
document(1).pdf - Published Version

Download (1MB)
URL: https://www.ncbi.nlm.nih.gov/pubmed/31842730
DOI: 10.1186/s12859-019-3208-4

Abstract

BACKGROUND: Many cancer genomes are extensively rearranged with highly aberrant chromosomal karyotypes. Structural and copy number variations in cancer genomes can be determined via abnormal mapping of sequenced reads to the reference genome. Recently it became possible to reconcile both of these types of large-scale variations into a karyotype graph representation of the rearranged cancer genomes. Such a representation, however, does not directly describe the linear and/or circular structure of the underlying rearranged cancer chromosomes, thus limiting possible analysis of cancer genomes somatic evolutionary process as well as functional genomic changes brought by the large-scale genome rearrangements. RESULTS: Here we address the aforementioned limitation by introducing a novel methodological framework for recovering rearranged cancer chromosomes from karyotype graphs. For a cancer karyotype graph we formulate an Eulerian Decomposition Problem (EDP) of finding a collection of linear and/or circular rearranged cancer chromosomes that are determined by the graph. We derive and prove computational complexities for several variations of the EDP. We then demonstrate that Eulerian decomposition of the cancer karyotype graphs is not always unique and present the Consistent Contig Covering Problem (CCCP) of recovering unambiguous cancer contigs from the cancer karyotype graph, and describe a novel algorithm CCR capable of solving CCCP in polynomial time. We apply CCR on a prostate cancer dataset and demonstrate that it is capable of consistently recovering large cancer contigs even when underlying cancer genomes are highly rearranged. CONCLUSIONS: CCR can recover rearranged cancer contigs from karyotype graphs thereby addressing existing limitation in inferring chromosomal structures of rearranged cancer genomes and advancing our understanding of both patient/cancer-specific as well as the overall genetic instability in cancer.

Item Type: Paper
Subjects: bioinformatics
diseases & disorders > cancer
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification
diseases & disorders
bioinformatics > genomics and proteomics > genetics & nucleic acid processing
bioinformatics > genomics and proteomics
diseases & disorders > neoplasms
bioinformatics > computational biology > algorithms
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > chromosome
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > chromosomes, structure and function > chromosome
bioinformatics > computational biology
CSHL Authors:
Communities: CSHL labs > Schatz lab
CSHL Cancer Center Program > Cancer Genetics and Genomics Program
Depositing User: Adrian Gomez
Date: 17 December 2019
Date Deposited: 09 Jan 2020 17:55
Last Modified: 01 Feb 2024 20:04
PMCID: PMC6915857
Related URLs:
Dataset ID:
  • Supplement: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3208-4
  • Datasets: https://github.com/aganezov/RCK-pub-data
URI: https://repository.cshl.edu/id/eprint/38884

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving