Brief Communication
Published: 04 May 2015

CONSERTING: integrating copy-number analysis with structural-variation detection

Xiang Chen^1,2,
Pankaj Gupta^1,2,
Jianmin Wang^2,3,4,
Joy Nakitandwe^2,5,
Kathryn Roberts⁵,
James D Dalton⁵,
Matthew Parker^1,2,
Samir Patel⁵,
Linda Holmfeldt ORCID: orcid.org/0000-0003-4140-3423⁵,
Debbie Payne⁵,
John Easton^2,6,
Jing Ma^2,5,
Michael Rusch^1,2,
Gang Wu^1,2,
Aman Patel^1,2,
Suzanne J Baker^2,7,
Michael A Dyer^2,7,
Sheila Shurtleff^2,5,
Stephen Espy³,
Stanley Pounds⁸,
James R Downing^2,5,
David W Ellison^2,5,
Charles G Mullighan ORCID: orcid.org/0000-0002-1871-1850^2,5 &
…
Jinghui Zhang^1,2

Nature Methods volume 12, pages 527–530 (2015)Cite this article

7456 Accesses
39 Altmetric
Metrics details

Subjects

Abstract

We developed Copy Number Segmentation by Regression Tree in Next Generation Sequencing (CONSERTING), an algorithm for detecting somatic copy-number alteration (CNA) using whole-genome sequencing (WGS) data. CONSERTING performs iterative analysis of segmentation on the basis of changes in read depth and the detection of localized structural variations, with high accuracy and sensitivity. Analysis of 43 cancer genomes from both pediatric and adult patients revealed novel oncogenic CNAs, complex rearrangements and subclonal CNAs missed by alternative approaches.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Strategy for CNA detection used by CONSERTING.**

**Figure 2: Comparison of WGS CNA detection by CONSERTING and four other methods.**

**Figure 3: A complex rearrangement in a sample of pediatric low-grade glioma identified by CONSERTING.**

CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths

Article Open access 26 June 2020

Ultrafast prediction of somatic structural variations by filtering out reads matched to pan-genome k-mer sets

Article 19 December 2022

Signatures of copy number alterations in human cancer

Article Open access 15 June 2022

References

Mullighan, C.G. et al. N. Engl. J. Med. 360, 470–480 (2009).
Article CAS Google Scholar
Ley, T.J. et al. Nature 456, 66–72 (2008).
Article CAS Google Scholar
Chiang, D.Y. et al. Nat. Methods 6, 99–103 (2009).
Article CAS Google Scholar
Xie, C. & Tammi, M.T. BMC Bioinformatics 10, 80 (2009).
Article Google Scholar
Boeva, V. et al. Bioinformatics 27, 268–269 (2011).
Article CAS Google Scholar
Abyzov, A., Urban, A.E., Snyder, M. & Gerstein, M. Genome Res. 21, 974–984 (2011).
Article CAS Google Scholar
Xi, R. et al. Proc. Natl. Acad. Sci. USA 108, E1128–E1136 (2011).
Article CAS Google Scholar
Downing, J.R. et al. Nat. Genet. 44, 619–622 (2012).
Article CAS Google Scholar
Zhang, J. et al. Nature 481, 157–163 (2012).
CAS PubMed PubMed Central Google Scholar
Roberts, K.G. et al. Cancer Cell 22, 153–166 (2012).
Article CAS Google Scholar
Zhang, J. et al. Nature 481, 329–334 (2012).
Article CAS Google Scholar
Zhang, J. et al. Nat. Genet. 45, 602–612 (2013).
CAS PubMed PubMed Central Google Scholar
Brennan, C.W. et al. Cell 155, 462–477 (2013).
Article CAS Google Scholar
Pleasance, E.D. et al. Nature 463, 191–196 (2010).
Article CAS Google Scholar
Wang, J. et al. Nat. Methods 8, 652–654 (2011).
Article CAS Google Scholar
Stephens, P.J. et al. Cell 144, 27–40 (2011).
Article CAS Google Scholar
Sanborn, J.Z. et al. Cancer Res. 73, 6036–6045 (2013).
Article CAS Google Scholar
Handsaker, R.E., Korn, J.M., Nemesh, J. & McCarroll, S.A. Nat. Genet. 43, 269–276 (2011).
Article CAS Google Scholar
Parker, M. et al. Nature 506, 451–455 (2014).
Article CAS Google Scholar
Wu, G. et al. Nat. Genet. 46, 444–450 (2014).
Article CAS Google Scholar
Li, H. et al. Bioinformatics 25, 2078–2079 (2009).
Article Google Scholar
Edmonson, M.N. et al. Bioinformatics 27, 865–866 (2011).
Article CAS Google Scholar
Breiman, L., Friedman, J.M., Olshen, R. & Stone, C. Classification and Regression Trees edn. 1 (Chapman and Hall/CRC, 1984).
Schwarz, G. Ann. Stat. 6, 461–464 (1978).
Article Google Scholar
Kent, W.J. et al. Genome Res. 12, 996–1006 (2002).
Article CAS Google Scholar
Rozen, S. & Skaletsky, H. Methods Mol. Biol. 132, 365–386 (2000).
CAS PubMed Google Scholar

Download references

Acknowledgements

This study was supported by the St. Jude Children's Research Hospital–Washington University Pediatric Cancer Genome Project, Cancer Center support grant P30 CA021765 from the US National Cancer Institute and the American Lebanese Syrian Associated Charities of St. Jude Children's Research Hospital. C.G.M. is supported as a Pew Scholar in the Biomedical Sciences and is a St. Baldrick's Scholar.

Author information

Authors and Affiliations

Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
Xiang Chen, Pankaj Gupta, Matthew Parker, Michael Rusch, Gang Wu, Aman Patel & Jinghui Zhang
Pediatric Cancer Genome Project, St. Jude Children's Research Hospital and Washington University School of Medicine, Memphis, Tennessee, USA
Xiang Chen, Pankaj Gupta, Jianmin Wang, Joy Nakitandwe, Matthew Parker, John Easton, Jing Ma, Michael Rusch, Gang Wu, Aman Patel, Suzanne J Baker, Michael A Dyer, Sheila Shurtleff, James R Downing, David W Ellison, Charles G Mullighan & Jinghui Zhang
Department of Information Sciences, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
Jianmin Wang & Stephen Espy
Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, New York, USA
Jianmin Wang
Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
Joy Nakitandwe, Kathryn Roberts, James D Dalton, Samir Patel, Linda Holmfeldt, Debbie Payne, Jing Ma, Sheila Shurtleff, James R Downing, David W Ellison & Charles G Mullighan
Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
John Easton
Department of Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
Suzanne J Baker & Michael A Dyer
Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
Stanley Pounds

Authors

Xiang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Pankaj Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Jianmin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Joy Nakitandwe
View author publications
You can also search for this author in PubMed Google Scholar
Kathryn Roberts
View author publications
You can also search for this author in PubMed Google Scholar
James D Dalton
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Parker
View author publications
You can also search for this author in PubMed Google Scholar
Samir Patel
View author publications
You can also search for this author in PubMed Google Scholar
Linda Holmfeldt
View author publications
You can also search for this author in PubMed Google Scholar
Debbie Payne
View author publications
You can also search for this author in PubMed Google Scholar
John Easton
View author publications
You can also search for this author in PubMed Google Scholar
Jing Ma
View author publications
You can also search for this author in PubMed Google Scholar
Michael Rusch
View author publications
You can also search for this author in PubMed Google Scholar
Gang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Aman Patel
View author publications
You can also search for this author in PubMed Google Scholar
Suzanne J Baker
View author publications
You can also search for this author in PubMed Google Scholar
Michael A Dyer
View author publications
You can also search for this author in PubMed Google Scholar
Sheila Shurtleff
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Espy
View author publications
You can also search for this author in PubMed Google Scholar
Stanley Pounds
View author publications
You can also search for this author in PubMed Google Scholar
James R Downing
View author publications
You can also search for this author in PubMed Google Scholar
David W Ellison
View author publications
You can also search for this author in PubMed Google Scholar
Charles G Mullighan
View author publications
You can also search for this author in PubMed Google Scholar
Jinghui Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.C. and J.Z. conceived and designed the CONSERTING algorithm. X.C., P.G. and J.W. implemented the algorithm. J.Z., S.J.B., M.A.D., J.R.D., D.W.E. and C.G.M. designed the experiment. X.C., J.W., J.D.D., M.P., J.M., M.R., G.W., A.P., S.E., S. Pounds and J.Z. analyzed the data. K.R., J.D.D., S. Patel, L.H., D.P. and J.E performed validation and functional assays. J.N. and S.S. generated COLO-829 whole-genome sequencing data. X.C. and J.Z. wrote the manuscript.

Corresponding author

Correspondence to Jinghui Zhang.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 Overview of the CONSERTING process.

A parallelogram shows input or output files, and a rectangle defines an analytical process. A diamond defines the condition for a follow-up process.

Supplementary Figure 2 CIRCOS plots for CNAs derived from SNP array, CONSERTING, CNV-seq, SegSeq, FREEC and BIC-seq for the 20 paired tumor-normal whole-genome sequencing data sets presented in this paper (SegSeq was not run on SJTALL015 and SJLGG039, as these two samples were mapped to hg19, which is not compatible with SegSeq).

All CIRCOS plots are shown in the same order. (a) Legend for the CIRCOS plot. (b) CIRCOS plots for the 12 ETP-ALL cases. (c) CIRCOS plots for the four RB cases. (d) CIRCOS plots for two B-ALL samples from COG. (e) A CIRCOS plot for the non-ETP T-ALL sample SJTALL015. (f) A CIRCOS plot for the low-grade glioma sample SJLGG039 (arrows show amplifications identified by both CONSERTING and SNP array).

Supplementary Figure 3 Somatic CNAs computed by CONSERTING and BIC-seq for 22 paired whole-genome sequencing data sets from The Cancer Genome Atlas (TCGA) GBM project.

Each sample is displayed in a colored box with the BIC-seq results (denoted by B) shown at the top, the SNP array result (denoted by S, and downloaded from TCGA) at the middle and the CONSERTING results (denoted by C) at the bottom. TCGA sample I.D.s are at the left. Samples marked with an asterisk had lower than median F₁ scores for both CONSERTING and BIC-seq. Diagnosis and relapse sample pairs are shown in same color with different intensities. Pairs with underlined sample I.D.s had highly divergent CNA profiles from diagnosis to relapse.

Supplementary Figure 4 ROC curves for CONSERTING and BIC-seq in 11 ETP-ALL samples.

SJTALL008 was excluded from this analysis because there were no CNV calls in the curated SNP array result.

Supplementary Figure 5 CNA calls at chr1:164 -244 Mb of retinoblastoma tumor SJRB003 by SNP array, CONSERTING and four other methods.

(a) Global view of the CNA state in the 93-Mb region on 1q. The thin horizontal lines define the copy-neutral state (i.e., no copy-number variation). Blue blocks above the “neutral” line are the copy-number gains (amplifications) identified by each method, with the height of the block corresponding to the amplitude of the copy-number gain. (b) Detailed view of a 35-kb region showing that BIC-seq missed two breakpoints separating two CNV segments. Both breakpoints were confirmed experimentally by Sanger sequencing. The two CNV boundaries at chr1:231118034 and chr1:231123028 missed by BIC-seq are part of the complex rearrangement depicted in Supplementary Fig. 5c. Both were involved in interchromosomal translocations 60–70 Mb upstream, and the breakpoints were experimentally validated. (c) A complex rearrangement identified by CONSERTING at chr1:164-244 Mb. The top panel is an SV graph that connects the 12 SVs identified in this region. The black lines mark the breakpoints of the seven SVs detected only by CONSERTING, and the gray lines mark the breakpoints of the five SVs detected by both CONSERTING and CREST. The purple bar marks the boundaries of the CNA segment, with the amplitudes of CNAs marked at the bottom. The three colored dots mark the location of FISH probes used to validate the copy number and SVs. (d) PCR amplicon validation of CONSERTING-predicted SVs. Lane 68 (predicted amplicon size: 500 bp): chr1:231123028(-)|chr1:164744054(+); lane 69 (predicted amplicon size: 313 bp): chr1:166476222(+)|chr1:174088270(+); Lane 70 (predicted amplicon size: 343 bp): chr1:173901001(+)|chr1:233328796(+); Lane 71 (predicted amplicon size: 488 bp): chr1:230868636(-)|chr1:173478509(+); lane 72 (predicted amplicon size: 332 bp): chr1:241160552(+)|chr1:177362495(-); lane 73 (predicted amplicon size: 350 bp): chr1:177464060(+)|chr1:224896416(+); lane 74 (predicted amplicon size: 371 bp): chr1:236474589(+)|chr9:136321446(-). (e) FISH validation of the WGS predicted gain (scale bar, 10 µm; also refer to Supplementary Fig. 14 of Zhang, J. et al. Nature 481, 329–334 (2012)).

Supplementary Figure 6 Double-minute chromosomes identified in the TCGA-GBM data set.

The inner green and magenta lines connect intra- and interchromosomal SV breakpoints, respectively. The red and blue arcs represent amplification and deletion identified by CONSERTING, respectively. The thickness of the arc is proportional to the level of amplification or deletion. Samples marked with an asterisk (06-0152-01A, 06-0210-01A, 06-0211-01A, 06-0211-02A, 06-0648-01A,14-1402-01A, 14-1402-02A, 19-5960-01A and 27-1831-01A) had a chromothripsis-like CNA-SV profile.

Supplementary Figure 7 A novel intragenic NOTCH1 deletion resulting in expression of ICN T-ALL.

(a) The normalized read depth for part of NOTCH1 for normal germline (blue), tumor (red) and tumor-normal (gray), showing a deletion spanning exons 14–27 in the tumor sample. (b) A deletion in this region detected by CONSERTING but not by SNP array with two ends mapped within exons 14 and 27. (c) Sanger-sequencing chromatograms confirming the in-frame deletion in both genomic DNA and cDNA. (d) Western blot analysis of SJTALL015 with the intragenic deletion (∆) targeting the NOTCH1 HD domain, and the T cell lines MOLT3, HPBALL, DND41, PF382, TALL-1 and LOUCY and the murine fibroblast cell line GPE-86, either wild type (-) or harboring activating mutations (+). (e) NOTCH1 domain view showing that the intragenetic deletion removes several EGF domains, an LNR domain and the heterodimerization (HD) domain.

Supplementary Figure 8 Fractured genome in WGS data from SJRB002 and data from The Cancer Genome Atlas (TCGA) project.

(a) CNA plots of chromosomes 5 and 6 using the initial WGS data. Copy-number gain with log₂ ratio > 0.17 is marked in red, copy-number loss with log₂ ratio < –0.17 is marked in blue, and the remaining segments are marked in black. (b) CNA plot of the same sample based on the second WGS data set with no ‘fracture’. The 6p amplification is the only CNA that is replicated in the two WGS data sets (not detected by SNP array but validated by SKY mapping). (c) Genome-wide CNAs using the original WGS by all methods. (d) Genome-wide CNAs using the second 10x WGS by all methods. (e) Number of predicted somatic CNAs for TCGA-GBM samples with and without fractured genome by BIC-seq, CONSERTING and SNP array. (f) Three paired WGS data showing fractured genome. Each sample is displayed in a colored box with the BIC-seq results shown at the top, the SNP array result (downloaded from TCGA) at the middle and the CONSERTING results at the bottom. TCGA sample I.D.s are at the left. Gain is shown in red, and loss in blue.

Supplementary Figure 9 Distribution of quality score of a typical PCGP sample (SJTALL013) and that of a typical TCGA sample (TCGA-06-0145-01A).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, X., Gupta, P., Wang, J. et al. CONSERTING: integrating copy-number analysis with structural-variation detection. Nat Methods 12, 527–530 (2015). https://doi.org/10.1038/nmeth.3394

Download citation

Received: 16 July 2014
Accepted: 22 March 2015
Published: 04 May 2015
Issue Date: June 2015
DOI: https://doi.org/10.1038/nmeth.3394

CONSERTING: integrating copy-number analysis with structural-variation detection

Subjects

Abstract

Access options

Similar content being viewed by others

CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths

Ultrafast prediction of somatic structural variations by filtering out reads matched to pan-genome k-mer sets

Signatures of copy number alterations in human cancer

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Integrated supplementary information

Supplementary Figure 1 Overview of the CONSERTING process.

Supplementary Figure 3 Somatic CNAs computed by CONSERTING and BIC-seq for 22 paired whole-genome sequencing data sets from The Cancer Genome Atlas (TCGA) GBM project.

Supplementary Figure 4 ROC curves for CONSERTING and BIC-seq in 11 ETP-ALL samples.

Supplementary Figure 5 CNA calls at chr1:164 -244 Mb of retinoblastoma tumor SJRB003 by SNP array, CONSERTING and four other methods.

Supplementary Figure 6 Double-minute chromosomes identified in the TCGA-GBM data set.

Supplementary Figure 7 A novel intragenic NOTCH1 deletion resulting in expression of ICN T-ALL.

Supplementary Figure 8 Fractured genome in WGS data from SJRB002 and data from The Cancer Genome Atlas (TCGA) project.

Supplementary Figure 9 Distribution of quality score of a typical PCGP sample (SJTALL013) and that of a typical TCGA sample (TCGA-06-0145-01A).

Supplementary information

Supplementary Text and Figures

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Table 4

Supplementary Table 5

Supplementary Table 6

Supplementary Software

Rights and permissions

About this article

Cite this article

Search

Quick links

Subjects

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Integrated supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links