Over the past decade, the view of genomic structural variation (SV) has changed completely. SVs, previously considered rare events, are now recognized as the largest source of interindividual genetic variation, affecting more bases than single nucleotide polymorphisms, variable number tandem repeats and other small genetic variants. They have also been shown to play a role in phenotypic variation and in disease. In this review, the authors provide an introduction to SV; a short historical perspective on research into this source of genomic variation; a description of the types of structural variants and of how they may have arisen; and an overview of methods for detecting structural variants, focusing on the analysis of high-throughput sequencing data.