Mobile elements create structural variation: analysis of a complete human genome

  1. Jinchuan Xing1,
  2. Yuhua Zhang1,
  3. Kyudong Han2,
  4. Abdel Halim Salem3,
  5. Shurjo K Sen2,
  6. Chad D Huff1,
  7. Qiong Zhou1,
  8. Ewen F Kirkness4,
  9. Samuel Levy4,
  10. Mark A Batzer5 and
  11. Lynn B Jorde1,6
  1. 1 University of Utah;
  2. 2 Louisiana State University;
  3. 3 Suez Canal University;
  4. 4 J. Craig Venter Institute;
  5. 5 Lousiana State University
  1. 6 E-mail: lbj{at}genetics.utah.edu

Abstract

Structural variants (SVs) are common in the human genome. Because approximately half of the human genome consists of repetitive, transposable DNA sequences, it is plausible that these elements play an important role in generating SVs in humans. Sequencing of the diploid genome of one individual human (HuRef) affords us the opportunity to assess, for the first time, the impact of mobile elements on SVs in an individual in a thorough and unbiased fashion. In this study, we systematically evaluated more than 8,000 SVs to identify mobile element-associated SVs as small as 100 bp and specific to the HuRef genome. Combining computational and experimental analyses, we identified and validated 706 mobile element insertion events (including Alu, L1, SVA elements and non-classical insertions), which added more than 305 kb of new DNA sequence to the HuRef genome compared to the Human Genome Project (HGP) reference sequence. We also identified 140 mobile element-associated deletions, which removed ~126 kb of sequence from the HuRef genome. Overall, ~10% of the HuRef-specific indels that are larger than 100 bp are caused by mobile element-associated events. More than one-third of the insertion/deletion events occurred in genic regions, and new Alu insertions occurred in exons of three human genes. Based on the number of insertions and the estimated time to the most recent common ancestor of HuRef and the HGP reference genome, we estimated the Alu, L1, and SVA retrotransposition rates to be one in 21 births, 212 births, and 916 births, respectively. This study presents the first comprehensive analysis of mobile element-related structural variants in the complete DNA sequence of an individual and demonstrates that mobile elements play an important role in generating inter-individual structural variation.

Footnotes

    • Received February 1, 2009.
    • Accepted April 29, 2009.
ACCEPTED MANUSCRIPT

Preprint Server