Whole genome characterization of sequence diversity of 15,220 Icelanders

Hákon Jónsson, patrick sulem, Birte Kehr, Snædís Kristmundsdóttir, Florian Zink, Eiríkur Hjartarson, Marteinn Hardarson, Kristjan Hjorleifsson, Hannes Eggertsson, Sigurjón Axel Guðjónsson, Lucas D. Ward, Gudny Arnadottir, Einar A. Helgason, Hannes Helgason, Arnaldur Gylfason, Aðalbjörg Jónasdóttir, Áslaug Jónasdóttir, Thorunn Rafnar, Soren Besenbacher, Michael L. FriggeSimon N. Stacey, Ólafur T. Magnússon, Unnur Þorsteinsdóttir, Gísli Másson, Augustine Kong, Bjarni Halldórsson, Agnar Helgason, Daniel Gudbjartsson, Kari Stefansson

Research output: Contribution to journalArticlepeer-review

Abstract

Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.
Original languageEnglish
Number of pages170115
JournalScientific data
Volume4
DOIs
Publication statusPublished - 21 Sept 2017

Other keywords

  • DNA sequencing
  • Genetic variation
  • Haplotypes
  • Rare variants
  • DNA-rannsóknir
  • Erfðabreytileiki
  • Erfðafræði

Fingerprint

Dive into the research topics of 'Whole genome characterization of sequence diversity of 15,220 Icelanders'. Together they form a unique fingerprint.

Cite this