MAY 09, 2018 6:00 AM PDT

Genome in a Bottle: Developing Benchmark Sets for Structural Variants

Abstract

The Genome in a Bottle Consortium (GIAB) has characterized an Ashkenazi trio from the Personal Genome Project (NIST Reference Material 8392) with 12 short-read, long-read, and linked-read sequencing and mapping methods. The datasets are public, without embargo, for analysis and methods development by the community. We have characterized ~3.7 million small variants, as well as reference calls for ~90% of the genome, with estimated error rates of ~2 false positives (FPs) and ~2 false negatives (FNs) per million variants. To extend this characterization to larger indels and structural variants (SVs), we collected analyses of variants >=20 bp from 33 bioinformatics methods and five technologies. Nineteen discovery and refinement methods produced sequence-resolved calls using local or global assembly or split reads, giving precise predictions of deletion breakpoints, inserted sequences, and complex changes. We designed an integration approach to address the challenges of comparing and evaluating large variants, which are frequently located in tandem repeats (>50% of all calls) and are often not precisely characterized. We have iteratively refined our integration process based on feedback on publicly released draft “straw man” callsets. Current work includes developing two-tiered high-confidence variant calls and a high-confidence BED file for benchmarking SVs, as well as a web app for crowd-sourced manual curation of SVs. These results represent a significant step in GIAB's work towards improved benchmarking of large variants in research and clinical settings.
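
To illustrate why comparing large variants across methods is hard, here is a minimal sketch of one generic way to group calls from different methods that may describe the same event. It is not the GIAB integration approach, which the abstract does not spell out. The `SVCall` record, the function names, and the `max_dist` and `min_size_ratio` thresholds are all hypothetical choices made for this example; only the 20 bp size cutoff comes from the abstract.

```python
from dataclasses import dataclass

MIN_SV_SIZE = 20  # size cutoff taken from the abstract (variants >= 20 bp)


@dataclass(frozen=True)
class SVCall:
    """Hypothetical minimal record for one large-variant call."""
    chrom: str
    pos: int      # approximate start position
    svtype: str   # e.g. "DEL" or "INS"
    size: int     # variant length in bp
    method: str   # name of the method that produced the call


def similar(a, b, max_dist=1000, min_size_ratio=0.7):
    """Return True if two calls plausibly describe the same event.

    The thresholds are assumptions for illustration. Imprecise
    breakpoints, common in tandem repeats, are the reason exact
    position matching is not used here.
    """
    if a.chrom != b.chrom or a.svtype != b.svtype:
        return False
    if abs(a.pos - b.pos) > max_dist:
        return False
    return min(a.size, b.size) / max(a.size, b.size) >= min_size_ratio


def cluster_calls(calls, **thresholds):
    """Greedily group calls, comparing each call to a cluster's first member."""
    kept = sorted(
        (c for c in calls if c.size >= MIN_SV_SIZE),
        key=lambda c: (c.chrom, c.svtype, c.pos),
    )
    clusters = []
    for call in kept:
        for cluster in clusters:
            if similar(cluster[0], call, **thresholds):
                cluster.append(call)
                break
        else:
            # No existing cluster matched, so start a new one.
            clusters.append([call])
    return clusters


def support(cluster):
    """Number of distinct methods that contributed a call to this cluster."""
    return len({c.method for c in cluster})
```

In a sketch like this, the number of distinct supporting methods per cluster is one possible input to a confidence tier. A real integration would also need to compare inserted sequences and handle calls whose representation shifts within repeats.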

