MAY 13, 2015 10:30 AM PDT

Population Scale Human Genome Analysis on the Cloud

Speakers
  • Peter White, Co-Founder, Chief Scientific Advisor, GenomeNext LLC; Assistant Professor of Pediatrics, Nationwide Children's Hospital
  • James Hirmas, Co-Founder, CEO, GenomeNext LLC

Abstract

Advanced sequencing technologies have made population-scale whole genome sequencing a possibility. However, current strategies for analyzing this data rely on parallelization approaches that have limited scalability, lack reproducibility, and are complex to implement, requiring substantial investment in specialized IT solutions. To overcome these challenges, we set out to develop a platform that fully automates all the components needed for both single-sample and large-scale genomic data analysis.

We developed a highly accurate and deterministic analysis solution, named Churchill, which fully automates the complex and computationally intensive process of alignment, post-alignment processing, and genotyping. Our parallelization strategy divides each analysis step across multiple compute instances, enabling whole genome analysis to be completed in under 90 minutes. In addition to rapid single-sample analysis, Churchill optimizes utilization of available compute resources and scales in a near-linear fashion.

Using Amazon Web Services (AWS) cloud computing resources, we developed a platform that enables population-scale genome analysis. To demonstrate this, we analyzed the 1000 Genomes Project dataset of 2,504 whole genome and exome sequenced individuals. Starting from raw FASTQ input data, we were able to fully automate the analysis process, ultimately performing multi-sample variant calling and generating population allele frequencies in seven days. Our approach demonstrates the feasibility of generating population allele frequencies specific to a given unified analysis approach, which is critical for accurately filtering datasets to discover rare pathogenic variants. Moreover, through use of on-demand cloud computing resources, our method represents a solution for the genomics computational bottleneck and will keep pace with the magnitude of data generated by population-scale sequencing.

Learning Objectives:
  1. Understanding the steps required to analyze human genome sequencing data, for both single-sample analysis and large-scale genomic studies
  2. Optimizing compute resources and leveraging cloud computing to resolve the bioinformatics bottleneck
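
As a rough illustration of two ideas mentioned in the abstract, the sketch below shows (a) one simple way a genome could be divided into regions so that work can be spread across compute instances, and (b) how a population allele frequency can be computed from per-individual genotype calls. This is not Churchill's actual implementation, which the abstract does not describe in detail. The function names, the fixed-size region scheme, and the genotype encoding (0 = reference allele, 1 = alternate allele, -1 = missing call) are all assumptions made for the example.

```python
from collections import Counter
from typing import Dict, List, Tuple


def split_into_regions(
    chrom_lengths: Dict[str, int], region_size: int
) -> List[Tuple[str, int, int]]:
    """Divide each chromosome into half-open [start, end) regions.

    Hypothetical helper: fixed-size windows are an assumption for this
    sketch, not the region scheme used by Churchill.
    """
    if region_size <= 0:
        raise ValueError("region_size must be positive")
    regions = []
    for chrom, length in chrom_lengths.items():
        for start in range(0, length, region_size):
            # The last window on a chromosome may be shorter than region_size.
            regions.append((chrom, start, min(start + region_size, length)))
    return regions


def allele_frequencies(genotypes: List[Tuple[int, int]]) -> Dict[int, float]:
    """Compute allele frequencies at one site from diploid genotype calls.

    Frequency = (copies of the allele) / (total called alleles).
    Assumed encoding: 0 = reference, 1 = alternate, -1 = missing (skipped).
    """
    counts = Counter(allele for gt in genotypes for allele in gt if allele >= 0)
    total = sum(counts.values())
    if total == 0:
        return {}  # no called alleles at this site
    return {allele: n / total for allele, n in counts.items()}


if __name__ == "__main__":
    # Toy chromosome lengths, not real human chromosome sizes.
    for region in split_into_regions({"chr1": 250, "chr2": 100}, 100):
        print(region)

    # Four individuals at one site; the last one has no call.
    print(allele_frequencies([(0, 0), (0, 1), (1, 1), (-1, -1)]))
```

Each region could then be handed to a separate worker or compute instance. A real pipeline would also need to handle reads that span region boundaries and merge the per-region results deterministically, which this sketch does not attempt.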


