MAY 10, 2018 6:00 AM PDT

Differential Abundance Analysis for Microbial Marker-Gene Surveys

Speaker
  • Research Fellow, Dana-Farber Cancer Institute
    Biography
      I am a Research Fellow in the Department of Biostatistics and Computational Biology at the Dana-Farber Cancer Institute and the Department of Biostatistics at the Harvard T.H. Chan School of Public Health, under the guidance of Professor John Quackenbush. Prior to joining Harvard, I was a National Science Foundation Graduate Research Fellow at the University of Maryland, College Park, where I received my Ph.D. in Applied Mathematics, Statistics and Scientific Computation.
      As a computer scientist and computational biologist, I am interested in developing computational methods for the analysis of high-throughput sequencing data. I also aim to release and support these methods as open-source software for the broader scientific community through Bioconductor and popular domain tools such as QIIME and Phyloseq. MetagenomeSeq, my most popular tool, is in the top 5% of all Bioconductor packages downloaded in the last year, with over 5,000 unique users. I am excited to leverage statistical and network methodologies in accounting for technological when identifying disease markers.

    Abstract

    We introduce a differential abundance analysis method for sparse high-throughput data from large-scale surveys of marker genes for microbial communities. Our approach relies on cumulative sum scaling (CSS) normalization, a count data normalization technique, and on the zero-inflated Gaussian (ZIG) model as a statistical method for detecting differential abundance of taxonomic features. The ZIG differential abundance detection method accounts for bias introduced by the under-sampling of microbial communities commonly found in large-scale marker gene studies. We have implemented these methods in the publicly available metagenomeSeq Bioconductor package. In addition, we highlight the utility of the method in a large-scale study characterizing the diarrheal microbiome in young children from developing countries. Diarrhea is a major cause of mortality and morbidity in young children from developing countries, leading to as many as 15% of all deaths in children under 5 years of age. While many causes of this disease are already known, conventional diagnostic approaches fail to detect a pathogen in up to 60% of diarrheal cases. Using our novel methodology, Streptococci were found in our study to be statistically associated with diarrheal disease in general and with more severe forms (such as dysentery) in particular.
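
    To make the idea of cumulative sum scaling more concrete, here is a minimal Python sketch. It is not the metagenomeSeq implementation (which is an R package); the function name `css_normalize`, the fixed `quantile=0.5`, and the `scale=1000` constant are assumptions chosen for illustration. The sketch assumes a features-by-samples count matrix, and for each sample it divides the counts by the sum of all counts at or below a chosen quantile of that sample's nonzero counts.

```python
import numpy as np

def css_normalize(counts, quantile=0.5, scale=1000.0):
    """Illustrative cumulative sum scaling (CSS) of a count matrix.

    counts   : 2-D array-like, rows = taxonomic features, columns = samples.
    quantile : assumed fixed quantile of each sample's nonzero counts
               (a hypothetical default, not a data-driven choice).
    scale    : assumed constant that puts normalized counts on a
               convenient scale.
    Returns (normalized counts, per-sample scaling factors).
    """
    counts = np.asarray(counts, dtype=float)
    normalized = np.zeros_like(counts)
    factors = np.zeros(counts.shape[1])
    for j in range(counts.shape[1]):
        col = counts[:, j]
        nonzero = col[col > 0]
        if nonzero.size == 0:
            # A sample with no reads is left as all zeros.
            continue
        # Quantile is taken over the nonzero counts of this sample only.
        q = np.quantile(nonzero, quantile)
        # Scaling factor: sum of the counts that do not exceed the quantile.
        factors[j] = col[col <= q].sum()
        normalized[:, j] = col / (factors[j] / scale)
    return normalized, factors

# Toy example: 5 features (rows) measured in 2 samples (columns).
toy_counts = [[0, 2],
              [1, 4],
              [2, 0],
              [3, 6],
              [10, 8]]
toy_normalized, toy_factors = css_normalize(toy_counts)
```

    Because the scaling factor sums only the lower part of each sample's count distribution, a few very abundant features have less influence on it than they would on a total-sum scaling factor.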

