MAY 10, 2017 10:30 AM PDT

Methods to account for sequencing artifacts in large high-throughput RNA-Seq data

C.E. Credits: CEU
Speaker
  • Research Fellow, Dana-Farber Cancer Institute
    Biography
      I am a Research Fellow in the Department of Biostatistics and Computational Biology at the Dana-Farber Cancer Institute and Department of Biostatistics at the Harvard TH Chan School of Public Health under the guidance of Professor John Quackenbush. Prior to joining Harvard I was a National Science Foundation Graduate Research Fellow at the University of Maryland, College Park where I received my Ph.D. in Applied Mathematics, Statistics and Scientific Computation.
      As a computer scientist and computational biologist, my interests are to develop computational methods for the analysis of high-throughput sequencing data. I also desire to develop software and support these methods as open-source software for the broader scientific community through Bioconductor and popular domain tools such as QIIME and Phyloseq. MetagenomeSeq, is my most popular tool developed and is in the top 5% of all Bioconductor packages downloaded in the last year with over 5,000 unique users. I am excited to leverage statistical and network methodologies in accounting for technological when identifying disease markers.

    Abstract

    Although ultrahigh-throughput RNA-sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-seq studies typically profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-seq data from hundreds or thousands of samples, often collected at multiple locations and from diverse tissues. We examine the effects of different preprocessing methods on downstream analyses. We find analysis of large RNA-seq data sets requires careful quality control and that one account for sparsity due to the heterogeneity intrinsic in multi-group studies. We motivate our results using the GTEx cohort and look at the differential pathways of cell lines from their progenitor tissues.


    Show Resources
    You May Also Like
    OCT 08, 2020 7:00 AM PDT
    C.E. CREDITS
    OCT 08, 2020 7:00 AM PDT
    DATE: October 8, 2020 TIME: 7:00am PDT, 10:00am EDT, 4:00pm CEST How often do you pipette in your cell culture lab every day? Usually, we do it so often that we tend stop thinking about ho...
    DEC 02, 2020 8:00 AM PST
    Add to Calendar Select one of the following: iCal Google Calendar Outlook Calendar Yahoo Calendar
    C.E. CREDITS
    DEC 02, 2020 8:00 AM PST
    Add to Calendar Select one of the following: iCal Google Calendar Outlook Calendar Yahoo Calendar
    DATE: December 2nd, 2020 TIME: 08:00am PDT, 11:00pm EDT Bioreactors and shakers are used to cultivate microorganisms, plant, insect, and mammalian cells in different volumes. Upscaling of pr...
    OCT 29, 2020 6:00 AM PDT
    C.E. CREDITS
    OCT 29, 2020 6:00 AM PDT
    Date: October 29, 2020 Time: 6:00am (PDT), 9:00am (EDT), Chronic inflammation can occur as a result of a combination of genetic predispositions and environmental factors. Epigenetic modifica...
    SEP 10, 2020 9:00 AM PDT
    C.E. CREDITS
    SEP 10, 2020 9:00 AM PDT
    Date: September 10, 2020 Time: 9:00am (PDT), 12:00pm (EDT) Osmolality testing is relevant throughout the entire bioprocessing workflow. As customers look to refine mAb and gene therapy workf...
    NOV 16, 2020 8:00 AM PST
    C.E. CREDITS
    NOV 16, 2020 8:00 AM PST
    Date: November 16, 2020 Time: 8:00am (PST), 11:00am (EST) CRISPR screening has become the prime discovery tool in modern biomedical research and drug discovery. At the same time, most screen...
    SEP 02, 2020 7:00 AM PDT
    C.E. CREDITS
    SEP 02, 2020 7:00 AM PDT
    DATE: September 2, 2020 TIME: 03:00pm PDT, 6:00pm EDT Spatial omics is an expanding collection of methods to examine biological molecules in their geographical context. By retaining the prec...
    Loading Comments...
    Show Resources
    Attendees
    • See more