MAY 10, 2017 10:30 AM PDT

Methods to account for sequencing artifacts in large high-throughput RNA-Seq data

C.E. CREDITS: CEU | P.A.C.E. CE | Florida CE
Speakers
  • Research Fellow, Dana-Farber Cancer Institute
    Biography
      I am a Research Fellow in the Department of Biostatistics and Computational Biology at the Dana-Farber Cancer Institute and the Department of Biostatistics at the Harvard T.H. Chan School of Public Health, under the guidance of Professor John Quackenbush. Prior to joining Harvard, I was a National Science Foundation Graduate Research Fellow at the University of Maryland, College Park, where I received my Ph.D. in Applied Mathematics, Statistics and Scientific Computation.
      As a computer scientist and computational biologist, I am interested in developing computational methods for the analysis of high-throughput sequencing data. I also aim to release and support these methods as open-source software for the broader scientific community through Bioconductor and popular domain tools such as QIIME and Phyloseq. MetagenomeSeq, my most popular tool, is in the top 5% of all Bioconductor packages downloaded in the last year, with over 5,000 unique users. I am excited to leverage statistical and network methodologies to account for technological artifacts when identifying disease markers.

    Abstract:

    Although ultrahigh-throughput RNA sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-seq studies profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-seq data from hundreds or thousands of samples, often collected at multiple locations and from diverse tissues. We examine the effects of different preprocessing methods on downstream analyses. We find that the analysis of large RNA-seq data sets requires careful quality control and that one must account for sparsity arising from the heterogeneity intrinsic to multi-group studies. We motivate our results using the GTEx cohort and look at the differential pathways of cell lines relative to their progenitor tissues.

