MAY 10, 2017 10:30 AM PDT

Methods to account for sequencing artifacts in large high-throughput RNA-Seq data

C.E. Credits: CEU
Speaker
  • Research Fellow, Dana-Farber Cancer Institute
    Biography
      I am a Research Fellow in the Department of Biostatistics and Computational Biology at the Dana-Farber Cancer Institute and Department of Biostatistics at the Harvard TH Chan School of Public Health under the guidance of Professor John Quackenbush. Prior to joining Harvard I was a National Science Foundation Graduate Research Fellow at the University of Maryland, College Park where I received my Ph.D. in Applied Mathematics, Statistics and Scientific Computation.
      As a computer scientist and computational biologist, I am interested in developing computational methods for the analysis of high-throughput sequencing data. I also aim to implement and support these methods as open-source software for the broader scientific community through Bioconductor and popular domain tools such as QIIME and Phyloseq. MetagenomeSeq, my most popular tool, is in the top 5% of all Bioconductor packages downloaded in the last year, with over 5,000 unique users. I am excited to leverage statistical and network methodologies to account for technological artifacts when identifying disease markers.

    Abstract

    Although ultrahigh-throughput RNA-sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-seq studies profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-seq data from hundreds or thousands of samples, often collected at multiple locations and from diverse tissues. We examine the effects of different preprocessing methods on downstream analyses. We find that analysis of large RNA-seq data sets requires careful quality control and must account for sparsity arising from the heterogeneity intrinsic to multi-group studies. We motivate our results using the GTEx cohort and examine the pathways that differ between cell lines and their progenitor tissues.



    Specialty

    Cancer Diagnostics, Clinical Diagnostics, Cancer Research, Immunology, Cell Culture, Big Data, Bioinformatics, Tumor, Molecular Biology, Cancer, Genetics, Oncology, Earth Science, Gene Expression, University


