Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to work out coverage?

    Hi everyone,

    I want to work out what coverage/depth of sequencing we have on our last RNA-seq run. We are working on a small genome, so Ideally we would multiplex to bring the cost down and do more experiments. Before working out if we can multiplex I need to work out how much coverage we have and then how much we need.

    Cheers, J

  • #2
    The issue of coverage in RNA-seq data (transcriptomes) is arguably more complicated than it is for whole genome shotgun sequencing or exome sequencing.

    In sequencing the genome, you want to be reasonably sure that you are covering the majority of the genome with sufficient depth to genotype and identify mutations at each position. Since each portion of the genome is present in approximately equal amounts (i.e. the target space is approximately uniform in representation) the amount of sequencing needed to achieve a particular level of coverage can be predicted.

    In transcriptomes, the situation is not as simply (IMHO). Although the size of the genome of the organism may be important and having a small genome may help help you there are other important factors to consider. For example, the dynamic range of gene expression levels. This varies from species to species and tissue to tissue. It is also highly dependent on library construction method. Is the library polyA+, if not how are ribosomal sequences removed? Was a library normalization method applied? etc. In some transcriptome libraries I have seen, a small number of highly expressed genes consumed a huge percentage of all reads. This was particularly pronounced in libraries created with riboMinus RNA compared to polyA+ RNA. Another issue is what you hope to get out of the data? You can use RNA-seq libraries to profile gene level expression, differential expression, identify mutations, RNA editing, gene fusions, alternative splicing, etc. Do you need to be sure to profile genes with very low expression levels (say down to 1 copy per cell or even less...)?

    One strategy is to think about which of these analysis options you are going to pursue. Then think about which would require the most depth to satisfy. Since you have some example data, see how well this data performs at increasing levels of depth for this task. That is, make saturation curves for the metric of interest.

    Several papers have described the relationship between library depth and output for several different applications of RNA-seq. The supplementary materials of the following paper describes some of these: ALEXA-seq (see Supplementary Text and Supplementary Figure 4)

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Non-Coding RNA Research and Technologies
      by seqadmin


      Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

      [Article Coming Soon!]...
      Today, 08:07 AM
    • seqadmin
      Recent Developments in Metagenomics
      by seqadmin





      Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
      09-23-2024, 06:35 AM
    • seqadmin
      Understanding Genetic Influence on Infectious Disease
      by seqadmin




      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
      09-09-2024, 10:59 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 10-02-2024, 04:51 AM
    0 responses
    14 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 10-01-2024, 07:10 AM
    0 responses
    24 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-30-2024, 08:33 AM
    1 response
    31 views
    0 likes
    Last Post EmiTom
    by EmiTom
     
    Started by seqadmin, 09-26-2024, 12:57 PM
    0 responses
    19 views
    0 likes
    Last Post seqadmin  
    Working...
    X