Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    If you have m samples, then the maximum possible coefficient of variance is achieved if a gene appears in only one samples and has zero counts in the (m-1) other samples. In these cases, you will see a dispersion value close to m. So, all these points in the top left of your plots are likely genes that appear in only one or very few samples and are absent in most other samples. So, if you look at a scatter plot of one sample versus another, most of these genes will just vanish in the bottom-left zero-zero corner. This is why you didnt see them there.

    I don't know what "cdsFilt" is, but removing these genes did make the plot more normal looking. However, note that if you zoomed in your previous plot into the region with mean > 10, it looks quite the same as the new plot. So your results shouldn't change much, i.e., the filtering might not even have been necessary.

    Comment


    • #17
      Hi Simon, by cdsFilt I mean I did the following:

      cdsFilt:

      rs <- rowSums ( counts ( cds ))
      use <- (rs > quantile(rs, 0.4))
      table(use)
      FALSE TRUE
      10,000 15,243
      #Proceed with only filtered data (15,243 genes)
      cdsFilt <- cds[ use, ]

      cdsFilt <- estimateDispersions( cdsFilt )

      #View dispersion estimates
      str( fitInfo(cdsFilt) )

      #Plot filtered dispersions
      plot(
      rowMeans( counts( cdsFilt, normalized=TRUE ) ),
      fitInfo(cdsFilt)$perGeneDispEsts,
      pch = '.', log="xy"
      )
      xg <- 10^seq( -.10, 10, length.out=300 )
      lines( xg, fitInfo(cds)$dispFun( xg ), col="red" )

      If I am going to use this data to test for differential expression and eQTLs, then I think removing the low expression transcripts, while not necessary, would be helpful, as it reduces multiple test correction.

      Comment


      • #18
        Can anyone please help in normalizing rna seq data using EDASeq??
        Thanks

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 05-14-2024, 07:03 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-10-2024, 06:35 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-09-2024, 02:46 PM
        0 responses
        53 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-07-2024, 06:57 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Working...
        X