Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Pedrissimo
    Junior Member
    • Mar 2010
    • 4

    DEXSeq sig-regulated exons only high in one sample

    Hello,

    I am comparing two biological conditions each condition has 2 biological replicates. The design matrix is:

    samples = data.frame( + condition = c("high","high","low","low"), + replicate = c(1:2,1:2), + row.names = c("high1","high2","low1","low2"), + stringsAsFactors = TRUE, + check.names = FALSE + )

    After creating my exon count set, I make a BA plot and color all the significant hits "padjust<0.05" in red. The vast majority of my sig-regulated exons are higher in my high sample with almost none in the low sample. A plot is attached, FPKM=read counts.

    I have normalized my samples in a number of ways including the technique included with DEXSeq. I think my problem is that one pair replicates has a large variance and therefore the exons that are regulated in this direction are not detected?

    I would like to test/demonstrate this by plotting the variance for EACH biological sample against the mean. Can someone show me how this might be accomplished.


    TIA
    Last edited by Pedrissimo; 05-31-2012, 11:36 AM.
  • areyes
    Senior Member
    • Aug 2010
    • 165

    #2
    Hi,

    First of all... your plot looks really strange, why do you have negative FPKM/read count values?

    What did you give as input to DEXSeq? You should give it raw count values per exon
    can you send the commands that we use and your sessionInfo?

    Alejandro

    Comment

    • Pedrissimo
      Junior Member
      • Mar 2010
      • 4

      #3
      Hi Alejandro,

      Sorry for the lack of clarity. from the table generated below, I take all padj values < 0.05 and graph their log2 foldchanges in blue, the non-significants are in grey. Here are the commands I issued:

      annotationfile = file.path("hg19-chr.gtf")

      samples = data.frame( + condition = c("high","high","low","low"), + replicate = c(1:2,1:2), + row.names = c("high1","high2","low1","low2"), + stringsAsFactors = TRUE, + check.names = FALSE + )

      fullFilenames<- list.files("path to files",full.names=TRUE,pattern="counts.txt")

      ecs<- read.HTSeqCounts(countfiles = fullFilenames,design = samples,flattenedfile = annotationfile)

      ecs<- estimateSizeFactors(ecs)

      ecs<- estimateDispersions(ecs)

      ecs<- fitDispersionFunction(ecs)

      ecs<- estimatelog2FoldChanges(ecs)

      test<- testForDEU(ecs)

      res1<- DEUresultTable(test)



      > sessionInfo()
      R version 2.14.1 (2011-12-22)
      Platform: x86_64-unknown-linux-gnu (64-bit)

      locale:
      [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
      [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
      [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
      [7] LC_PAPER=C LC_NAME=C
      [9] LC_ADDRESS=C LC_TELEPHONE=C
      [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

      attached base packages:
      [1] stats graphics grDevices utils datasets methods base

      other attached packages:
      [1] multicore_0.1-7 DEXSeq_1.0.2 Biobase_2.14.0

      loaded via a namespace (and not attached):
      [1] hwriter_1.3 plyr_1.7.1 statmod_1.4.14 stringr_0.6 tools_2.14.1

      Comment

      • areyes
        Senior Member
        • Aug 2010
        • 165

        #4
        Hi again,

        The direction of the changes that you observe should not be affected by a higher variance in one of the biological replicates. What I still find strange are the negative count values in your MvsA plot. I would suggest to update to a newer version of DEXSeq, could you also send the output of
        Code:
        head(counts(ecs))
        and an updated MvsA plot as the one in the vignette?

        Alejandro

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        9 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        13 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        20 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...