Dear Colleagues
I am using the RNAseq pipeline: Tophat -> HTseq -> DEseq
I have RNAseq samples in triplicate (3x wt , 3x mutant)
I wanted to see which genes are switched ON (from no expression to expression) in the mutant.
Just by looking the amount of read counts on genes one can see the easy scenario
wt1 wt2 wt3 mutant1 mutant2 mutant3 DESeq log2FC
0 0 0 13 5 7 Inf
0 0 0 1240 1378 1200 Inf
my question arises when I see a small number of reads on the wt
for example
wt1 wt2 wt3 mutant1 mutant2 mutant3 DESeq log2FC
0 1 0 715 1024 920 11,3
1 1 1 2107 2997 2572 11,2
2 2 0 2660 3131 3472 11,1
0 1 0 747 801 642 11,1
2 2 2 3827 4187 4222 11
1 0 0 548 783 789 11
0 1 1 758 1813 1464 11
0 1 0 547 597 695 11
2 1 0 1618 2251 1709 11
Are these small number of read counts random? noise? background expression? bad aligned reads? or real expression?.
Which is the threshold to say that a gene is expressed ? (for example: 3 read counts in all 3 wt samples).
Thanks in advance for your insights.
I am using the RNAseq pipeline: Tophat -> HTseq -> DEseq
I have RNAseq samples in triplicate (3x wt , 3x mutant)
I wanted to see which genes are switched ON (from no expression to expression) in the mutant.
Just by looking the amount of read counts on genes one can see the easy scenario
wt1 wt2 wt3 mutant1 mutant2 mutant3 DESeq log2FC
0 0 0 13 5 7 Inf
0 0 0 1240 1378 1200 Inf
my question arises when I see a small number of reads on the wt
for example
wt1 wt2 wt3 mutant1 mutant2 mutant3 DESeq log2FC
0 1 0 715 1024 920 11,3
1 1 1 2107 2997 2572 11,2
2 2 0 2660 3131 3472 11,1
0 1 0 747 801 642 11,1
2 2 2 3827 4187 4222 11
1 0 0 548 783 789 11
0 1 1 758 1813 1464 11
0 1 0 547 597 695 11
2 1 0 1618 2251 1709 11
Are these small number of read counts random? noise? background expression? bad aligned reads? or real expression?.
Which is the threshold to say that a gene is expressed ? (for example: 3 read counts in all 3 wt samples).
Thanks in advance for your insights.
Comment