Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Sealzad
    Junior Member
    • Feb 2018
    • 6

    Inconsistent RNAseq data in knockdown expt

    Hi,

    I am helping a colleague to conduct a differential expression analysis with RNAseq data but I have some concerns about the expression levels stated in the analysis. Based on the design of the experiment, my colleague states that protein A controls the stability of protein B; when prot A is reduced, prot B increases.

    In the benchwork, my colleague used an shRNA against prot A and prot B (independently) and saw a significant reduction in the expressions (for western); I believe the shRNA targets the mRNA levels of the protein. Basically, the bench work seems to be validated.

    She conducted the same experiment and sent the samples for RNA sequencing. Prior to the library preparation, the samples were subjected to rRNA depletion. When the datasets came back, I aligned them with STAR alignment, and processed them with Rsubread and DESeq2; I check the padj values for significance. I found two strange findings - (1) shRNA A was able to significantly reduce prot A, but prot B was also reduced slightly (not significantly though), and (2) shRNA B was not able to significantly reduce prot B.

    I checked the PCA plots and they seemed alright; consistent patterns and clear distinguishing features between batch and treatment. Checked counts via counts['gene',] and saw very similar numbers.

    Here are my questions - is it common to find an shRNA significantly reduce during benchwork, but RNAseq data not able to detect the difference? Is it then acceptable to take the results as it is, and use it for publication? Because our concern is that the reviewers will question "why would we accept the data when we used an shRNA, and not see significant reduction in the RNAseq datasets"? Would it now be mandatory for us to repeat the experiment to get the proper readouts? Is there a way for me to check in the genome browser (or any programs for that matter) to see where the RNAseq datasets have gone wrong? Usually RNA sequencing in companies does 30 million reads. Would 30 million reads be sufficient to encompass the whole library? Does the number of reads on a gene equate to the number of counts or is there an algorithm to convert the reads to counts? If they are equivalent, then wouldn't that mean requesting more reads (e.g. the standard 10million reads to 30million reads) be over-representing genes' counts?
  • kc3228
    Junior Member
    • May 2017
    • 6

    #2
    Did your colleague measure proteins A and B only via Western or also at the mRNA level? It could be that the effects of the shRNA are different on RNA than they are on the protein. Further, if the effects of protein A on protein B are on protein stability, you wouldn't expect to see changes in the RNA for protein B with the sh against A.

    Likewise, I presume your colleague measure expression levels on the actual experiment that was used for sequencing? If not, the may have just been problems with the knockdown of protein B in the sequencing experiment.

    As for your other specific questions, there are probably others who can give better answers. Reads are not the same as counts, and generally you're going to be looking at some sort of normalized data like CPM or RPKM to compare expression. In my experience 30 million reads is sufficient to look at differential gene expression for most cases, but it all depends on your sample and library types, the expression levels of your genes, and other factors. It doesn't sound to me like your problem is read depth.

    Comment

    • kc3228
      Junior Member
      • May 2017
      • 6

      #3
      Did your colleague measure proteins A and B only via Western or also at the mRNA level? It could be that the effects of the shRNA are different on RNA than they are on the protein. Further, if the effects of protein A on protein B are on protein stability, you wouldn't expect to see changes in the RNA for protein B with the sh against A.

      Likewise, I presume your colleague measure expression levels on the actual experiment that was used for sequencing? If not, the may have just been problems with the knockdown of protein B in the sequencing experiment.

      As for your other specific questions, there are probably others who can give better answers. Reads are not the same as counts, and generally you're going to be looking at some sort of normalized data like CPM or RPKM to compare expression. In my experience 30 million reads is sufficient to look at differential gene expression for most cases, but it all depends on your sample and library types, the expression levels of your genes, and other factors. It doesn't sound to me like your problem is read depth.

      Comment

      Latest Articles

      Collapse

      • GATTACAT
        Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by GATTACAT
        Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
        07-01-2026, 11:43 AM
      • SEQadmin2
        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by SEQadmin2


        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

        Here are nine questions we think about, in roughly the order they matter, before...
        06-18-2026, 07:11 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, 07-02-2026, 11:08 AM
      0 responses
      16 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-30-2026, 05:37 AM
      0 responses
      17 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-26-2026, 11:10 AM
      0 responses
      20 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-17-2026, 06:09 AM
      0 responses
      54 views
      0 reactions
      Last Post SEQadmin2  
      Working...